What DokuScan does.
No fixed document types. No rigid fields. You define what gets extracted.
Get started free
Document Processing
PDF, Word, Excel, CSV, TXT, XML and images are processed.
PDF and images
Digital PDFs are read directly. Scanned documents and photos are read via OCR, even poorly scanned originals.
Word, Excel, CSV, XML
Word documents (.docx), Excel files (.xlsx/.xls), CSV, TXT and XML are processed directly without OCR.
Multilingual
German, English, Polish, Spanish and French are processed.
Custom Templates
Create a template once — reuse it as often as you like.
Define your own fields
You specify which fields you need. The AI extracts exactly those fields from the document.
Smart Tables: AI detects columns automatically NEW
In Smart mode, the AI determines the column structure of a table itself, so you do not need to define row fields manually.
Reusable
Templates are saved and suggested on the next upload.
Export
Export data in a structured format — with the column names you defined.
Excel (.xlsx)
Open directly in Excel — columns match your field definitions.
CSV
Semicolon, comma or tab — you choose the delimiter.
JSON
For developers and API integrations.
XML NEW
Structured XML export for system integrations.
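To illustrate the delimiter choice for CSV export, here is a minimal sketch that writes the same extracted rows with a semicolon, comma or tab. The field names, the `ROWS` sample and the `to_csv` helper are hypothetical and only stand in for a user-defined template; they are not DokuScan's actual export code.

```python
import csv
import io

# Hypothetical extracted rows; the keys stand in for user-defined field names.
ROWS = [
    {"invoice_number": "2024-001", "total": "119,00"},
    {"invoice_number": "2024-002", "total": "59,50"},
]

# The three delimiter options named on this page.
DELIMITERS = {"semicolon": ";", "comma": ",", "tab": "\t"}


def to_csv(rows, delimiter_name):
    """Serialize a list of dicts to CSV text using the chosen delimiter."""
    buffer = io.StringIO()
    writer = csv.DictWriter(
        buffer,
        fieldnames=list(rows[0].keys()),  # column names follow the field definitions
        delimiter=DELIMITERS[delimiter_name],
        lineterminator="\n",
    )
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


if __name__ == "__main__":
    print(to_csv(ROWS, "semicolon"))
```

A semicolon is often the practical choice when values contain decimal commas, because such values then need no quoting.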
API & Webhooks NEW
Use DokuScan programmatically and integrate it into your own systems.
REST API
Start scans via API, retrieve templates and receive results in structured JSON format.
Webhooks
Automatic HTTP notifications when a scan completes or fails. HMAC-signed for security.
Manage API Keys
Up to 5 API keys per account. Create and revoke at any time — directly in your settings.
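On the receiving side, an HMAC-signed webhook is typically checked by recomputing the signature over the raw request body and comparing it with the one that was sent. The sketch below assumes HMAC-SHA256 with a hex-encoded signature; the hash algorithm, the encoding and the way the signature is transmitted are assumptions here, so check the DokuScan API documentation for the actual scheme.

```python
import hashlib
import hmac


def sign(secret: bytes, body: bytes) -> str:
    """Compute a hex-encoded HMAC-SHA256 signature (assumed scheme)."""
    return hmac.new(secret, body, hashlib.sha256).hexdigest()


def is_valid_signature(secret: bytes, body: bytes, received: str) -> bool:
    """Return True if the received signature matches the raw request body."""
    expected = sign(secret, body)
    # compare_digest runs in constant time, which avoids timing side channels.
    return hmac.compare_digest(expected.encode("utf-8"), received.encode("utf-8"))


if __name__ == "__main__":
    secret = b"example-webhook-secret"        # hypothetical secret
    body = b'{"status": "completed"}'         # hypothetical payload
    signature = sign(secret, body)
    print(is_valid_signature(secret, body, signature))
```

Verify against the raw bytes of the request body, not a re-serialized JSON object, since re-serializing can change whitespace or key order and break the signature.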
Batch Processing
Upload up to 25 documents at once.
Multi-upload
Upload up to 25 files at once — PDF, Word, Excel, JPG, PNG up to 20 MB.
PDF splitting
Multi-page PDFs are automatically split. Each page is processed as a separate scan.
Duplicate protection
Already uploaded files are automatically detected and skipped.
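How DokuScan detects duplicates is not described here. One common approach is to hash each file's content and skip files whose hash has been seen before. The sketch below shows that idea only; the `filter_duplicates` helper and the file names are hypothetical.

```python
import hashlib


def filter_duplicates(files, seen_hashes):
    """Split (name, content) pairs into accepted and skipped names.

    A file is skipped when the SHA-256 hash of its content is already in
    seen_hashes. New hashes are added to seen_hashes as files are accepted.
    """
    accepted, skipped = [], []
    for name, content in files:
        digest = hashlib.sha256(content).hexdigest()
        if digest in seen_hashes:
            skipped.append(name)
        else:
            seen_hashes.add(digest)
            accepted.append(name)
    return accepted, skipped


if __name__ == "__main__":
    seen = set()
    batch = [
        ("invoice.pdf", b"%PDF-1.7 sample A"),
        ("invoice_copy.pdf", b"%PDF-1.7 sample A"),  # same bytes, different name
        ("receipt.png", b"PNG sample B"),
    ]
    print(filter_duplicates(batch, seen))
```

Hashing the content rather than comparing file names means a renamed copy is still recognized, while two different files that share a name are both processed.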
Review & Correct
Review and correct extracted data before exporting.
Inline editing
All extracted fields can be edited directly — individual values and table rows.
Confidence display
Each scan shows how confident the extraction was. Low confidence = pay extra attention.
Document preview
Original document directly next to extracted fields — no switching back and forth.
Privacy & Security
GDPR compliant. German servers. No training on your data.
German servers
All data is stored in the EU — S3 Frankfurt.
No AI training
Your documents are not used to train AI models.
GDPR compliant
Complete data separation between accounts. Right to deletion at any time.
All features available immediately.
10 free scans. No subscription. No credit card.