Document vision · OCR
Mistral OCR
Mistral's flagship document understanding — equations, tables, layout, multilingual. Via Mistral partnership.
Mistral OCR is Mistral AI’s document-understanding API — designed to read PDFs, screenshots, and image-heavy documents with state-of-the-art layout preservation, equation parsing, table reconstruction, and multilingual coverage.
Hosting model — Mistral partnership
The current generation (Mistral OCR 3 / mistral-ocr-2512) is proprietary and not open-weight — distributed via Mistral’s la Plateforme API. We surface it as a hosted endpoint on ScaLabs Cloud through a partnership with Mistral: same OpenAI-compatible interface as our other utility APIs, backend routed through Mistral’s infrastructure under a B2B agreement.
The earlier Mistral OCR-25-03 has weights available on request from Mistral. If your workload requires fully on-shore Kathmandu hosting (no data transit to Mistral’s infrastructure), we can host OCR-25-03 directly — ask us.
When to pick Mistral OCR over GLM-OCR
- Complex layouts. Multi-column academic papers, financial reports, forms with mixed text-and-table content. Mistral OCR reconstructs the reading order more reliably.
- Equations and tables. SOTA on LaTeX equation extraction and table structure preservation.
- You’ll tolerate a small upcharge. Mistral OCR is roughly 2× the price of GLM-OCR per page; the accuracy uplift on complex documents justifies it for many enterprise workloads.
When to pick GLM-OCR instead
- High volume, simple documents. Receipts, invoices, screenshots — GLM-OCR is half the price and accurate enough.
- Data-residency requirements that forbid transit to Mistral. GLM-OCR runs entirely on our Kathmandu hardware; Mistral OCR 3 transits through Mistral’s infrastructure (unless you arrange the 25-03 on-shore setup).
- Simple text extraction with no layout requirements.
Pricing
EUR 0.001 per page. Flat. Billed against your tenant card on file; no deposit required.
Limits
- Per-tenant rate limit: 30 pages per second
- Image size limit: 50 MB per page (PDFs split automatically)
- Supported formats: png, jpg, webp, pdf, tiff
Output formats
Same as GLM-OCR: raw text, markdown with preserved structure, JSON with a caller-supplied schema, or interleaved text + bounding boxes for audit.
Best for
- Complex documents — academic papers with equations, financial tables, multi-column layouts
- Document-heavy enterprise workflows that exceeded GLM-OCR's accuracy floor
- Workflows that need preserved layout fidelity (tables, lists, headings)
- Multilingual document corpora — strong on European + Asian scripts
Upstream source: mistral.ai/news/mistral-ocr
Request Mistral OCR access
Get an API key.
Straight pay-per-use against the published rate. No deposit, no minimums. Tell us what you're building and we'll send your API key and endpoint URL within one working day.