HeadlinesBriefing favicon HeadlinesBriefing.com

Mistral OCR 4: 170‑Language, High‑Speed Document Engine

Hacker News •
×

Mistral OCR 4 launches as a compact, multilingual engine that returns text, bounding boxes, and typed‑block labels. The model covers 170 languages across ten language groups and runs in a single container, enabling self‑hosted deployments that keep data on‑premise. It targets enterprise search, RAG, and document‑centric pipelines for high‑volume, cost‑efficient processing, and strict compliance needs.

Benchmarking places OCR 4 ahead of competitors, scoring 85.20 on Olm OCRBench and winning human preference tests by an average of 72% over every system evaluated. Internal crawl tests confirm top performance across eight language groups, including rare scripts where many OCRs falter. These results stem from 600+ real‑world documents spanning finance, legal, and technical domains.

Cost efficiency shines: the API charges $4 per 1,000 pages, halved to $2 with batch requests, while the no‑code Document AI option runs at $5 per 1,000 pages. Aidan Donohue of Rogo noted OCR 4 matches top accuracy while cutting latency eightfold and costs by a factor of seventeen in high‑volume, compliant processing scenarios today.

With structured output—including bounding boxes, block types, and confidence scores—OCR 4 empowers downstream agents to act on documents, from form‑filling to compliance checks. Its single‑container design and open‑source toolkit integration make it a practical choice for firms prioritizing data sovereignty and rapid deployment, and enables developers to iterate quickly on custom ingestion pipelines without vendor lock‑in.