Mistral OCR 3

by Mistral AI

Self-Hosted, Proprietary

High-fidelity document understanding service with structured Markdown/HTML output, optimized for enterprise-scale document processing at $1-2 per 1,000 pages.

OCR, Layout Analysis, Table Extraction, Data Extraction

Overview

Mistral OCR 3 (mistral-ocr-2512) is Mistral AI's third-generation OCR service. It extracts text and embedded images from PDFs and image files while preserving document structure. Released in December 2025, it replaces the deprecated OCR v25.03 and OCR 2.

The model outputs LLM-ready Markdown and reconstructs tables as HTML (including complex structures with rowspan/colspan), making it well-suited for RAG pipelines and downstream parsing. It demonstrates a 74% win rate over Mistral OCR 2 on forms, scanned documents, complex tables, and handwriting.
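Because tables arrive as HTML with rowspan/colspan rather than as flat Markdown, a downstream parser usually has to expand merged cells before loading them into a dataframe or index. The sketch below shows one way to do that with only the Python standard library. The `TableGridParser` class, the `table_to_grid` helper, and the `SAMPLE_TABLE` fragment are hypothetical illustrations, not actual Mistral output or part of any Mistral SDK, and the code assumes the input holds a single, non-nested table.

```python
from html.parser import HTMLParser


class TableGridParser(HTMLParser):
    """Collect the cells of an HTML fragment that contains a single table."""

    def __init__(self):
        super().__init__()
        self.rows = []        # each row: list of (text, rowspan, colspan)
        self._cell = None     # text fragments of the cell currently open
        self._spans = (1, 1)  # (rowspan, colspan) of the cell currently open

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self.rows.append([])
        elif tag in ("td", "th"):
            if not self.rows:  # tolerate a cell that appears before any <tr>
                self.rows.append([])
            a = dict(attrs)
            self._spans = (int(a.get("rowspan") or 1), int(a.get("colspan") or 1))
            self._cell = []

    def handle_data(self, data):
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._cell is not None:
            text = " ".join("".join(self._cell).split())  # collapse whitespace
            self.rows[-1].append((text, *self._spans))
            self._cell = None


def table_to_grid(html):
    """Expand rowspan/colspan so every grid position holds its cell's text."""
    parser = TableGridParser()
    parser.feed(html)
    parser.close()

    grid = {}  # (row, col) -> text
    for r, row in enumerate(parser.rows):
        c = 0
        for text, rowspan, colspan in row:
            while (r, c) in grid:  # skip slots filled by a rowspan from above
                c += 1
            for dr in range(rowspan):
                for dc in range(colspan):
                    grid[(r + dr, c + dc)] = text
            c += colspan

    if not grid:
        return []
    n_rows = max(r for r, _ in grid) + 1
    n_cols = max(c for _, c in grid) + 1
    return [[grid.get((r, c), "") for c in range(n_cols)] for r in range(n_rows)]


# Hypothetical table fragment, written by hand for illustration.
SAMPLE_TABLE = """
<table>
<tr><th rowspan="2">Region</th><th colspan="2">Revenue</th></tr>
<tr><th>Q1</th><th>Q2</th></tr>
<tr><td>EMEA</td><td>10</td><td>12</td></tr>
</table>
"""

for row in table_to_grid(SAMPLE_TABLE):
    print(row)
```

Repeating the text of a merged cell in every position it covers keeps each row self-describing, which tends to help when rows are later chunked independently for retrieval.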

Available via API (model ID: mistral-ocr-2512) and the Document AI Playground in Mistral AI Studio. Pricing is $2 per 1,000 pages on the standard API and $1 per 1,000 pages via the Batch API. Self-hosting is available for organizations with strict data privacy requirements, though it typically requires an enterprise engagement.
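For budgeting, the two per-page rates quoted above translate into a simple estimate. This is a minimal sketch that assumes cost scales linearly with page count and covers basic OCR only; the function and constant names are hypothetical, and actual billing (rounding, minimums, annotation surcharges) may differ.

```python
# Rates taken from the pricing quoted above, in USD per 1,000 pages.
STANDARD_USD_PER_1000_PAGES = 2.0
BATCH_USD_PER_1000_PAGES = 1.0


def estimate_ocr_cost(pages, batch=False):
    """Estimate basic OCR cost in USD, assuming linear per-page pricing."""
    if pages < 0:
        raise ValueError("pages must be non-negative")
    rate = BATCH_USD_PER_1000_PAGES if batch else STANDARD_USD_PER_1000_PAGES
    return pages * rate / 1000


# Example: a 250,000-page archive.
print(estimate_ocr_cost(250_000))              # 500.0
print(estimate_ocr_cost(250_000, batch=True))  # 250.0
```

At these rates the Batch API halves the bill, so it is the natural choice for archival jobs that do not need results immediately.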

Strengths

  • Outputs LLM-ready Markdown and structured HTML tables (rowspan/colspan preserved)
  • Strong performance on cursive handwriting, forms, and low-quality scans
  • 74% win rate over Mistral OCR 2 on complex documents
  • Low per-page pricing ($2 per 1,000 pages standard, $1 per 1,000 pages via Batch API)
  • Extracts embedded images alongside text
  • Doc-as-prompt capabilities for structured JSON extraction

Limitations

  • Not open-source (closed weights, proprietary model)
  • Self-hosting requires enterprise engagement (not download-and-run)
  • SaaS-first: typical usage sends documents to Mistral's API
  • Structured extraction annotations cost more than basic OCR

Best Use Cases

  • RAG pipeline document parsing
  • Invoice, receipt, and form extraction
  • High-volume archival digitization
  • Handwritten and historical document processing
  • Enterprise document search and indexing