olmOCR 2

Overview

olmOCR 2 is an open OCR model from the Allen Institute for AI (Ai2), designed to extract text from scientific papers, technical documents, and research materials. It is a vision-language model fine-tuned from Qwen2.5-VL-7B-Instruct, combining visual understanding with strong text generation.

The model, at roughly 7-8B parameters, handles the complex layouts common in academic papers: multi-column text, inline equations, figures with captions, and bibliographies. olmOCR-2-8B scores 80.4 on olmOCR-Bench, placing it among the top PDF linearization systems.

Ai2 also created olmOCR-Bench, a benchmark widely used to evaluate PDF-to-text quality. In keeping with Ai2's commitment to open research, olmOCR is fully open source with permissive licensing.
