Why did we open-source our inference engine? Read the post

lightonai/LightOnOCR-2-1B

📄 Paper | 📝 Blog | 🚀 Demo | 📊 Dataset | 📓 Finetuning

Overview

Architecture
LightOnOCR
Parameters
1.0B
Tasks
Extract
Outputs
Text (Markdown)
License
apache-2.0
Languages
en, fr, de, es, it, nl, pt, sv, da, zh, ja

Benchmarks

olmOCR-bench

general retrieval en

Performance L4 b1 c4

Self-hosted inference for search & document processing

Cut API costs by 50x, boost quality with 85+ SOTA models, and keep your data in your own cloud.

Github 1.9K

Contact us

Tell us about your use case and we'll get back to you shortly.