Why did we open-source our inference engine? Read the post

← Catalog

opendatalab/MinerU2.5-Pro-2604-1.2B

Open comparison →

Primitive: /extract · Extract · qwen2_vl

MinerU2.5-Pro: Pushing the Limits of Data-Centric Document Parsing at Scale

MultimodalMultilingualEntities

Overview

Hardware: — drives latency, throughput & cost

Size1.2B params
Tasks /extract
Licenseapache-2.0
Languageszh, en
Latency
Throughput
Cost /1M tok

Cost is approximate — computed from list GPU prices; your actual price depends on the provider you deploy SIE with.

Extraction

Output kindsEntities
Inputsimage
Max sequence length

Benchmarks

olmOCR-Bench

general ocr en

Document text extraction accuracy across arxiv math, old scans, multi-column layouts, and tables

Corpus: 1,403 Queries: 1,403
Quality
accuracy 0.5910
Reference →

Open source inference for agents

Open-source inference for the models behind your agents. Run it yourself, or let us run it for you.

Github 2.1K

Contact us

Tell us about your use case and we'll get back to you shortly.

Apply for an inference grant

Free capacity on our hosted cluster for selected projects. Tell us what you run and we reply by email.