opendatalab/MinerU2.5-Pro-2604-1.2B

Primitive: /extract · Extract · qwen2_vl

MinerU2.5-Pro: Pushing the Limits of Data-Centric Document Parsing at Scale

MultimodalMultilingualEntities

Overview

Hardware: — drives latency, throughput & cost

Cost is approximate — computed from list GPU prices; your actual price depends on the provider you deploy SIE with.

general ocr en

Document text extraction accuracy across arxiv math, old scans, multi-column layouts, and tables

Corpus: 1,403 Queries: 1,403

Quality

accuracy 0.5910