PaddlePaddle/PaddleOCR-VL-1.5

PaddleOCR-VL-1.5: Towards a Multi-Task 0.9B VLM for Robust In-the-Wild Document Parsing

Overview

Architecture: PaddleOCR-VL
Parameters: 959M
Tasks: Extract
Outputs: Text (Markdown)
License: apache-2.0
Languages: en, zh, multilingual

