naver-clova-ix/donut-base-finetuned-docvqa
Donut model fine-tuned on DocVQA. It was introduced in the paper OCR-free Document Understanding Transformer by Geewook Kim et al. and first released in this repository.
Overview
Benchmarks
DocVQA
Visual question answering on document images
Corpus: 5,188 Queries: 5,188
Quality
ANLS 0.6350
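For readers unfamiliar with the metric, ANLS (Average Normalized Levenshtein Similarity) can be sketched as follows. This is a minimal illustration, not the evaluation code behind the score above: the function names `levenshtein` and `anls` are hypothetical, the 0.5 threshold is the value commonly used for DocVQA, and the lowercasing and whitespace stripping are assumed normalization steps that this page does not specify.

```python
from typing import List, Sequence


def levenshtein(a: str, b: str) -> int:
    """Edit distance between two strings (insert, delete, substitute)."""
    if len(a) < len(b):
        a, b = b, a  # keep the shorter string in the inner loop
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(
                min(
                    previous[j] + 1,         # deletion
                    current[j - 1] + 1,      # insertion
                    previous[j - 1] + cost,  # substitution or match
                )
            )
        previous = current
    return previous[-1]


def anls(
    predictions: Sequence[str],
    references: Sequence[List[str]],
    threshold: float = 0.5,  # assumed: commonly used DocVQA threshold
) -> float:
    """Average, over questions, of the best similarity to any accepted answer."""
    if len(predictions) != len(references):
        raise ValueError("predictions and references must have the same length")
    if not predictions:
        return 0.0
    total = 0.0
    for pred, golds in zip(predictions, references):
        p = pred.strip().lower()  # assumed normalization
        best = 0.0
        for gold in golds:
            g = gold.strip().lower()
            longest = max(len(p), len(g))
            nl = levenshtein(p, g) / longest if longest else 0.0
            # Answers that are too far from the reference score zero.
            score = 1.0 - nl if nl < threshold else 0.0
            best = max(best, score)
        total += best
    return total / len(predictions)
```

Each question contributes a value between 0 and 1, so an exact match on every query would give 1.0, and a prediction whose normalized edit distance to every accepted answer reaches the threshold contributes 0.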
Performance L4-SPOT b1 c4
Performance L4 b1 c16