Why did we open-source our inference engine? Read the post

Qwen/Qwen3-VL-Reranker-2B

The Qwen3-VL-Embedding and Qwen3-VL-Reranker model series are the latest additions to the Qwen family, built upon the recently open-sourced and powerful Qwen3-VL foundation model.

Overview

Architecture
qwen3_vl
Parameters
2.1B
Tasks
Score
Outputs
Score
Max Sequence Length
32,768 tokens
License
apache-2.0

Benchmarks

AskUbuntuDupQuestions

technology reranking en

Duplicate question detection from AskUbuntu

Corpus: 6,743 Queries: 360
Quality
ndcg at 10 0.6553
map at 10 0.5009
mrr at 10 0.7718
Performance L4 b1 c4
Reference →

Self-hosted inference for search & document processing

Cut API costs by 50x, boost quality with 85+ SOTA models, and keep your data in your own cloud.

Github 2.0K

Contact us

Tell us about your use case and we'll get back to you shortly.