Why did we open-source our inference engine? Read the post

Alibaba-NLP/gte-reranker-modernbert-base

Architecture
Parameters
150M
Tasks
Score
Outputs
Score
Max Sequence Length
8,192 tokens
License

Benchmarks

AskUbuntuDupQuestions

technology reranking en

Quality
ndcg at 10 0.6701
map at 10 0.5148
mrr at 10 0.7570
Performance L4 b1 c16
Query TPS 6.2K
Query p50 41.9ms

CMedQAv1Reranking

medical reranking zh

Quality
map at 10 0.4989
mrr at 10 0.5905

CMedQAv2Reranking

medical reranking zh

Quality
map at 10 0.5024
mrr at 10 0.5880

MMarcoReranking

general reranking zh

Quality
map at 10 0.2271
mrr at 10 0.2373
Performance L4 b1 c16

T2Reranking

general reranking zh

Quality
map at 10 0.5537
mrr at 10 0.7882

Self-hosted inference for search & document processing

Cut API costs by 50x, boost quality with 85+ SOTA models, and keep your data in your own cloud.

Github
1.5K

Contact us

Tell us about your use case and we'll get back to you shortly.