mixedbread-ai/mxbai-edge-colbert-v0-32m (Score)
The crispy, lightweight ColBERT family from Mixedbread.
Overview
Benchmarks
AskUbuntuDupQuestions
Duplicate question detection from AskUbuntu
Corpus: 6,743 Queries: 360
Quality
ndcg at 10 0.5270
map at 10 0.3626
mrr at 10 0.6088
CMedQAv1-reranking
Quality
ndcg at 10 0.2974
map at 10 0.2385
mrr at 10 0.3151
CMedQAv2-reranking
Quality
ndcg at 10 0.3143
map at 10 0.2534
mrr at 10 0.3350
CQADupstackPhysicsRetrieval
Duplicate question retrieval from StackExchange Physics
Corpus: 38,314 Queries: 1,039
Quality
ndcg at 10 0.4085
map at 10 0.3530
mrr at 10 0.4184
CosQA
Code search with natural language queries
Corpus: 6,267 Queries: 500
Quality
ndcg at 10 0.2817
map at 10 0.2095
mrr at 10 0.2271
FiQA2018
Financial opinion mining and question answering
Corpus: 57,599 Queries: 648
Quality
ndcg at 10 0.3671
map at 10 0.2868
mrr at 10 0.4356
LegalBenchConsumerContractsQA
Question answering on consumer contracts
Corpus: 153 Queries: 396
Quality
ndcg at 10 0.6842
map at 10 0.6258
mrr at 10 0.6315
MMarcoReranking
Multilingual MARCO passage reranking (Chinese)
Quality
ndcg at 10 0.1659
map at 10 0.1394
mrr at 10 0.1394
NFCorpus
Biomedical literature search from NutritionFacts.org
Corpus: 3,593 Queries: 323
Quality
ndcg at 10 0.3570
map at 10 0.2609
mrr at 10 0.5680
SCIDOCS
Citation prediction, document classification, and recommendation for scientific papers
Corpus: 25,656 Queries: 1,000
Quality
ndcg at 10 0.1597
map at 10 0.0897
mrr at 10 0.2878
SciFact
Scientific claim verification using research literature
Corpus: 5,183 Queries: 300
Quality
ndcg at 10 0.7094
map at 10 0.6627
mrr at 10 0.6819
StackOverflowQA
Programming question answering from Stack Overflow
Corpus: 19,931 Queries: 1,994
Quality
ndcg at 10 0.5188
map at 10 0.4640
mrr at 10 0.4769
T2Reranking
Chinese passage ranking benchmark
Quality
ndcg at 10 0.6764
map at 10 0.4979
mrr at 10 0.7179