
nomic-ai/nomic-embed-text-v2-moe

Architecture

Parameters: 137M
Tasks: Encode
Outputs: Dense
Dimensions: 768 (dense)
Max Sequence Length: 2,048 tokens
License:
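
Given the spec above (dense 768-dimensional embeddings, 2,048-token context), a minimal usage sketch with sentence-transformers might look like the following. The task prefixes (search_query: / search_document:) and trust_remote_code flag follow the convention documented for the Nomic embed family; verify exact prompt handling against the upstream model card.

```python
from sentence_transformers import SentenceTransformer

# Minimal sketch, assuming the sentence-transformers integration from the
# upstream model card; task prefixes follow the Nomic embed convention.
model = SentenceTransformer(
    "nomic-ai/nomic-embed-text-v2-moe",
    trust_remote_code=True,  # the model ships custom modeling code
)

docs = ["search_document: Mixture-of-experts layers route tokens to experts."]
queries = ["search_query: how does mixture-of-experts routing work?"]

doc_emb = model.encode(docs, normalize_embeddings=True)      # shape (1, 768)
query_emb = model.encode(queries, normalize_embeddings=True)

# Cosine similarity reduces to a dot product on normalized vectors.
scores = query_emb @ doc_emb.T
print(scores)
```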

Benchmarks

CQADupstackPhysicsRetrieval (scientific retrieval, English)

Performance (L4, b1, c16):
Corpus TPS: 13.0K
Corpus p50: 149.6 ms
Query TPS: 1.2K
Query p50: 143.2 ms

CosQA (technology retrieval, English)

Performance (L4, b1, c16):
Corpus TPS: 807
Corpus p50: 595.7 ms
Query TPS: 139
Query p50: 634.4 ms

NanoFiQA2018Retrieval (finance retrieval, English)

Quality:
NDCG@10: 0.5207
MAP@10: 0.4283
MRR@10: 0.5634

Performance (L4, b1, c16):
Corpus TPS: 20.1K
Corpus p50: 135.4 ms
Query TPS: 1.7K
Query p50: 119.2 ms
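
For reference, the quality numbers above are standard ranking metrics at cutoff 10. A self-contained sketch of how NDCG@10 and MRR@10 can be computed from a ranked list of relevance judgments (illustrative only; the actual benchmark harness may differ):

```python
import math

def ndcg_at_k(relevances, k=10):
    """NDCG@k for a ranked list of graded relevance labels."""
    dcg = sum(rel / math.log2(i + 2) for i, rel in enumerate(relevances[:k]))
    ideal = sorted(relevances, reverse=True)
    idcg = sum(rel / math.log2(i + 2) for i, rel in enumerate(ideal[:k]))
    return dcg / idcg if idcg > 0 else 0.0

def mrr_at_k(relevances, k=10):
    """Reciprocal rank of the first relevant result within the top k."""
    for i, rel in enumerate(relevances[:k]):
        if rel > 0:
            return 1.0 / (i + 1)
    return 0.0

# Hypothetical ranked relevance labels for one query (1 = relevant).
ranked = [0, 1, 0, 1, 0, 0, 0, 0, 0, 0]
print(ndcg_at_k(ranked), mrr_at_k(ranked))
```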

SCIDOCS (scientific retrieval, English)

Performance (L4, b1, c16):
Corpus TPS: 2.4K
Corpus p50: 1.3 s
Query TPS: 74
Query p50: 1.7 s

StackOverflowQA (technology retrieval, English)

Performance (L4, b1, c16):
Corpus TPS: 24.1K
Corpus p50: 145.6 ms
Query TPS: 33.4K
Query p50: 142.9 ms
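
The TPS and p50 figures above are throughput and median-latency measurements. A rough local sketch of how such numbers could be gathered, by timing repeated encode calls and taking the 50th percentile (the benchmark's actual hardware, batching, and concurrency handling are not described on this page):

```python
import time
import statistics

def bench(encode_fn, texts, iters=50):
    """Return (median latency in ms, texts/sec) for repeated encode calls."""
    latencies = []
    for _ in range(iters):
        start = time.perf_counter()
        encode_fn(texts)
        latencies.append(time.perf_counter() - start)
    p50 = statistics.median(latencies)
    tps = len(texts) / p50  # throughput implied by the median latency
    return p50 * 1000, tps

# Hypothetical usage, with `model` from the earlier sketch:
# p50_ms, tps = bench(lambda t: model.encode(t), ["search_query: example"] * 16)
# print(f"p50 {p50_ms:.1f} ms, {tps:.0f} texts/sec")
```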
