Migrate to SIE
If you’re already running embedding inference somewhere else, these guides give you the smallest path to running it on SIE. Working before/after code per provider, plus a mapping from every provider concept to its SIE counterpart.
Pick your starting point:
OpenAI Drop-in via OpenAI-compatible endpoint, or the native SIE SDK. Eliminate per-token cost and rate limits.
Cohere Replace embed-v3 and rerank-v3.5 with catalog embedding models and bge-reranker-v2-m3. Self-hosted, no rate limits.
TEI (HuggingFace) Replace N single-model TEI containers with one SIE cluster. Typed sparse and multivector outputs in one call.
Infinity Same OpenAI-compatible API, multi-model in one process, with managed deployment tooling.
Fastembed Move from in-process ONNX to out-of-process serving. Share GPU memory across app processes.
Modal Consolidate N Modal @app.function endpoints into one SIE deployment. Flat cost, no cold starts.
How to verify a migration
Section titled “How to verify a migration”Each guide ships before/after code in the page. Run both legs on a small corpus from your own domain, print the embeddings, and check they look sane. That is a sanity check, not sign-off.
For sign-off, run your own retrieval eval against both legs:
- Same checkpoint (Fastembed, TEI, Infinity, Modal-with-same-model). Cosine should sit at 0.999 or higher across items. If it doesn’t, the config differs (pooling, normalization, dtype); the guide’s caveats section calls out where.
- Different model (OpenAI → E5, Cohere → Stella or E5). Absolute cosine carries no signal across spaces. Run recall@k on a labeled set you trust, or a BEIR/MTEB slice that resembles your domain.
Choosing a target model
Section titled “Choosing a target model”| Source model | Closest SIE model | Re-embed |
|---|---|---|
text-embedding-3-small | intfloat/e5-base-v2 | yes |
text-embedding-3-large | Alibaba-NLP/gte-Qwen2-1.5B-instruct | yes |
embed-english-v3.0 (Cohere) | NovaSearch/stella_en_400M_v5 | yes |
rerank-v3.0 (Cohere) | BAAI/bge-reranker-v2-m3 | n/a |
TEI / Infinity / Fastembed bge-* | same checkpoint on SIE | no |
| sentence-transformers on Modal | same checkpoint on SIE | no |
Browse the full model catalog for everything SIE serves out of the box.
Need a guide that isn’t here?
Section titled “Need a guide that isn’t here?”Open an issue at
superlinked/sie,
or send a PR adding a new page to the migrate/ directory in
superlinked/sie-web.