| Full name (vendor / model) | Dimensions | Max input tokens | API price per 1M tokens | Short name | Mean Hit Rate | Mean Reciprocal Rank (MRR) | License | Link | Notes |
| --- | ---: | ---: | ---: | --- | ---: | ---: | --- | --- | --- |
| Voyage voyage-3-large | 1024 | 32000 | 0.18 | voyage-3-large | 0.987689665 | 0.9363584311 | | https://blog.voyageai.com/2025/01/07/voyage-3-large/ | |
| Voyage voyage-multilingual-2 | 1024 | 32000 | 0.12 | voyage-multilingual-2 | 0.9736616089 | 0.9034449852 | | https://docs.voyageai.com/docs/embeddings | |
| Voyage voyage-3 | 1024 | 32000 | 0.06 | voyage-3 | 0.96535929 | 0.8944555778 | | https://blog.voyageai.com/2024/09/18/voyage-3/ | |
| Microsoft multilingual-e5-large | 1024 | 512 | | Microsoft-multilingual-e5-large | 0.9579158317 | 0.8850081115 | Open source | https://huggingface.co/intfloat/multilingual-e5-large | |
| BAAI bge-m3 | 1024 | 8192 | | BAAI-bge-m3 | 0.9561981105 | 0.8784282851 | Open source | https://huggingface.co/BAAI/bge-m3 | Beijing Academy of Artificial Intelligence (北京智源人工智能研究院) |
| Microsoft multilingual-e5-small | 384 | 512 | | Microsoft-multilingual-e5-small | 0.9550529631 | 0.8722922035 | Open source | https://huggingface.co/intfloat/multilingual-e5-small | |
| Microsoft multilingual-e5-base | 768 | 512 | | Microsoft-multilingual-e5-base | 0.9521900945 | 0.8694245634 | Open source | https://huggingface.co/intfloat/multilingual-e5-base | |
| Nomic Embed Text V2 | 768 | 512 | | nomic-embed-text-v2-moe | 0.9513312339 | 0.867363298 | Open source | https://www.nomic.ai/blog/posts/nomic-embed-text-v2 | https://huggingface.co/nomic-ai/nomic-embed-text-v2-moe |
| Voyage voyage-3-lite | 512 | 32000 | 0.02 | voyage-3-lite | 0.9484683653 | 0.8625441359 | | https://docs.voyageai.com/docs/embeddings | |
| Cohere embed-multilingual-v3.0 | 1024 | 512 | 0.1 | Cohere-multilingual-v3 | 0.9350128829 | 0.8540652734 | | https://docs.cohere.com/docs/cohere-embed | |
| Cohere embed-multilingual-light-v3.0 | 384 | 512 | | Cohere-multilingual-light-v3 | 0.9330088749 | 0.8422177689 | | https://docs.cohere.com/docs/cohere-embed | |
| JinaAI jina-embeddings-v3 | 1024 | 8192 | 0.1 | jina-embeddings-v3 | 0.9244202691 | 0.8255463308 | Open source (CC-BY-NC) | https://huggingface.co/jinaai/jina-embeddings-v3 | https://jina.ai/news/jina-embeddings-v3-a-frontier-multilingual-embedding-model/ |
| Google Vertex text-multilingual-embedding-002 | 768 | 2048 | 0.1 | Google-Vertex-multilingual-002 | 0.9215574005 | 0.8272258803 | | https://cloud.google.com/vertex-ai/generative-ai/docs/model-reference/text-embeddings-api | |
| infgrad/stella-base-zh-v2 | 768 | 1024 | | infgrad/stella-base-zh-v2 | 0.9189808188 | 0.8193959347 | Open source | https://huggingface.co/infgrad/stella-base-zh-v2 | Derived from the piccolo model by SenseTime Research (商汤科技研究院) |
| Chuxin-Embedding | 1024 | ? | | chuxin-llm/Chuxin-Embedding | 0.9166905239 | 0.8179740433 | Open source | https://huggingface.co/chuxin-llm/Chuxin-Embedding | |
| infgrad/stella-large-zh-v2 | 1024 | 1024 | | infgrad/stella-large-zh-v2 | 0.9161179502 | 0.8135270541 | Open source | https://huggingface.co/infgrad/stella-large-zh-v2 | |
| BAAI bge-base-zh-v1.5 | 768 | 512 | | BAAI-bge-base-zh-v1.5 | 0.9060979101 | 0.8034449852 | Open source | https://huggingface.co/BAAI/bge-base-zh-v1.5 | |
| BAAI bge-large-zh-v1.5 | 1024 | 512 | | BAAI-bge-large-zh-v1.5 | 0.9052390495 | 0.7999427426 | Open source | https://huggingface.co/BAAI/bge-large-zh-v1.5 | |
| OpenAI text-embedding-3-large | 3072 | 8191 | 0.13 | OpenAI-3-large | 0.9043801889 | 0.7895314438 | | https://platform.openai.com/docs/guides/embeddings/embedding-models | With the asynchronous Batch API the price is 0.07, roughly half. |
| OpenAI text-embedding-3-large (reduced to 1536d) | 1536 | 8191 | 0.13 | OpenAI-3-large-1536d | 0.9020898941 | 0.7821357 | | https://platform.openai.com/docs/guides/embeddings/embedding-models | |
| Voyage voyage-large-2-instruct | 1024 | 16000 | 0.12 | Voyage-large-2-instruct | 0.8975093043 | 0.7829897891 | | https://docs.voyageai.com/docs/embeddings | |
| NetEase Youdao (网易有道) bce-embedding-base_v1 | 768 | 512 | | NetEase-Youdao-bce-base_v1 | 0.8932150014 | 0.7780656551 | Open source | https://huggingface.co/maidalun1020/bce-embedding-base_v1 | |
| Intsig (合合信息) acge_text_embedding | 1792 | 1024 | | Intsig-acge_text_embedding | 0.8883481248 | 0.778256513 | Open source | https://huggingface.co/aspire/acge_text_embedding | |
| infgrad/stella-mrl-large-zh-v3.5-1792d | 1792 | 512 | | stella-mrl-large-zh-v3.5 | 0.8806183796 | 0.7773976524 | Open source | https://huggingface.co/infgrad/stella-mrl-large-zh-v3.5-1792d | |
| BAAI bge-small-zh-v1.5 | 512 | 512 | | BAAI-bge-small-zh-v1.5 | 0.8774692242 | 0.765955721 | Open source | https://huggingface.co/BAAI/bge-small-zh-v1.5 | |
| TWSC (台智雲) ffm-embedding | 1536 | 2048 | | TWSC-ffm | 0.8763240767 | 0.7585504342 | | https://tws.twcc.ai/service/embedding/ | |
| Alibaba gte-Qwen2-1.5B-instruct | 1536 | 32000 | | Alibaba-gte-Qwen2-1.5B | 0.8743200687 | 0.7307758374 | Open source | https://huggingface.co/Alibaba-NLP/gte-Qwen2-1.5B-instruct | Requires CUDA and uses a lot of GPU RAM; I ran it on a Google Colab GPU runtime. |
| JinaAI jina-embeddings-v2-base-zh | 768 | 8192 | 0.02 | Jina-v2-base-zh | 0.8734612081 | 0.7599627827 | Open source | https://huggingface.co/jinaai/jina-embeddings-v2-base-zh | |
| OpenAI text-embedding-3-small | 1536 | 8191 | 0.02 | OpenAI-3-small | 0.8683080447 | 0.7533352419 | | https://platform.openai.com/docs/guides/embeddings/embedding-models | Cheap, and the asynchronous Batch API cuts the price by another half. |
| Mistral mistral-embed | 1024 | 8192 | 0.1 | mistral-embed | 0.8571428571 | 0.7357047428 | | https://docs.mistral.ai/capabilities/embeddings/ | |
| OpenAI text-embedding-ada-002 | 1536 | 8191 | 0.1 | OpenAI-ada-002 | 0.8568565703 | 0.7433199733 | | https://platform.openai.com/docs/guides/embeddings/embedding-models | |
| Alibaba DAMO Academy gte-base-zh | 768 | 512 | | Alibaba-gte-base-zh | 0.8562839966 | 0.7308235519 | Open source | https://huggingface.co/thenlper/gte-base-zh | |
| Alibaba DAMO Academy gte-small-zh | 512 | 512 | | Alibaba-gte-small-zh | 0.8562839966 | 0.7302891497 | Open source | https://huggingface.co/thenlper/gte-small-zh | |
| DMetaSoul (数元灵) Dmeta-embedding-zh | 768 | 1024 | | DMetaSoul-Dmeta-embedding-zh | 0.8488405382 | 0.7247638133 | Open source | https://huggingface.co/DMetaSoul/Dmeta-embedding-zh | |
| Alibaba DAMO Academy gte-large-zh | 1024 | 512 | | Alibaba-gte-large-zh | 0.8348124821 | 0.7027197252 | Open source | https://huggingface.co/thenlper/gte-large-zh | |
| minishlab/m2v_multilingual_output | 256 | ? | | minishlab/m2v_multilingual_output | 0.6117950186 | 0.4740385533 | Open source | https://huggingface.co/minishlab/M2V_multilingual_output | |
| Cohere embed-english-v3.0 | 1024 | 512 | 0.1 | Cohere-en-v3 | 0.4901231033 | 0.3662086077 | | https://docs.cohere.com/docs/cohere-embed | |
| Nomic embed-text-v1.5 | 768 | 8192 | | Nomic-embed-text-v1.5 | 0.4821070713 | 0.3582116614 | Open source | https://huggingface.co/nomic-ai/nomic-embed-text-v1.5 | A truly fully open-source model: https://blog.nomic.ai/posts/nomic-embed-text-v1 |
| Cohere embed-english-light-v3.0 | 384 | 512 | 0.1 | Cohere-en-light-v3 | 0.4506155167 | 0.3348077107 | | https://docs.cohere.com/docs/cohere-embed | |
| minishlab/potion-base-32M | 512 | ? | | minishlab/potion-base-32M | 0.3472659605 | 0.2447704934 | Open source | https://github.com/MinishLab/model2vec | |
| SBERT all-MiniLM-L6-v2 | 384 | 128 | | all-MiniLM-L6-v2 | 0.2599484684 | 0.169100105 | Open source | https://huggingface.co/sentence-transformers/all-MiniLM-L6-v2 | This is the model used in the example code on the SentenceTransformers site (https://sbert.net/). Many tutorials use it as their example, but it is useless for Chinese. |
| Google Gemini text-embedding-004 | 768 | 2048 | 0.1 | Google-Gemini-004 | 0.06498711709 | 0.04142570856 | | https://ai.google.dev/gemini-api/docs/embeddings | Clearly not trained for Chinese. Sign up for GCP Vertex and use the multilingual text-multilingual-embedding-002 model instead. |
| Google gemini-embedding-exp-03-07 | 3072 | 8000 | ? | gemini-embedding-exp-03-07 | n/a | n/a | | https://developers.googleblog.com/en/gemini-embedding-text-model-now-available-gemini-api/ | Rate limit too low to finish the run; see https://ai.google.dev/gemini-api/docs/rate-limits#free-tier |
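
The Hit Rate and MRR columns above are averages over a set of evaluation queries. As a minimal sketch of how such averages are commonly computed (not necessarily the author's exact script), the Python below assumes each query has exactly one expected document and that only the top `k` retrieved results count. The function name `hit_rate_and_mrr`, the value of `k`, and the sample document IDs are all hypothetical and are not taken from the sheet.

```python
from typing import Sequence


def hit_rate_and_mrr(
    ranked_ids: Sequence[Sequence[str]],
    expected_ids: Sequence[str],
    k: int = 5,
) -> tuple[float, float]:
    """Return (mean hit rate, mean reciprocal rank) over all queries.

    ranked_ids[i] is the retrieval result for query i, best match first.
    expected_ids[i] is the single document that query i should retrieve.
    Assumption: only the top-k results are considered; a miss scores 0.
    """
    if len(ranked_ids) != len(expected_ids):
        raise ValueError("ranked_ids and expected_ids must have equal length")
    if not expected_ids:
        raise ValueError("at least one query is required")

    hits = 0
    reciprocal_rank_sum = 0.0
    for retrieved, expected in zip(ranked_ids, expected_ids):
        top_k = list(retrieved[:k])
        if expected in top_k:
            hits += 1
            # Rank is 1-based: first place contributes 1, second 1/2, ...
            reciprocal_rank_sum += 1.0 / (top_k.index(expected) + 1)

    n = len(expected_ids)
    return hits / n, reciprocal_rank_sum / n


# Hypothetical toy data: three queries, top-3 retrieval.
results = [["d1", "d2", "d3"], ["d9", "d4", "d7"], ["d5", "d6", "d8"]]
expected = ["d1", "d4", "d0"]
hit_rate, mrr = hit_rate_and_mrr(results, expected, k=3)
print(hit_rate, mrr)
```

In this toy run, two of the three queries find their expected document (at ranks 1 and 2), so the hit rate is 2/3 and the MRR is (1 + 1/2 + 0) / 3 = 0.5. A model can therefore have a high hit rate but a lower MRR when the right document tends to appear below first place, which is the gap visible between the two columns in the table.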
Author: ihower | Full write-up: https://ihower.tw/blog/archives/12167