| Organization | Model Name | Parameters | Context | Type | License | Release | Notes | See Also | Properties |
|---|---|---|---|---|---|---|---|---|---|
| Skywork | Skywork-MoE | 22Ba146B | 8K | EN/CN | Lawful | 2024-06-03 | https://github.com/SkyworkAI/Skywork-MoE | | |
| LLM360 | K2 | 65B | 2K | General | Apache 2.0 | 2024-05-29 | https://www.llm360.ai/blog/several-new-releases-to-further-our-mission.html | | |
| IEIT-Yuan | Yuan2.0-M32 | 3.7Ba40B | 8K | | Unknown | 2024-05-28 | https://huggingface.co/IEITYuan/Yuan2-M32-hf | | |
| DeepSeek | DeepSeek-V2-Lite | 2.4Ba16B | 32K | EN/CN | Lawful | 2024-05-16 | https://huggingface.co/deepseek-ai/DeepSeek-V2-Lite | | |
| IBM | Granite | 3B, 8B, 20B, 34B | 2K-8K | Code | Apache 2.0 | 2024-05-07 | https://research.ibm.com/blog/granite-code-models-open-source | | |
| DeepSeek | DeepSeek-V2 | 21Ba236B | 128K | EN/CN | Lawful | 2024-05-06 | https://github.com/deepseek-ai/DeepSeek-V2 | | |
| Snowflake | Arctic | 17Ba408B | 4K | | Apache 2.0 | 2024-04-24 | https://www.snowflake.com/blog/arctic-open-efficient-foundation-language-models-snowflake/ | | |
| Microsoft | Phi-3 | 3.8B, 7B, 14B | 4-128K | | MIT | 2024-04-23 | https://news.microsoft.com/source/features/ai/the-phi-3-small-language-models-with-big-potential/ | | |
| Meta | Llama 3 | 8B, 70B, 405B | 8K | | <700M MAU | 2024-04-18 | https://ai.meta.com/blog/meta-llama-3/ | | |
| Mistral | Mixtral 8x22B | 35Ba141B | 64K | | Apache 2.0 | 2024-04-17 | https://mistral.ai/news/mixtral-8x22b/ | | |
| Hugging Face | Idefics2 | 8B | 32K | Vision | Apache 2.0 | 2024-04-15 | https://huggingface.co/blog/idefics2 | | |
| Cohere | Command-R+ | 104B | 128K | Multilingual | CC-BY-NC | 2024-04-04 | https://cohere.com/blog/command-r-plus-microsoft-azure | | |
| AI21 | Jamba | 12Ba52B | 256K | | Apache 2.0 | 2024-03-28 | https://www.ai21.com/blog/announcing-jamba | | |
| Databricks | DBRX | 36Ba132B | 32K | | Apache 2.0 | 2024-03-27 | https://www.databricks.com/blog/introducing-dbrx-new-state-art-open-llm | | |
| xAI | Grok-1 | 79Ba314B | 8K | | Apache 2.0 | 2024-03-17 | https://x.ai/blog/grok-os | | |
| Cohere | Command-R | 104B | 128K | Multilingual | CC-BY-NC | 2024-03-11 | https://cohere.com/blog/command-r | | |
| BigCode | StarCoder2 | 3B, 7B, 15B | 16K | Code | Apache 2.0 | 2024-02-28 | https://huggingface.co/blog/starcoder2 | | |
| Alibaba | Qwen 1.5 | 0.5B, 1.8B, 4B, 7B, 14B, 72B | 32K | | <100M MAU | 2024-02-04 | https://qwenlm.github.io/blog/qwen1.5/ | | |
| OpenBMB | MiniCPM | 2.7B | | | China | 2024-02-01 | https://openbmb.vercel.app/minicpm-en | | |
| Allen Institute | OLMo | 7B | 2K | | Apache 2.0 | 2024-02-01 | https://allenai.org/olmo | | |
| LLaVA | LLaVA-NeXT | 7B, 13B, 34B | 4K-32K | Vision | Various | 2024-01-30 | https://llava-vl.github.io/blog/2024-01-30-llava-next/ | | |
| BlinkDL | RWKV-5 | 7B | 4K+ | | Apache 2.0 | 2024-01-28 | https://twitter.com/BlinkDL_AI/status/1751542433039651304 | | |
| ORION STAR | Orion | 14B | 4K, 200K | Multilingual | Lawful/+Comm | 2024-01-22 | https://github.com/OrionStarAI/Orion | | |
| InternLM | InternLM2 | 7B, 20B | 200K | EN/CN | Lawful/+Comm | 2024-01-17 | https://github.com/InternLM/InternLM | | |
| Mistral | Mixtral 8x7B | 13Ba47B | | | Apache 2.0 | 2023-12-11 | https://mistral.ai/news/mixtral-of-experts/ | | |
| XVERSE | XVERSE | 7B, 13B, 65B, 65B-2 | 16K | EN/CN | Lawful/+Comm | 2023-12-08 | https://github.com/xverse-ai | | |
| DeepSeek | LLM | 7B, 67B | 4K | EN/CN | Lawful | 2023-11-29 | https://github.com/deepseek-ai/DeepSeek-LLM | | |
| DeepSeek | Coder | 1.3B, 6.7B, 33B | 16K | Code | MIT | 2023-11-03 | https://deepseekcoder.github.io/ | | |
| 01.ai | Yi | 6B, 34B | 4K | EN/CN | Apache 2.0 | 2023-11-02 | https://github.com/01-ai/Yi | | |
| Mistral | Mistral | 7B | 4K | General | Apache 2.0 | 2023-09-27 | https://mistral.ai/news/announcing-mistral-7b/ | | |
| CofeAI | FLM | 101B | 2K+ | EN/CN | Apache 2.0 | 2023-09-08 | Terrible perf https://www.reddit.com/r/LocalLLaMA/comments/16danhb/flm101b_an_open_llm_and_how_to_train_it_with_100k/ | | |
| Meta | CodeLlama | 7B, 13B, 34B | 16K | Code | <700M MAU | 2023-08-24 | https://about.fb.com/news/2023/08/code-llama-ai-for-coding/ | | |
| matsuo-lab | weblab | 10B | 2K | JP | CC-BY-NC 4.0 | 2023-08-18 | https://weblab.t.u-tokyo.ac.jp/100%E5%84%84%E3%83%91%E3%83%A9%E3%83%A1%E3%83%BC%E3%82%BF%E3%82%B5%E3%82%A4%E3%82%BA%E3%83%BB%E6%97%A5%E8%8B%B12%E3%83%B6%E5%9B%BD%E8%AA%9E%E5%AF%BE%E5%BF%9C%E3%81%AE%E5%A4%A7%E8%A6%8F%E6%A8%A1/ | | |
| LINE | japanese-large-lm | 3.6B | 2K | JP | Apache 2.0 | 2023-08-14 | https://engineering.linecorp.com/ja/blog/3.6-billion-parameter-japanese-language-model | | |
| Stability AI | Japanese StableLM | 7B | 2K | JP | Apache 2.0 | 2023-08-10 | https://stability.ai/blog/stability-ai-new-jplm-japanese-language-model-stablelm | | |
| Alibaba | Qwen | 7B, 14B | 8K | EN/CN | <100M MAU | 2023-08-03 | https://github.com/QwenLM/Qwen-7B | | |
| Meta | Llama 2 | 7B, 13B, 34B, 70B | 4K | General | <700M MAU | 2023-07-18 | https://ai.meta.com/llama/ | | |
| Salesforce | CodeGen2.5 | 7B | | Code | Apache 2.0 | 2023-07-06 | https://blog.salesforceairesearch.com/codegen25/ | | |
| Salesforce | XGen | 7B | | General/Code | Apache 2.0 | 2023-06-28 | https://blog.salesforceairesearch.com/xgen/ | | |
| BAAI | Aquila | 7B, 33B | | EN/CN | Lawful SA | 2023-06-09 | | | |
| TII | Falcon | 7B, 40B | 2K | General | Apache 2.0 | 2023-05-25 | license changed 5/31 | | |
| s-JoL | Open-Llama | 7B | | General | MIT | 2023-05-11 | https://github.com/s-JoL/Open-Llama | | |
| conceptofmind | PaLM | 1B | | General | MIT | 2023-05-08 | Open Source Reimplementation of Google's PaLM, only C4 trained https://www.reddit.com/r/MachineLearning/comments/13bxu2g/p_opensource_palm_models_trained_at_8k_context/ | | |
| RedPajama | INCITE | 3B, 7B | | General | Apache 2.0 | 2023-05-05 | https://www.together.xyz/blog/redpajama-models-v1 | | |
| BigCode | StarCoder | 15.5B | | Code | OpenRAIL | 2023-05-04 | | | |
| openlm-research | OpenLLaMA | 3B, 7B, 13B, 20B | | General | Apache 2.0 | 2023-05-02 | All models done training to 1T | | |
| MosaicML | MPT | 1B, 7B, 30B | | General | Apache 2.0 | 2023-04-20 | More being trained https://twitter.com/jefrankle/status/1649060478910357504 | | |
| Stability AI | StableLM | 3B, 7B, 15B, 30B | | General | CC-BY-SA 4.0 | 2023-04-19 | Still training (alpha checkpoint not good) | | |
| NVIDIA | GPT-2B-001 | 2B | | General | CC-BY 4.0 | 2023-04-17 | | | |
| GeoV | GeoV | 9B | | General | OpenRAIL | 2023-04-02 | Still Training (checkpoints available) | | |
| Cerebras | Cerebras-GPT | 1.3B, 2.7B, 6.7B, 13B | | General | Apache 2.0 | 2023-03-28 | | | |
| Anthropic | Claude | claude-instant-1, claude-2 | 100K | General | Commercial API | 2023-03-14 | Wait List | | |
| OpenAI | GPT-4 | 1.8T? | 32K | General | Commercial API | 2023-03-14 | Wait List | | |
| Together | GPT-JT-Moderation | 6B | | Moderation | Apache 2.0 | 2023-03-10 | | | |
| Together | GPT-NeoXT-Chat-Base | 20B | | Instruction | Apache 2.0 | 2023-03-10 | | | |
| AI21 | J2 | 7.5B, 17B, 178B | | General | Commercial API | 2023-03-09 | | | |
| Meta | LLaMA | 7B, 13B, 33B, 65B | 2K | General | NC Research | 2023-02-24 | | | |
| BlinkDL | RWKV | 1B, 3B, 7B, 14B | 4K+ | General | Apache 2.0 | 2023-02-15 | | | |
| EleutherAI | Pythia | 1B, 1.4B, 2.8B, 6.9B, 12B | | General | Apache 2.0 | 2023-02-13 | | | |
| BigCode | Santacoder | 1.1B | | Code | OpenRAIL | 2022-12-22 | | | |
| Stanford | BioMedLM | 2.7B | | Academic (Bio) | RAIL | 2022-12-16 | | | |
| EleutherAI | Polyglot | 1.3B, 3.8B, 5.8B | | KO | Apache 2.0 | 2022-12-15 | | | |
| OpenAI | GPT-3.5 | 175B? | | General | Commercial API | 2022-11-30 | | | |
| Meta | Galactica | 120B | | Academic | CC-BY-NC 4.0 | 2022-11-16 | | | |
| Cohere | cohere | 6B, 13B, 52B | | General | Commercial API | 2022-11-08 | | | |
| Google | Flan-T5 | 3B, 11B | | General | Apache 2.0 | 2022-10-22 | | | |
| OpenBMB | CPM-Ant | 1B, 3B, 7B, 10B | | EN/CN | GML Open | 2022-10-12 | | | |
| NVIDIA | NeMo | 1.3B, 5B, 20B | | General | CC-BY 4.0 | 2022-09-15 | | | |
| THUDM | GLM-130B | 130B | | EN/CN | NC China | 2022-08-04 | | | |
| BigScience | BLOOM | 1B, 3B, 7B, 176B | | Multilingual | OpenRAIL | 2022-07-12 | | | |
| Yandex | YaLM | 100B | | EN/RU | Apache 2.0 | 2022-06-23 | | | |
| Meta | OPT | 1.3B, 2.7B, 13B, 30B, 66B, 175B | | General | NC Research | 2022-05-03 | | | |
| Salesforce | CodeGen | 2B, 6B, 16B | | Code | BSD | 2022-04-06 | | | |
| EleutherAI | GPT-Neo | 1.3B, 2.7B | | General | Apache 2.0 | 2022-03-21 | | | |
| Google | Flan-UL2 | 20B | | General | Apache 2.0 | 2022-03-03 | | | |
| EleutherAI | GPT-NeoX | 20B | | General | Apache 2.0 | 2022-02-02 | | | |
| Huawei | PanGu-α | 2.6B, 13B, (200B) | | EN/CN | Apache 2.0 | 2021-12-30 | | | |
| Meta | Fairseq | 1.3B, 2.7B, 6.7B, 13B | | General | MIT | 2021-12-21 | | | |
| OpenAI | Codex | cushman, davinci | | Code | Commercial API | 2021-08-10 | | | |
| EleutherAI | GPT-J | 6B | | General | Apache 2.0 | 2021-06-04 | | | |
| Google | mT5 | 1.2B, 3.7B, 13B | | Multilingual | Apache 2.0 | 2020-12-02 | | | |
| OpenAI | GPT-3 | ada, babbage, curie, davinci | | General | Commercial API | 2020-06-11 | | | |
| Meta | Megatron | 11B | | General | MIT | 2020-04-04 | | | |