| Model | Size (b) | GPU | GPUs | VRAM GB | In GPU % | token/s | Estimated Cost | Bang for buck | Specs | GPU Type | OS | Ollama | Platform | Comment |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| llama3.1 | 8 | A100 PCIe | 1 | 80 | 100 | 120,3 | 22.000€ | 5,47 | 117+14 | Professional | Ubuntu 22.04 | 0.3.9 | runpod.io | Prices for Germany (September 2024) |
| llama3.1 | 70 | A100 PCIe | 1 | 80 | 100 | 27,3 | 22.000€ | 1,24 | 117+14 | Professional | Ubuntu 22.04 | 0.3.9 | runpod.io | |
| llama3.1 | 405 | A100 PCIe | 2 | 160 | 70 | 0,0 | 44.000€ | 0,00 | 234+62 | Professional | Ubuntu 22.04 | 0.3.9 | runpod.io | |
| mistral-large | 123 | A100 PCIe | 2 | 160 | 100 | 17,7 | 44.000€ | 0,40 | 234+62 | Professional | Ubuntu 22.04 | 0.3.9 | runpod.io | |
| llama3.1 | 8 | A100 PCIe | 2 | 160 | 100 | 135,0 | 44.000€ | 3,07 | 234+62 | Professional | Ubuntu 22.04 | 0.3.9 | runpod.io | |
| llama3.1 | 70 | A100 PCIe | 2 | 160 | 100 | 27,4 | 44.000€ | 0,62 | 234+62 | Professional | Ubuntu 22.04 | 0.3.9 | runpod.io | |
| llama3.1 | 8 | A40 | 1 | 48 | 100 | 88,3 | 5.700€ | 15,49 | 50+9 | Professional | Ubuntu 22.04 | 0.3.9 | runpod.io | |
| llama3.1 | 70 | A40 | 1 | 48 | 100 | 13,3 | 5.700€ | 2,33 | 50+9 | Professional | Ubuntu 22.04 | 0.3.9 | runpod.io | |
| llama3.1 | 8 | Apple Silicon | 1 | 16 | 100 | 11,1 | 800€ | 13,88 | M1 16GB | Consumer | macOS 15.0 | 0.3.10 | Physical | |
| mistral | 12 | Apple Silicon | 1 | 16 | 100 | 7,3 | 800€ | 9,13 | M1 16GB | Consumer | macOS 15.0 | 0.3.10 | Physical | |
| llama3.1 | 70 | Apple Silicon | 1 | 64 | 100 | 7,9 | 2.400€ | 3,29 | M1 Max 64GB | Consumer | macOS ??.? | 0.3.9 | Physical | |
| llama3.1 | 8 | Apple Silicon | 1 | 32 | 100 | 34,6 | 2.000€ | 17,30 | M2 Pro 32GB | Consumer | macOS 14.4.1 | 0.3.9 | Physical | |
| mistral-nemo | 12 | Apple Silicon | 1 | 32 | 100 | 23,2 | 2.000€ | 11,60 | M2 Pro 32GB | Consumer | macOS 14.4.1 | 0.3.9 | Physical | |
| llama3.1 | 405 | H100 PCIe | 1 | 80 | 36 | 0,1 | 40.000€ | 0,00 | 176+16 | Professional | Ubuntu 22.04 | 0.3.9 | runpod.io | |
| mistral-large | 123 | H100 PCIe | 1 | 80 | 100 | 20,0 | 40.000€ | 0,50 | 176+16 | Professional | Ubuntu 22.04 | 0.3.9 | runpod.io | |
| llama3.1 | 8 | H100 PCIe | 1 | 80 | 100 | 150,0 | 40.000€ | 3,75 | 176+16 | Professional | Ubuntu 22.04 | 0.3.9 | runpod.io | |
| llama3.1 | 70 | H100 PCIe | 1 | 80 | 100 | 32,0 | 40.000€ | 0,80 | 176+16 | Professional | Ubuntu 22.04 | 0.3.9 | runpod.io | |
| mistral-large | 123 | H100 PCIe | 2 | 160 | 100 | 20,0 | 80.000€ | 0,25 | 500+32 | Professional | Ubuntu 22.04 | 0.3.9 | runpod.io | |
| llama3.1 | 8 | H100 PCIe | 2 | 160 | 100 | 150,0 | 80.000€ | 1,88 | 500+32 | Professional | Ubuntu 22.04 | 0.3.9 | runpod.io | |
| llama3.1 | 70 | H100 PCIe | 2 | 160 | 100 | 32,0 | 80.000€ | 0,40 | 500+32 | Professional | Ubuntu 22.04 | 0.3.9 | runpod.io | |
| llama3.1 | 8 | L40 | 1 | 48 | 100 | 99,0 | 8.700€ | 11,38 | 250+16 | Professional | Ubuntu 22.04 | 0.3.9 | runpod.io | |
| llama3.1 | 70 | L40 | 1 | 48 | 100 | 16,4 | 8.700€ | 1,89 | 250+16 | Professional | Ubuntu 22.04 | 0.3.9 | runpod.io | |
| mistral-nemo | 12 | L40 | 1 | 48 | 100 | 72,8 | 8.700€ | 8,37 | 250+17 | Professional | Ubuntu 22.04 | 0.3.9 | runpod.io | |
| command-r (08-2024) | 35 | L40S | 1 | 48 | 100 | 32,6 | 9.000€ | 3,62 | 62+16 | Professional | Ubuntu 22.04 | 0.3.9 | runpod.io | |
| llama3.1 | 8 | L40S | 1 | 48 | 100 | 100,0 | 9.000€ | 11,11 | 62+16 | Professional | Ubuntu 22.04 | 0.3.9 | runpod.io | |
| mistral-nemo | 12 | L40S | 1 | 48 | 100 | 72,5 | 9.000€ | 8,06 | 62+16 | Professional | Ubuntu 22.04 | 0.3.9 | runpod.io | |
| llama3.1 | 70 | L40S | 1 | 48 | 100 | 16,6 | 9.000€ | 1,84 | 62+16 | Professional | Ubuntu 22.04 | 0.3.9 | runpod.io | |
| mistral-nemo | 12 | RTX 2000 Ada | 1 | 16 | 100 | 28,7 | 700€ | 41,00 | 31+6 | Professional | Ubuntu 22.04 | 0.3.9 | runpod.io | |
| llama3.1 | 8 | RTX 2000 Ada | 1 | 16 | 100 | 42,5 | 700€ | 60,71 | 31+6 | Professional | Ubuntu 22.04 | 0.3.9 | runpod.io | |
| llama3.1 | 8 | RTX 3070 Ti | 1 | 8 | 100 | 82,2 | 350€ | 234,86 | i7 16GB RAM | Consumer | Windows 11 | 0.3.10 | Physical | |
| mistral-nemo | 12 | RTX 3070 Ti | 1 | 8 | 76 | 4,5 | 350€ | 12,86 | i7 16GB RAM | Consumer | Windows 11 | 0.3.10 | Physical | |
| llama3.1 | 8 | RTX 3090 | 1 | 24 | 100 | 108,0 | 1.500€ | 72,00 | 125+32 | Consumer | Ubuntu 22.04 | 0.3.9 | runpod.io | |
| mistral-nemo | 12 | RTX 3090 | 1 | 24 | 100 | 79,0 | 1.500€ | 52,67 | 125+32 | Consumer | Ubuntu 22.04 | 0.3.9 | runpod.io | |
| mistral-nemo | 12 | RTX 4000 Ada | 1 | 20 | 100 | 40,3 | 1.400€ | 28,79 | 39+9 | Professional | Ubuntu 22.04 | 0.3.9 | runpod.io | |
| llama3.1 | 8 | RTX 4000 Ada | 1 | 20 | 100 | 58,5 | 1.400€ | 41,79 | 39+9 | Professional | Ubuntu 22.04 | 0.3.9 | runpod.io | |
| llama3.1 | 70 | RTX 4070 Ti | 2 | 24 | 54 | 1,2 | 1.600€ | 0,75 | 62+32 | Consumer | Ubuntu 22.04 | 0.3.9 | runpod.io | |
| mistral-nemo | 12 | RTX 4070 Ti | 2 | 24 | 100 | 55,0 | 1.600€ | 34,38 | 62+32 | Consumer | Ubuntu 22.04 | 0.3.9 | runpod.io | |
| llama3.1 | 8 | RTX 4070 Ti | 2 | 24 | 100 | 77,0 | 1.600€ | 48,13 | 62+32 | Consumer | Ubuntu 22.04 | 0.3.9 | runpod.io | |
| command-r (08-2024) | 35 | RTX 4070 Ti | 2 | 24 | 93 | 11,5 | 1.600€ | 7,19 | 62+32 | Consumer | Ubuntu 22.04 | 0.3.9 | runpod.io | |
| llama3.1 | 8 | RTX 4090 | 2 | 48 | 100 | 115,0 | 3.600€ | 31,94 | 124+32 | Consumer | Ubuntu 22.04 | 0.3.9 | runpod.io | |
| mistral-large | 123 | RTX 4090 | 2 | 48 | 67 | 1,1 | 3.600€ | 0,31 | 124+34 | Consumer | Ubuntu 22.04 | 0.3.9 | runpod.io | |
| llama3.1 | 70 | RTX 4090 | 2 | 48 | 100 | 20,3 | 3.600€ | 5,64 | 124+33 | Consumer | Ubuntu 22.04 | 0.3.9 | runpod.io | |
| llama3.1 | 70 | RTX 4090 | 4 | 96 | 100 | 20,0 | 7.200€ | 2,78 | 248+64 | Consumer | Ubuntu 22.04 | 0.3.9 | runpod.io | |
| command-r-plus | 104 | RTX 4090 | 4 | 96 | 100 | 14,0 | 7.200€ | 1,94 | 248+64 | Consumer | Ubuntu 22.04 | 0.3.9 | runpod.io | |
| llama3.1 | 8 | RTX 4090 | 4 | 96 | 100 | 115,4 | 7.200€ | 16,03 | 248+64 | Consumer | Ubuntu 22.04 | 0.3.9 | runpod.io | |
| mistral-large | 123 | RTX 4090 | 4 | 96 | 100 | 12,4 | 7.200€ | 1,72 | 248+64 | Consumer | Ubuntu 22.04 | 0.3.9 | runpod.io | |
| llama3.1 | 70 | RTX 6000 Ada | 1 | 48 | 100 | 20,2 | 7.700€ | 2,62 | 62+14 | Professional | Ubuntu 22.04 | 0.3.9 | runpod.io | |
| llama3.1 | 8 | RTX 6000 Ada | 1 | 48 | 100 | 130,0 | 7.700€ | 16,88 | 62+14 | Professional | Ubuntu 22.04 | 0.3.9 | runpod.io | |
| mistral-nemo | 12 | RTX 6000 Ada | 1 | 48 | 100 | 93,0 | 7.700€ | 12,08 | 62+14 | Professional | Ubuntu 22.04 | 0.3.9 | runpod.io | |
| llama3.1 | 70 | RTX A5000 | 1 | 24 | 60 | 0,4 | 2.300€ | 0,17 | 25+8 | Professional | Ubuntu 22.04 | 0.3.9 | runpod.io | |
| llama3.1 | 8 | RTX A5000 | 1 | 24 | 100 | 93,0 | 2.300€ | 40,43 | 25+8 | Professional | Ubuntu 22.04 | 0.3.9 | runpod.io | |
| llama3.1 | 70 | RTX A5000 | 2 | 48 | 100 | 15,4 | 4.600€ | 3,35 | 100+18 | Professional | Ubuntu 22.04 | 0.3.9 | runpod.io | |
| llama3.1 | 8 | RTX A5000 | 2 | 48 | 100 | 94,0 | 4.600€ | 20,43 | 100+18 | Professional | Ubuntu 22.04 | 0.3.9 | runpod.io | |
| llama3.1 | 70 | RTX A5000 | 4 | 96 | 100 | 15,8 | 9.200€ | 1,72 | 200+36 | Professional | Ubuntu 22.04 | 0.3.9 | runpod.io | |
| llama3.1 | 8 | RTX A5000 | 4 | 96 | 100 | 95,0 | 9.200€ | 10,33 | 200+36 | Professional | Ubuntu 22.04 | 0.3.9 | runpod.io | |
| mistral-large | 123 | RTX A5000 | 4 | 96 | 100 | 9,6 | 9.200€ | 1,04 | 200+36 | Professional | Ubuntu 22.04 | 0.3.9 | runpod.io | |
| llama3.1 | 70 | RTX A6000 | 6 | 288 | 100 | 15,8 | 29.400€ | 0,54 | 300+54 | Professional | Ubuntu 22.04 | 0.3.9 | runpod.io | |
| llama3.1 | 8 | RTX A6000 | 6 | 288 | 100 | 106,0 | 29.400€ | 3,61 | 300+54 | Professional | Ubuntu 22.04 | 0.3.9 | runpod.io | |
| llama3.1 | 405 | RTX A6000 | 6 | 288 | 100 | 3,0 | 29.400€ | 0,10 | 300+54 | Professional | Ubuntu 22.04 | 0.3.9 | runpod.io | |
| mistral-large | 123 | RTX A6000 | 6 | 288 | 100 | 9,6 | 29.400€ | 0,33 | 300+54 | Professional | Ubuntu 22.04 | 0.3.9 | runpod.io | |

| GPU | New | Used |
|---|---|---|
| RTX A5000 | 2.300€ | 1.700€ |
| RTX A6000 | 4.900€ | 4.200€ |
| RTX 4090 | 1.800€ | 1.500€ |
| A100 (40GB) | 14.000€ | 5.000€ |
| A100 (80GB) | 22.000€ | 17.500€ |
| H100 (80GB) | 40.000€ | 30.000€ |
| A40 | 5.700€ | 4.000€ |
| RTX 3090 | 1.500€ | 800€ |
| L40 | 8.700€ | 5.800€ |
| L40S | 9.000€ | 8.500€ |
| RTX 2000 Ada | 700€ | 790€ |
| RTX 4000 Ada | 1.400€ | 1.500€ |
| RTX 6000 Ada | 7.700€ | 8.000€ |
| RTX 4070 Ti | 800€ | 700€ |
| M1 16GB | 800€ | |
| M1 Max 64GB | 2.400€ | |
| M2 Pro 32GB | 2.000€ | |
| RTX 3070 Ti | 350€ | |

| Model | File size (GB) | VRAM size (GB) |
|---|---|---|
| llama3.1:8b | 4.7 | 6.7 |
| llama3.1:70b | 39 | 44 - 46 |
| llama3.1:405b | 228 | 237 - 260 |
| mistral-large:123b | 69 | 74 - 90 |
| mistral-nemo:12b | 7.1 | 7.8 - 9.3 |
| command-r (08-2024) | 18 | 22 |
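
The "Bang for buck" values are consistent with token/s divided by the estimated cost, scaled to 1,000 €: for example 120,3 token/s at 22.000€ gives 5,47. The sheet does not state this formula, so the sketch below is an inference from the numbers, not the author's definition. The helper names `parse_de_number` and `bang_for_buck` are hypothetical, and the parser assumes the German notation used in the benchmark rows ("." for thousands, "," for decimals); it would misread dot-decimal values such as "4.7".

```python
def parse_de_number(text: str) -> float:
    """Parse a German-formatted number such as '22.000€' or '120,3'."""
    cleaned = text.replace("€", "").strip()
    # In this notation '.' groups thousands and ',' marks the decimals.
    return float(cleaned.replace(".", "").replace(",", "."))


def bang_for_buck(tokens_per_s: float, cost_eur: float) -> float:
    """Tokens per second per 1,000 EUR of estimated hardware cost (assumed formula)."""
    if cost_eur <= 0:
        raise ValueError("cost_eur must be positive")
    return tokens_per_s / cost_eur * 1000


# Rows copied from the benchmark table: (model, GPU setup, token/s, estimated cost)
rows = [
    ("llama3.1 8b", "1x A100 PCIe", "120,3", "22.000€"),
    ("llama3.1 8b", "1x RTX 3070 Ti", "82,2", "350€"),
    ("llama3.1 70b", "2x RTX 4090", "20,3", "3.600€"),
]

for model, gpu, tps, cost in rows:
    score = bang_for_buck(parse_de_number(tps), parse_de_number(cost))
    print(f"{model:14} {gpu:16} {score:7.2f}")
```

For these three rows the computed scores round to 5.47, 234.86 and 5.64, matching the table. The estimated cost itself appears to be the "New" GPU price multiplied by the number of GPUs (for example 2 x 1.800€ = 3.600€ for two RTX 4090), so the score ignores the rest of the system.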
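
The VRAM sizes listed per model can serve as a rough first check of whether a model will run entirely on the GPU. The sketch below compares the total VRAM of a setup (the "VRAM GB" column) against the listed range, assuming both are in GB. The function name `vram_fit` and its three labels are hypothetical, and the check is only a heuristic: in the measurements, command-r (08-2024) reached 93 % in GPU on 24 GB despite a listed size of 22.

```python
# (low, high) VRAM sizes from the model table, assumed to be in GB.
# Models with a single listed value use it for both bounds.
MODEL_VRAM_GB = {
    "llama3.1:8b": (6.7, 6.7),
    "llama3.1:70b": (44, 46),
    "llama3.1:405b": (237, 260),
    "mistral-large:123b": (74, 90),
    "mistral-nemo:12b": (7.8, 9.3),
    "command-r (08-2024)": (18, 22),
}


def vram_fit(model: str, total_vram_gb: float) -> str:
    """Classify a setup by comparing its total VRAM with the listed range."""
    low, high = MODEL_VRAM_GB[model]
    if total_vram_gb >= high:
        return "fits"          # above the upper bound of the listed range
    if total_vram_gb >= low:
        return "borderline"    # inside the listed range
    return "offloads"          # below the lower bound, part of the model leaves the GPU


for model, vram in [
    ("llama3.1:70b", 48),        # table: 100 % in GPU
    ("llama3.1:70b", 24),        # table: 54 % in GPU
    ("mistral-large:123b", 80),  # table: 100 % in GPU
    ("mistral-nemo:12b", 8),     # table: 76 % in GPU
]:
    print(f"{model:20} {vram:4} GB -> {vram_fit(model, vram)}")
```

The "borderline" cases show why the range matters: mistral-large on 80 GB still ran fully on the GPU, while mistral-nemo on 8 GB did not, and token/s dropped from 82,2 (llama3.1 8b) to 4,5 on the same card.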