LLM Name | Tess 3 Mistral Nemo 12B |
Repository ๐ค | https://huggingface.co/migtissera/Tess-3-Mistral-Nemo-12B |
Model Size | 12b |
Required VRAM | 24.5 GB |
Updated | 2024-09-20 |
Maintainer | migtissera |
Model Type | mistral |
Model Files | |
Model Architecture | MistralForCausalLM |
License | apache-2.0 |
Context Length | 1024000 |
Model Max Length | 1024000 |
Transformers Version | 4.44.0.dev0 |
Tokenizer Class | PreTrainedTokenizerFast |
Padding Token | <|end_of_text|> |
Vocabulary Size | 131075 |
Torch Data Type | bfloat16 |
Best Alternatives |
Context / RAM |
Downloads |
Likes |
---|---|---|---|
Dolphin 2.9.3 Mistral Nemo 12B | 1000K / 24.5 GB | 7110 | 68 |
SauerkrautLM Nemo 12B Instruct | 1000K / 24.5 GB | 7673 | 20 |
NemoMix Unleashed 12B | 1000K / 24.5 GB | 6443 | 92 |
MN 12B Lyra V3 | 1000K / 24.4 GB | 337 | 32 |
MN 12B Lyra V4 | 1000K / 24.5 GB | 324 | 8 |
Lumimaid V0.2 12B | 1000K / 24.5 GB | 766 | 68 |
Romulus Mistral Nemo 12B Simpo | 1000K / 24.5 GB | 1789 | 11 |
Rocinante 12B V1 | 1000K / 24.5 GB | 298 | 18 |
Mini Magnum 12B V1.1 | 1000K / 24.5 GB | 147 | 70 |
Arcanum 12B | 1000K / 24.5 GB | 85 | 0 |
๐ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐