LLM Name | Saiga Nemo 12b |
Repository ๐ค | https://huggingface.co/IlyaGusev/saiga_nemo_12b |
Model Size | 12b |
Required VRAM | 24.5 GB |
Updated | 2025-02-05 |
Maintainer | IlyaGusev |
Model Type | mistral |
Model Files | |
Supported Languages | ru |
Model Architecture | MistralForCausalLM |
License | apache-2.0 |
Context Length | 1024000 |
Model Max Length | 1024000 |
Transformers Version | 4.46.2 |
Tokenizer Class | PreTrainedTokenizerFast |
Padding Token | <pad> |
Vocabulary Size | 131072 |
Torch Data Type | bfloat16 |
Best Alternatives |
Context / RAM |
Downloads |
Likes |
---|---|---|---|
...r Nemo 12B Instruct R 21 09 24 | 1000K / 24.5 GB | 8449 | 106 |
...s PersonalityEngine V1.1.0 12B | 1000K / 24.5 GB | 492 | 29 |
Captain Eris Violet V0.420 12B | 1000K / 24.5 GB | 1069 | 23 |
Mistral Nemo Kartoffel 12B | 1000K / 24.5 GB | 183 | 3 |
MN 12B Mimicore GreenSnake | 1000K / 24.5 GB | 83 | 2 |
MN 12B Mimicore WhiteSnake | 1000K / 24.5 GB | 61 | 3 |
MN 12B Mag Mell R1 | 1000K / 24.5 GB | 4246 | 99 |
SauerkrautLM Nemo 12B Instruct | 1000K / 24.5 GB | 19527 | 22 |
MN 12B Mimicore Orochi | 1000K / 24.5 GB | 31 | 2 |
Dolphin 2.9.3 Mistral Nemo 12B | 1000K / 24.5 GB | 8318 | 97 |
๐ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐