LLM Name | Llama 2.4 |
Repository 🤗 | https://huggingface.co/denisman/llama-2.4 |
Model Size | 12.9b |
Required VRAM | 25.8 GB |
Updated | 2024-12-22 |
Maintainer | denisman |
Model Type | mixtral |
Model Files | |
Model Architecture | MixtralForCausalLM |
Context Length | 32768 |
Model Max Length | 32768 |
Transformers Version | 4.44.1 |
Tokenizer Class | LlamaTokenizer |
Padding Token | <s> |
Vocabulary Size | 32000 |
Torch Data Type | bfloat16 |
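
The Required VRAM figure above is consistent with a simple rule of thumb: parameter count times bytes per parameter. The sketch below reproduces that arithmetic for the listed 12.9b parameters in bfloat16. It assumes the listing counts only the weights, with 1 GB taken as 10^9 bytes; the helper name `estimate_weight_memory_gb` and the dtype table are illustrative and not part of the model card. Real usage will be higher once activations and the KV cache for a 32768-token context are included.

```python
# Rough weight-memory estimate: parameters x bytes per parameter.
# Assumption: only the weights are counted (no KV cache, activations, or overhead).
BYTES_PER_PARAM = {"float32": 4, "bfloat16": 2, "float16": 2, "int8": 1}


def estimate_weight_memory_gb(num_params: float, dtype: str = "bfloat16") -> float:
    """Approximate memory for the weights alone, in GB (1 GB = 1e9 bytes)."""
    return num_params * BYTES_PER_PARAM[dtype] / 1e9


# Values taken from the table: 12.9b parameters, bfloat16 weights.
vram_gb = estimate_weight_memory_gb(12.9e9, "bfloat16")
print(f"{vram_gb:.1f} GB")
```

Under the same assumption, loading the weights in float32 would need roughly twice as much memory, and an 8-bit quantized copy roughly half.
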
Best Alternatives | Context / RAM | Downloads | Likes |
---|---|---|---|
MixTAO 7Bx2 MoE V8.1 | 32K / 25.8 GB | 9531 | 55 |
MixTAO 7Bx2 MoE V8.1 | 32K / 25.8 GB | 8760 | 52 |
Inf Silent Kunoichi V0.1 2x7B | 32K / 25.6 GB | 5 | 0 |
Inf Silent Kunoichi V0.2 2x7B | 32K / 25.6 GB | 5 | 0 |
LogoS 7Bx2 MoE 13B V0.2 | 32K / 25.9 GB | 3467 | 10 |
MultiMash8 13B Slerp | 32K / 25.7 GB | 27 | 0 |
MultiMash9 13B Slerp | 32K / 25.7 GB | 24 | 0 |
MultiMash11 13B Slerp | 32K / 25.7 GB | 19 | 0 |
MultiMash10 13B Slerp | 32K / 25.7 GB | 20 | 0 |
Multimash3 12B Slerp | 32K / 25.7 GB | 25 | 0 |