LLM Name | MT3 Gen5 MM Gemma 2 MT2Av4d 9B V1 |
Repository ๐ค | https://huggingface.co/zelk12/MT3-Gen5-MM-gemma-2-MT2Av4d-9B_v1 |
Base Model(s) | |
Merged Model | Yes |
Model Size | 9b |
Required VRAM | 20.4 GB |
Updated | 2025-04-23 |
Maintainer | zelk12 |
Model Type | gemma2 |
Model Files | |
Model Architecture | Gemma2ForCausalLM |
Context Length | 8192 |
Model Max Length | 8192 |
Transformers Version | 4.46.2 |
Tokenizer Class | GemmaTokenizer |
Padding Token | <pad> |
Vocabulary Size | 256000 |
Torch Data Type | bfloat16 |
Best Alternatives |
Context / RAM |
Downloads |
Likes |
---|---|---|---|
G2 GSHT 32K | 32K / 20.4 GB | 10 | 1 |
SystemGemma2 9B It | 32K / 18.6 GB | 63 | 1 |
Gemma 2 9B It SimPO | 8K / 18.6 GB | 15861 | 164 |
Gemma 2 9B It | 8K / 18.6 GB | 334352 | 704 |
Gemma 2 9B | 8K / 37.1 GB | 117523 | 653 |
...2 9B Cpt Sahabatai V1 Instruct | 8K / 18.6 GB | 4115 | 37 |
Darkest Muse V1 | 8K / 20.4 GB | 847 | 73 |
SILMA 9B Instruct V1.0 | 8K / 18.6 GB | 12478 | 70 |
Gemma 2 9B It | 8K / 18.6 GB | 25468 | 10 |
MTM Merge Gemma 2 9B | 8K / 20.4 GB | 35 | 2 |
๐ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐