LLM Name | MT3 Gen1 MU Gemma 2 GAv4c 9B |
Repository ๐ค | https://huggingface.co/zelk12/MT3-Gen1-MU-gemma-2-GAv4c-9B |
Base Model(s) | |
Merged Model | Yes |
Model Size | 9b |
Required VRAM | 20.4 GB |
Updated | 2025-02-15 |
Maintainer | zelk12 |
Model Type | gemma2 |
Model Files | |
Model Architecture | Gemma2ForCausalLM |
Context Length | 8192 |
Model Max Length | 8192 |
Transformers Version | 4.45.1 |
Tokenizer Class | GemmaTokenizer |
Padding Token | <pad> |
Vocabulary Size | 256000 |
Torch Data Type | bfloat16 |
Best Alternatives |
Context / RAM |
Downloads |
Likes |
---|---|---|---|
G2 GSHT 32K | 32K / 20.4 GB | 10 | 0 |
SystemGemma2 9B It | 32K / 18.6 GB | 60 | 1 |
Gemma 2 9B It SimPO | 8K / 18.6 GB | 27514 | 150 |
Gemma 2 9B It | 8K / 18.6 GB | 563660 | 659 |
Gemma 2 9B | 8K / 37.1 GB | 111138 | 645 |
...2 9B Cpt Sahabatai V1 Instruct | 8K / 18.6 GB | 3050 | 34 |
MT4 Gen5 Gemma 2 9B | 8K / 20.4 GB | 169 | 2 |
MT Merge4 Gemma 2 9B | 8K / 20.4 GB | 139 | 1 |
MT3 Gen4 Gemma 2 9B | 8K / 20.4 GB | 122 | 4 |
SILMA 9B Instruct V1.0 | 8K / 18.6 GB | 18089 | 64 |
๐ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐