Model Type |
| |||||||||
Supported Languages |
| |||||||||
Training Details |
|
LLM Name | Gemma 2 9B It SPPO Iter3 |
Repository ๐ค | https://huggingface.co/UCLA-AGI/Gemma-2-9B-It-SPPO-Iter3 |
Model Size | 9b |
Required VRAM | 18.6 GB |
Updated | 2025-04-22 |
Maintainer | UCLA-AGI |
Model Type | gemma2 |
Model Files | |
Supported Languages | en |
Model Architecture | Gemma2ForCausalLM |
License | gemma |
Context Length | 8192 |
Model Max Length | 8192 |
Transformers Version | 4.43.0.dev0 |
Tokenizer Class | GemmaTokenizer |
Padding Token | <pad> |
Vocabulary Size | 256000 |
Torch Data Type | bfloat16 |
Best Alternatives |
Context / RAM |
Downloads |
Likes |
---|---|---|---|
G2 GSHT 32K | 32K / 20.4 GB | 10 | 1 |
SystemGemma2 9B It | 32K / 18.6 GB | 64 | 1 |
Gemma 2 9B It SimPO | 8K / 18.6 GB | 15860 | 164 |
Gemma 2 9B It | 8K / 18.6 GB | 336347 | 705 |
Gemma 2 9B | 8K / 37.1 GB | 115488 | 654 |
...2 9B Cpt Sahabatai V1 Instruct | 8K / 18.6 GB | 4220 | 37 |
Darkest Muse V1 | 8K / 20.4 GB | 903 | 73 |
SILMA 9B Instruct V1.0 | 8K / 18.6 GB | 12408 | 70 |
Gemma 2 9B It | 8K / 18.6 GB | 25519 | 10 |
MTM Merge Gemma 2 9B | 8K / 20.4 GB | 35 | 2 |
๐ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐