Additional Notes |
|
LLM Name | Quantized Gemma 2B It |
Repository ๐ค | https://huggingface.co/mlx-community/quantized-gemma-2b-it |
Model Size | 2b |
Required VRAM | 2.2 GB |
Updated | 2025-03-13 |
Maintainer | mlx-community |
Model Type | gemma |
Model Files | |
Model Architecture | GemmaForCausalLM |
License | other |
Context Length | 8192 |
Model Max Length | 8192 |
Transformers Version | 4.38.0 |
Padding Token | <pad> |
Vocabulary Size | 256000 |
Torch Data Type | bfloat16 |
Best Alternatives |
Context / RAM |
Downloads |
Likes |
---|---|---|---|
Gemma 1.1 2B It | 8K / 5.1 GB | 121816 | 156 |
Codegemma 2B | 8K / 5.1 GB | 3914 | 78 |
Gemma Ko 1.1 2B It | 8K / 5.1 GB | 2037 | 1 |
EMO 2B | 8K / 5.1 GB | 4465 | 2 |
Octopus V2 | 8K / 5.1 GB | 1047 | 882 |
Gemma 2B It Customer Support | 8K / 10 GB | 23816 | 1 |
LION Gemma 2B Sft V1.0 | 8K / 5.1 GB | 70 | 0 |
... 2B Finetuned Sft Navarasa 2.0 | 8K / 10 GB | 301 | 23 |
Gemma 2B Orpo | 8K / 5.1 GB | 165 | 28 |
2B Or Not 2B | 8K / 5.1 GB | 53 | 27 |
๐ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐