Gemma 2B Malayalam T2 Model Vllm 4bit by animaRegem


Tags: 4-bit, 4bit, Autotrain compatible, Base model:quantized:telugu-ll..., Base model:telugu-llm-labs/ind..., Bitsandbytes, En, Endpoints compatible, Finetuned, Gemma, Quantized, Region:us, Safetensors, Sft, Trl, Unsloth

Gemma 2B Malayalam T2 Model Vllm 4bit Benchmarks

Scores (nn.n%) show how the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o"), or GPT-4 ("gpt4").
Gemma 2B Malayalam T2 Model Vllm 4bit (animaRegem/gemma-2b-malayalam-t2-model-vllm-4bit)

Gemma 2B Malayalam T2 Model Vllm 4bit Parameters and Internals

Model Type
text-generation-inference, transformers, unsloth, gemma, trl, sft
Additional Notes
This Gemma model was trained 2x faster with Unsloth and Hugging Face's TRL library.
Training Details
Methodology:
Fine-tuned from model: Telugu-LLM-Labs/Indic-gemma-2b-finetuned-sft-Navarasa-2.0; trained 2x faster with Unsloth and Hugging Face's TRL library.
LLM Name: Gemma 2B Malayalam T2 Model Vllm 4bit
Repository: https://huggingface.co/animaRegem/gemma-2b-malayalam-t2-model-vllm-4bit
Base Model(s): Telugu-LLM-Labs/Indic-gemma-2b-finetuned-sft-Navarasa-2.0
Model Size: 2b
Required VRAM: 2.1 GB
Updated: 2025-02-05
Maintainer: animaRegem
Model Type: gemma
Model Files: 2.1 GB
Supported Languages: en
Quantization Type: 4bit
Model Architecture: GemmaForCausalLM
License: apache-2.0
Context Length: 8192
Model Max Length: 8192
Transformers Version: 4.41.1
Tokenizer Class: GemmaTokenizer
Padding Token: <pad>
Vocabulary Size: 256000
Torch Data Type: float16
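
The fields above (repository, architecture, float16 data type, 8192-token context) are enough to sketch how the checkpoint might be loaded with the transformers library. This is a minimal sketch, not the maintainer's documented procedure. It assumes the repository ships its 4-bit bitsandbytes quantization settings inside its own config, that `torch`, `transformers`, `accelerate`, and `bitsandbytes` are installed, and that a CUDA GPU is available. The helper names `prompt_budget`, `load_model`, and `generate` are hypothetical, and the listing does not state a prompt template, so the prompt is passed through unchanged.

```python
REPO_ID = "animaRegem/gemma-2b-malayalam-t2-model-vllm-4bit"  # "Repository" field
MAX_CONTEXT = 8192  # "Context Length" field


def prompt_budget(max_new_tokens: int) -> int:
    """Tokens left for the prompt once room is reserved for generation."""
    if not 0 < max_new_tokens < MAX_CONTEXT:
        raise ValueError("max_new_tokens must be between 1 and MAX_CONTEXT - 1")
    return MAX_CONTEXT - max_new_tokens


def load_model(repo_id: str = REPO_ID):
    """Download and load tokenizer and model (heavy: needs a GPU and network)."""
    # Imported lazily so the module can be inspected without these packages.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(repo_id)
    # Assumption: the 4-bit settings are read from the checkpoint's config,
    # so no explicit quantization config is passed here.
    model = AutoModelForCausalLM.from_pretrained(
        repo_id,
        torch_dtype=torch.float16,  # "Torch Data Type" field
        device_map="auto",          # requires the accelerate package
    )
    return tokenizer, model


def generate(prompt: str, max_new_tokens: int = 64) -> str:
    """Generate a continuation, truncating the prompt to fit the context."""
    tokenizer, model = load_model()
    inputs = tokenizer(
        prompt,
        return_tensors="pt",
        truncation=True,
        max_length=prompt_budget(max_new_tokens),
    ).to(model.device)
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)


if __name__ == "__main__":
    # Illustrative prompt only; the expected prompt format is not documented here.
    print(generate("Translate to Malayalam: Good morning."))
```

Keeping the heavy imports inside `load_model` means the constants and `prompt_budget` can be reused, for example in a request validator, without pulling in a GPU stack.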

Best Alternatives to Gemma 2B Malayalam T2 Model Vllm 4bit

Best Alternatives | Context / RAM | Downloads Likes
... 2B It Hermes Function Calling | 8K / 5.1 GB | 50
Gemma 1.1 2B It Bnb 4bit | 8K / 2.1 GB | 28784
Vi Gemma 2B RAG | 8K / 5.1 GB | 15613
Jenna V2 | 8K / 5.1 GB | 1470
Gemma 2B Bnb 4bit | 8K / 2.1 GB | 285416
Gemma 2B FT 500 Orca Maths | 8K / 5.1 GB | 1260
Jenna Gemma V0.2 | 8K / 2.1 GB | 750
STEMerald 2B 4bit | 8K / 2.2 GB | 791
My AwesomeFinance Model | 8K / 5.1 GB | 120
PRIME DeCyphers Final | 8K / 5.1 GB | 1251
Note: green Score (e.g. "73.2") means that the model is better than animaRegem/gemma-2b-malayalam-t2-model-vllm-4bit.
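
The RAM column above clusters around two values, roughly 2.1 GB for the 4-bit entries and 5.1 GB for the others. A back-of-the-envelope estimate suggests why. The sketch below is an approximation under stated assumptions, not a measurement: the vocabulary size (256000) comes from this listing, while the hidden size (2048) and total parameter count (about 2.51 billion) are assumed values for Gemma 2B that do not appear on this page. It also assumes that bitsandbytes-style quantization leaves the embedding matrix in float16, treats every other parameter as quantized, and ignores quantization metadata.

```python
VOCAB_SIZE = 256_000   # "Vocabulary Size" from the listing
HIDDEN_SIZE = 2048     # assumption: not stated in the listing
TOTAL_PARAMS = 2.51e9  # assumption: not stated in the listing


def estimate_weight_gb(total_params, embed_params, linear_bits, embed_bits=16):
    """Approximate weight storage in GB (10^9 bytes).

    Embedding parameters are stored at embed_bits each; all remaining
    parameters are stored at linear_bits each.
    """
    other_params = total_params - embed_params
    total_bits = other_params * linear_bits + embed_params * embed_bits
    return total_bits / 8 / 1e9


# The embedding table has one HIDDEN_SIZE-wide row per vocabulary entry.
EMBED_PARAMS = VOCAB_SIZE * HIDDEN_SIZE

fp16_gb = estimate_weight_gb(TOTAL_PARAMS, EMBED_PARAMS, linear_bits=16)
int4_gb = estimate_weight_gb(TOTAL_PARAMS, EMBED_PARAMS, linear_bits=4)

print(f"float16 weights: {fp16_gb:.2f} GB")
print(f"4-bit weights:   {int4_gb:.2f} GB")
```

Under these assumptions the two estimates land near 5.0 GB and 2.0 GB, in the same range as the listed figures. The large vocabulary is the reason a 4-bit checkpoint is well over a quarter of the float16 size: the unquantized embedding table alone accounts for about half of the 4-bit total.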


Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227