Indic Gemma 7B Finetuned Sft Navarasa by Telugu-LLM-Labs

 ยป  All LLMs  ยป  Telugu-LLM-Labs  ยป  Indic Gemma 7B Finetuned Sft Navarasa   URL Share it on

Base model:finetune:google/gem...   Base model:google/gemma-7b   Bn   Dataset:abhinand/tamil-alpaca Dataset:hydraindiclm/bengali a... Dataset:hydraindiclm/punjabi a... Dataset:odiagenai/odia alpaca ... Dataset:ravithejads/samvaad-hi... Dataset:telugu-llm-labs/telugu... Dataset:telugu-llm-labs/telugu... Dataset:tensoic/airoboros-3.2 ... Dataset:tensoic/alpaca-gujarat...   Dataset:tensoic/gpt-teacher kn Dataset:vishnupj/alpaca instru...   Dataset:yahma/alpaca-cleaned   En   Endpoints compatible   Finetuned   Gu   Hi   Instruct   Kn   Lora   Ml   Or   Pa   Region:us   Safetensors   Ta   Te

Indic Gemma 7B Finetuned Sft Navarasa Benchmarks

nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").
Indic Gemma 7B Finetuned Sft Navarasa (Telugu-LLM-Labs/Indic-gemma-7b-finetuned-sft-Navarasa)

Indic Gemma 7B Finetuned Sft Navarasa Parameters and Internals

Model Type 
text-generation
Supported Languages 
te (Telugu), en (English), ta (Tamil), ml (Malayalam), hi (Hindi), kn (Kannada), gu (Gujarati), bn (Bengali), pa (Punjabi), or (Odia)
Training Details 
Data Sources:
ravithejads/samvaad-hi-filtered, HydraIndicLM/hindi_alpaca_dolly_67k, Telugu-LLM-Labs/yahma_alpaca_cleaned_telugu_filtered_and_romanized, Telugu-LLM-Labs/teknium_GPTeacher_general_instruct_telugu_filtered_and_romanized, abhinand/tamil-alpaca, Tensoic/airoboros-3.2_kn, Tensoic/gpt-teacher_kn, VishnuPJ/Alpaca_Instruct_Malayalam, Tensoic/Alpaca-Gujarati, HydraIndicLM/punjabi_alpaca_52K, HydraIndicLM/bengali_alpaca_dolly_67k, OdiaGenAI/Odia_Alpaca_instructions_52k, yahma/alpaca-cleaned
Data Volume:
approx 500K instruction samples
Methodology:
LoRA finetuned on 9 Indian languages and English language instruction datasets
Training Time:
36.5 Hours
Hardware Used:
1 A100, 80GB
Input Output 
Input Format:
### Instruction: {instruction} ### Input: {input} ## Response: {response}
LLM NameIndic Gemma 7B Finetuned Sft Navarasa
Repository ๐Ÿค—https://huggingface.co/Telugu-LLM-Labs/Indic-gemma-7b-finetuned-sft-Navarasa 
Base Model(s)  Gemma 7B   google/gemma-7b
Model Size7b
Required VRAM0.8 GB
Updated2024-12-22
MaintainerTelugu-LLM-Labs
Instruction-BasedYes
Model Files  0.8 GB
Supported Languageste en ta ml hi kn gu bn pa or
Model ArchitectureAutoModel
Licenseother
Model Max Length8192
Is Biasednone
Tokenizer ClassGemmaTokenizer
Padding Token<pad>
PEFT TypeLORA
LoRA ModelYes
PEFT Target Modulesdown_proj|q_proj|v_proj|gate_proj|k_proj|o_proj|up_proj
LoRA Alpha128
LoRA Dropout0
R Param64

Best Alternatives to Indic Gemma 7B Finetuned Sft Navarasa

Best Alternatives
Context / RAM
Downloads
Likes
... 7b 448 Qinstruct Preview V0.12K / 17.3 GB174
Medical Mixtral 7B V2k0K / 0.4 GB60
Mistral 7B Instruct V0.2 ONNX0K /  GB126
Telugu Gemma 7B Finetuned Sft0K / 0.8 GB014
...l 7B Instructv0.2 Finetuned V20K / 0 GB40432
...tral 7B Instruct V0.1 Int8 Ct20K / 7.2 GB82
...tral 7B Instruct V0.1 Ct2 Int80K / 7.2 GB62
...penthaigpt 1.0.0 Alpha 7B Chat0K / 0.3 GB04
...st Open Llama 7B Open Instruct0K / 6.7 GB144
Ct2fast Falcon 7B Instruct0K / 6.9 GB81
Note: green Score (e.g. "73.2") means that the model is better than Telugu-LLM-Labs/Indic-gemma-7b-finetuned-sft-Navarasa.

Rank the Indic Gemma 7B Finetuned Sft Navarasa Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 40123 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241217