Supported Languages | te (Telugu), en (English), ta (Tamil), ml (Malayalam), hi (Hindi), kn (Kannada), gu (Gujarati), bn (Bengali), pa (Punjabi), or (Odia) |
|
Training Details |
Data Sources: | ravithejads/samvaad-hi-filtered, HydraIndicLM/hindi_alpaca_dolly_67k, Telugu-LLM-Labs/yahma_alpaca_cleaned_telugu_filtered_and_romanized, Telugu-LLM-Labs/teknium_GPTeacher_general_instruct_telugu_filtered_and_romanized, abhinand/tamil-alpaca, Tensoic/airoboros-3.2_kn, Tensoic/gpt-teacher_kn, VishnuPJ/Alpaca_Instruct_Malayalam, Tensoic/Alpaca-Gujarati, HydraIndicLM/punjabi_alpaca_52K, HydraIndicLM/bengali_alpaca_dolly_67k, OdiaGenAI/Odia_Alpaca_instructions_52k, yahma/alpaca-cleaned |
|
Data Volume: | Approximately 500K instruction samples |
|
Methodology: | LoRA fine-tuning on instruction datasets in 9 Indian languages and English |
|
Training Time: | |
Hardware Used: | |
|
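
To give a feel for why LoRA is a practical choice for instruction tuning across ten languages, the sketch below counts the trainable parameters a single LoRA adapter adds to one weight matrix. It relies only on the standard LoRA formulation (a frozen weight `W` plus a learned low-rank update `B @ A`). The matrix dimensions and the rank are hypothetical values chosen for illustration; this card does not state the actual LoRA configuration, and the helper name `lora_trainable_params` is invented for this example.

```python
def lora_trainable_params(d_out: int, d_in: int, rank: int) -> int:
    """Parameters added by one LoRA adapter on a d_out x d_in weight matrix.

    LoRA keeps the original weight W frozen and learns an update B @ A,
    where B has shape (d_out, rank) and A has shape (rank, d_in).
    """
    return rank * (d_out + d_in)


# Hypothetical sizes for illustration only; not taken from this model card.
d_out, d_in, rank = 4096, 4096, 16

full = d_out * d_in                       # parameters in the frozen matrix
added = lora_trainable_params(d_out, d_in, rank)

print(f"full matrix:  {full:,} params")
print(f"LoRA adapter: {added:,} params ({added / full:.2%} of the full matrix)")
```

Under these assumed sizes the adapter trains well under 1% of the parameters in the matrix it modifies, which is why a mixed dataset of roughly 500K samples can be used for fine-tuning without updating the full model.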