Model Type |
| |||||||||
Use Cases |
| |||||||||
Supported Languages |
| |||||||||
Training Details |
| |||||||||
Input Output |
|
LLM Name | Gemma Swahili Mollel 1 Epoch |
Repository ๐ค | https://huggingface.co/Mollel/Gemma_Swahili_Mollel_1_epoch |
Base Model(s) | |
Model Size | 7b |
Required VRAM | 0.2 GB |
Updated | 2024-11-21 |
Maintainer | Mollel |
Model Files | |
Supported Languages | en sw |
Quantization Type | 4bit |
Model Architecture | AutoModel |
License | apache-2.0 |
Model Max Length | 8192 |
Is Biased | none |
Tokenizer Class | GemmaTokenizer |
Padding Token | <pad> |
PEFT Type | LORA |
LoRA Model | Yes |
PEFT Target Modules | down_proj|o_proj|up_proj|k_proj|q_proj|gate_proj|v_proj |
LoRA Alpha | 16 |
LoRA Dropout | 0 |
R Param | 16 |
Best Alternatives |
Context / RAM |
Downloads |
Likes |
---|---|---|---|
Qwen 2.1 7B Persona Lora Model | 0K / 0.2 GB | 0 | 1 |
Qwen2.5 7B Exp2 Lora Model | 0K / 0.2 GB | 0 | 1 |
Qwen 2.5 7B Rp Lora | 0K / 0.2 GB | 0 | 1 |
Text Completion | 0K / 2.4 GB | 0 | 1 |
Cognitive Hacker0.2 | 0K / 0.2 GB | 0 | 1 |
Cognitive Hacker | 0K / 0.2 GB | 0 | 1 |
Lora Model | 0K / 0.2 GB | 0 | 1 |
Mistral 7B Bnb 4bit Lora Model | 0K / 0.2 GB | 0 | 1 |
Mistral Sharegpt90k | 0K / 0.2 GB | 31 | 0 |
Swahili Gemma Lora | 0K / 0.2 GB | 0 | 2 |
๐ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐