LLM Name | Saiga Llama3 70b Sft M1 D5 Lora |
Repository ๐ค | https://huggingface.co/IlyaGusev/saiga_llama3_70b_sft_m1_d5_lora |
Model Size | 70b |
Required VRAM | 5.9 GB |
Updated | 2024-12-22 |
Maintainer | IlyaGusev |
Model Files | |
Model Architecture | Adapter |
Model Max Length | 8192 |
Is Biased | none |
Tokenizer Class | PreTrainedTokenizerFast |
Padding Token | <|begin_of_text|> |
PEFT Type | LORA |
LoRA Model | Yes |
PEFT Target Modules | down_proj|q_proj|v_proj|up_proj|k_proj|o_proj|gate_proj |
LoRA Alpha | 16 |
LoRA Dropout | 0 |
R Param | 32 |
Best Alternatives |
Context / RAM |
Downloads |
Likes |
---|---|---|---|
Llama 3 70B Instruct Spider | 0K / 141.9 GB | 6 | 0 |
Airoboros 70B 3.3 Peft | 0K / 0.4 GB | 0 | 2 |
Llama3v1 | 0K / 0.1 GB | 5 | 0 |
Xwin LM 70B V0.1 LORA | 0K / 1.7 GB | 0 | 1 |
Euryale 1.3 L2 70B LORA | 0K / 1.7 GB | 3 | 1 |
Miqu 1 70B Hermes2.5 Qlora | 0K / 4.8 GB | 0 | 4 |
Limarp Miqu 1 70B Qlora | 0K / 1.7 GB | 5 | 4 |
Miqu Limarp 70B DPO Safefile | 0K / 38.4 GB | 0 | 1 |
Miqu Limarp 70B | 0K / 4.4 GB | 1 | 2 |
Waxwing Storytelling 70B LoRA | 0K / 0.8 GB | 12 | 4 |
๐ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐