Additional Notes |
| ||||||
Training Details |
|
LLM Name | Zephyr Danube Sft Qlora |
Repository ๐ค | https://huggingface.co/Ritvik19/zephyr-danube-sft-qlora |
Base Model(s) | |
Model Size | 1.8b |
Required VRAM | 0 GB |
Updated | 2025-01-04 |
Maintainer | Ritvik19 |
Model Files | |
Model Architecture | Adapter |
License | apache-2.0 |
Model Max Length | 2048 |
Is Biased | none |
Tokenizer Class | LlamaTokenizer |
Padding Token | </s> |
PEFT Type | LORA |
LoRA Model | Yes |
PEFT Target Modules | down_proj|gate_proj|q_proj|v_proj|up_proj|o_proj|k_proj |
LoRA Alpha | 16 |
LoRA Dropout | 0.05 |
R Param | 16 |
Best Alternatives |
Context / RAM |
Downloads |
Likes |
---|---|---|---|
Openhermes Danube Sft Qlora | 0K / 0 GB | 6 | 0 |
Openhermes Danube2 Sft Qlora | 0K / 0 GB | 6 | 0 |
Zephyr Danube2 Sft Qlora | 0K / 0 GB | 5 | 1 |
Qwen Qwen1.5 1.8B 1719882362 | 0K / 0 GB | 6 | 0 |
Qwen Qwen1.5 1.8B 1719881006 | 0K / 0 GB | 6 | 0 |
Qwen Qwen1.5 1.8B 1719880201 | 0K / 0 GB | 6 | 0 |
Qwen Qwen1.5 1.8B 1719898603 | 0K / 0 GB | 5 | 0 |
Qwen Qwen1.5 1.8B 1719860283 | 0K / 0 GB | 6 | 0 |
Qwen Qwen1.5 1.8B 1719861591 | 0K / 0 GB | 6 | 0 |
Qwen Qwen1.5 1.8B 1719835117 | 0K / 0 GB | 5 | 0 |
๐ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐