LLM Name | BigMistral 11B GLUE LORA |
Repository 🤗 | https://huggingface.co/athirdpath/BigMistral-11b-GLUE_LORA |
Base Model(s) | |
Model Size | 11B |
Required VRAM | 2 GB |
Updated | 2024-12-22 |
Maintainer | athirdpath |
Model Files | |
Supported Languages | en |
Model Architecture | AutoModelForCausalLM |
License | cc-by-nc-4.0 |
Is Biased | none |
Tokenizer Class | LlamaTokenizer |
Padding Token | </s> |
PEFT Type | LORA |
LoRA Model | Yes |
PEFT Target Modules | v_proj, o_proj, k_proj, up_proj, gate_proj, down_proj, q_proj |
LoRA Alpha | 16 |
LoRA Dropout | 0.08 |
R Param | 128 |
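
To make the LoRA rows above easier to read, the following is a minimal sketch that collects them into a dictionary and derives the scaling factor that standard LoRA applies to the low-rank update (alpha divided by rank). The key names mirror those commonly used in a PEFT `adapter_config.json`, but that mapping is an assumption and was not taken from the repository. The helper functions and the 4096 x 4096 layer shape are hypothetical and used only for illustration.

```python
import json

# Values copied from the table above. Key names follow the usual PEFT
# adapter_config.json fields (assumed; not read from the repository).
lora_config = {
    "peft_type": "LORA",
    "r": 128,
    "lora_alpha": 16,
    "lora_dropout": 0.08,
    "target_modules": [
        "v_proj", "o_proj", "k_proj", "up_proj",
        "gate_proj", "down_proj", "q_proj",
    ],
}


def lora_scaling(alpha: float, rank: int) -> float:
    """Standard LoRA scaling applied to the low-rank update: alpha / r."""
    return alpha / rank


def added_params(d_in: int, d_out: int, rank: int) -> int:
    """Trainable parameters LoRA adds to one d_in x d_out linear layer.

    The update is B @ A with A of shape (rank, d_in) and B of shape
    (d_out, rank), so the count is rank * (d_in + d_out).
    """
    return rank * (d_in + d_out)


scaling = lora_scaling(lora_config["lora_alpha"], lora_config["r"])

print(json.dumps(lora_config, indent=2))
print(f"scaling = {scaling}")  # 16 / 128 = 0.125

# Hypothetical layer shape, for illustration only.
print(f"params per 4096x4096 layer = {added_params(4096, 4096, lora_config['r'])}")
```

With alpha = 16 and r = 128 the scaling factor is 0.125, so the adapter's update is weighted fairly lightly relative to its rank. The real adapter configuration may contain additional fields not listed in the table.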