Additional Notes |
| |||
Training Details |
|
LLM Name | Meta Llama 3 Instruct 120B Cat A Llama EXL2 4.5bpw |
Repository ๐ค | https://huggingface.co/gbueno86/Meta-Llama-3-Instruct-120b-Cat-a-llama-exl2-4.5bpw |
Base Model(s) | |
Merged Model | Yes |
Model Size | 120b |
Required VRAM | 70.3 GB |
Updated | 2025-02-05 |
Maintainer | gbueno86 |
Model Type | llama |
Instruction-Based | Yes |
Model Files | |
Quantization Type | exl2 |
Model Architecture | LlamaForCausalLM |
Context Length | 8192 |
Model Max Length | 8192 |
Transformers Version | 4.40.1 |
Tokenizer Class | PreTrainedTokenizerFast |
Vocabulary Size | 128256 |
Torch Data Type | bfloat16 |
Best Alternatives |
Context / RAM |
Downloads |
Likes |
---|---|---|---|
...t 120B Cat A Llama EXL2 5.5bpw | 8K / 85.3 GB | 8 | 0 |
...egaDolphin 120B 2.9bpw H6 EXL2 | 4K / 44.3 GB | 6 | 3 |
...gaDolphin 120B 2.65bpw H6 EXL2 | 4K / 40.5 GB | 5 | 2 |
...egaDolphin 120B 4.0bpw H6 EXL2 | 4K / 60.8 GB | 3 | 1 |
Meta Llama 3 225B Instruct | 8K / 443.2 GB | 7 | 18 |
...ma 3 Instruct 120B Cat A Llama | 8K / 243.9 GB | 17 | 1 |
...0B Instruct Abliterated Merged | 8K / 243.7 GB | 4 | 1 |
MegaDolphin 120B AWQ | 4K / 63.3 GB | 117 | 2 |
MegaDolphin 120B GPTQ | 4K / 61.1 GB | 15 | 4 |
Koishi 120B Qlora Gptq | 4K / 59.8 GB | 4 | 1 |
๐ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐