Meta Llama 3 Instruct 120B Cat A Llama EXL2 5.5bpw by gbueno86


  Merged Model   Autotrain compatible   Conversational   Endpoints compatible   Exl2   Instruct   Llama   Quantized   Region:us   Safetensors   Sharded   Tensorflow


Meta Llama 3 Instruct 120B Cat A Llama EXL2 5.5bpw Parameters and Internals

Additional Notes 
This model is the result of merging two pre-trained language models with the passthrough merge method. According to the user's own observation, it shows unexpected intelligence and capability.
Training Details 
Methodology:
Passthrough merge method, combining layers from several models using mergekit.
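
As a rough illustration of what a passthrough merge does, the sketch below stacks layer ranges from source models without changing any weights. The model labels, the layer ranges, and the helper name `passthrough_stack` are hypothetical and chosen only for illustration; they are not the slices used for this model, and this is not mergekit's own code.

```python
from typing import List, Tuple

# Hypothetical slice spec: (source model label, first layer, last layer exclusive).
LayerSlice = Tuple[str, int, int]


def passthrough_stack(slices: List[LayerSlice]) -> List[Tuple[str, int]]:
    """Return the merged layer order as (source model, source layer index) pairs.

    Layers are copied as-is and concatenated in the order the slices are given;
    no averaging or interpolation of weights takes place.
    """
    merged: List[Tuple[str, int]] = []
    for model, start, end in slices:
        if not 0 <= start < end:
            raise ValueError(f"bad layer range for {model}: [{start}, {end})")
        merged.extend((model, i) for i in range(start, end))
    return merged


# Illustrative values only; NOT the configuration of this model.
example_slices = [
    ("model_a", 0, 20),
    ("model_b", 10, 30),
    ("model_a", 20, 40),
]

merged_layers = passthrough_stack(example_slices)
print(len(merged_layers))                    # total layers in the stacked model
print(merged_layers[19], merged_layers[20])  # boundary between first two slices
```

Overlapping ranges are allowed on purpose: stacking overlapping slices is how a merged model can end up with more layers, and therefore more parameters, than any single source model.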
LLM Name: Meta Llama 3 Instruct 120B Cat A Llama EXL2 5.5bpw
Repository: https://huggingface.co/gbueno86/Meta-Llama-3-Instruct-120b-Cat-a-llama-exl2-5.5bpw
Base Model(s): ...ma 3 Instruct 120B Cat A Llama (gbueno86/Meta-Llama-3-Instruct-120b-Cat-a-llama)
Merged Model: Yes
Model Size: 120b
Required VRAM: 85.3 GB
Updated: 2025-01-13
Maintainer: gbueno86
Model Type: llama
Instruction-Based: Yes
Model Files: 8.6 GB (1-of-10), 8.6 GB (2-of-10), 8.5 GB (3-of-10), 8.6 GB (4-of-10), 8.5 GB (5-of-10), 8.5 GB (6-of-10), 8.4 GB (7-of-10), 8.5 GB (8-of-10), 8.6 GB (9-of-10), 8.5 GB (10-of-10)
Quantization Type: exl2
Model Architecture: LlamaForCausalLM
Context Length: 8192
Model Max Length: 8192
Transformers Version: 4.40.1
Tokenizer Class: PreTrainedTokenizerFast
Vocabulary Size: 128256
Torch Data Type: bfloat16
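
The listed figures can be cross-checked with some simple arithmetic. The sketch below sums the ten shard sizes from the file list and compares the result with a back-of-the-envelope estimate of parameters times bits per weight. It assumes the nominal "120b" means exactly 120e9 parameters and that 5.5 bits per weight applies uniformly; both are simplifications, so the estimate is only approximate.

```python
# Shard sizes in GB, copied from the "Model Files" entry of this listing.
SHARD_SIZES_GB = [8.6, 8.6, 8.5, 8.6, 8.5, 8.5, 8.4, 8.5, 8.6, 8.5]

# Assumptions: nominal parameter count and a uniform bits-per-weight figure.
NOMINAL_PARAMS = 120e9
BITS_PER_WEIGHT = 5.5


def total_shard_size_gb(sizes):
    """Sum the shard sizes, rounded to one decimal like the listing."""
    return round(sum(sizes), 1)


def estimated_weight_size_gb(params, bits_per_weight):
    """Estimate weight storage: parameters * bits per weight, converted to GB."""
    return params * bits_per_weight / 8 / 1e9


shard_total_gb = total_shard_size_gb(SHARD_SIZES_GB)
estimate_gb = estimated_weight_size_gb(NOMINAL_PARAMS, BITS_PER_WEIGHT)

print(f"sum of shards:    {shard_total_gb} GB")
print(f"nominal estimate: {estimate_gb} GB")
```

The shard total matches the listed Required VRAM of 85.3 GB, which suggests that figure reflects the size of the weight files alone. Actual memory use at inference time would likely be higher once the key/value cache and activations are included.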

Best Alternatives to Meta Llama 3 Instruct 120B Cat A Llama EXL2 5.5bpw

Best Alternatives | Context / RAM | Downloads, Likes
...t 120B Cat A Llama EXL2 4.5bpw | 8K / 70.3 GB | 171
...egaDolphin 120B 2.9bpw H6 EXL2 | 4K / 44.3 GB | 143
...gaDolphin 120B 2.65bpw H6 EXL2 | 4K / 40.5 GB | 142
...egaDolphin 120B 4.0bpw H6 EXL2 | 4K / 60.8 GB | 111
Meta Llama 3 225B Instruct | 8K / 443.2 GB | 2518
...ma 3 Instruct 120B Cat A Llama | 8K / 243.9 GB | 221
...0B Instruct Abliterated Merged | 8K / 243.7 GB | 221
MegaDolphin 120B AWQ | 4K / 63.3 GB | 3432
MegaDolphin 120B GPTQ | 4K / 61.1 GB | 184
Koishi 120B Qlora Gptq | 4K / 59.8 GB | 91

Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227