DiscoLM 120B GPTQ by TheBloke

 ยป  All LLMs  ยป  TheBloke  ยป  DiscoLM 120B GPTQ   URL Share it on

  4-bit   Autotrain compatible Base model:discoresearch/disco... Base model:quantized:discorese...   Dataset:bjoernp/ultrachat de   Dataset:leolm/german poems   Dataset:leolm/german songs   Dataset:leolm/openschnabeltier   Dataset:meta-math/metamathqa Dataset:migtissera/synthia-v1.... Dataset:open-orca/slimorca-ded...   Dataset:teknium/openhermes   Dataset:thudm/agentinstruct   Deutsch   Discoresearch   En   Goliath   Gptq   Llama   Llama2   Quantized   Region:us   Safetensors   Sharded   Tensorflow

DiscoLM 120B GPTQ Benchmarks

nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").

DiscoLM 120B GPTQ Parameters and Internals

LLM NameDiscoLM 120B GPTQ
Repository ๐Ÿค—https://huggingface.co/TheBloke/DiscoLM-120b-GPTQ 
Model NameDiscoLM 120B
Model CreatorDisco Research
Base Model(s)  DiscoLM 120B   DiscoResearch/DiscoLM-120b
Model Size120b
Required VRAM59.8 GB
Updated2024-09-18
MaintainerTheBloke
Model Typellama
Model Files  10.0 GB: 1-of-6   9.9 GB: 2-of-6   10.0 GB: 3-of-6   10.0 GB: 4-of-6   10.0 GB: 5-of-6   9.9 GB: 6-of-6
Supported Languagesen
GPTQ QuantizationYes
Quantization Typegptq
Model ArchitectureLlamaForCausalLM
Licensellama2
Context Length4096
Model Max Length4096
Transformers Version4.35.2
Tokenizer ClassLlamaTokenizer
Padding Token</s>
Vocabulary Size32032
Torch Data Typefloat16
DiscoLM 120B GPTQ (TheBloke/DiscoLM-120b-GPTQ)

Best Alternatives to DiscoLM 120B GPTQ

Best Alternatives
Context / RAM
Downloads
Likes
...gLORA 120B Rope8 32K Fp16 GPTQ4K / 59.8 GB204
MegaDolphin 120B GPTQ4K / 61.1 GB74
Koishi 120B Qlora Gptq4K / 9.9 GB61
Goliath 120B GPTQ4K / 59.8 GB3416
Miquella 120B 3.0bpw H6 EXL231K / 44.8 GB410
Miquella 120B 8.0bpw H8 EXL231K / 118.1 GB83
Miquella 120B 4.0bpw H6 EXL231K / 59.4 GB52
...t 120B Cat A Llama EXL2 5.5bpw8K / 85.3 GB3200
...t 120B Cat A Llama EXL2 4.5bpw8K / 70.3 GB21
...h LongLORA 120B Rope8 32K Fp164K / 235.4 GB67
Note: green Score (e.g. "73.2") means that the model is better than TheBloke/DiscoLM-120b-GPTQ.

Rank the DiscoLM 120B GPTQ Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 36026 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v2024072803