Mixtral 8x7B V0.1 Int8 GPTQ by Inferless


Tags: Autotrain compatible, Base model: mistralai/mixtral-8..., En, Gptq, License: apache-2.0, Mixtral, Moe, Quantized, Region: us, Safetensors, Vllm

Mixtral 8x7B V0.1 Int8 GPTQ Benchmarks

Mixtral 8x7B V0.1 Int8 GPTQ (Inferless/Mixtral-8x7B-v0.1-int8-GPTQ)

Best Alternatives to Mixtral 8x7B V0.1 Int8 GPTQ

Best Alternatives | HF Rank | Context / RAM | Downloads and Likes (run together in source)
...8x22b Instruct Oh EXL2 2.25bpw | 65.3 | 64K / 40.1 GB | 21
...2 Mixtral 8x22b 6.0bpw H8 EXL2 | 65.3 | 64K / 105.8 GB | 01
...2 Mixtral 8x22b 8.0bpw H8 EXL2 | 65.3 | 64K / 125.1 GB | 62
WizardLM2 2bit | 65.3 | 64K / 4.8 GB | 1630
...Hermes 2 Mixtral 8x7B DPO GGUF | 64.1 | 32K / 17.3 GB | 4512
...es Mixtral 8x7B 2.4bpw H6 EXL2 | 63.7 | 32K / 14.3 GB | 12
...es Mixtral 8x7B 3.0bpw H6 EXL2 | 63.7 | 32K / 17.8 GB | 11
Notux 8x7b V1.3.5bpw EXL2 | 63.7 | 32K / 20.7 GB | 12
Notux 8x7b V1.3.5bpw H6 EXL2 | 63.7 | 32K / 20.7 GB | 11
...es Mixtral 8x7B 6.0bpw H6 EXL2 | 63.7 | 32K / 35.3 GB | 01
Note: green Score (e.g. "73.2") means that the model is better than Inferless/Mixtral-8x7B-v0.1-int8-GPTQ.

Mixtral 8x7B V0.1 Int8 GPTQ Parameters and Internals

LLM Name: Mixtral 8x7B V0.1 Int8 GPTQ
Repository: Inferless/Mixtral-8x7B-v0.1-int8-GPTQ (on Hugging Face)
Model Name: Mixtral-8x7B
Model Creator: mistralai
Base Model(s): Mixtral 8x7B V0.1 (mistralai/Mixtral-8x7B-v0.1)
Updated: 2024-06-24
Maintainer: Inferless
Model Type: mixtral
Supported Languages: en
GPTQ Quantization: Yes
Quantization Type: gptq
Model Architecture: MixtralForCausalLM
License: apache-2.0
Context Length: 32768
Model Max Length: 32768
Transformers Version: 4.36.0.dev0
Tokenizer Class: LlamaTokenizer
Vocabulary Size: 32000
Initializer Range: 0.02
Torch Data Type: bfloat16
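
The parameters above are enough to sketch how this checkpoint might be loaded with the Hugging Face transformers library. The example below is a minimal sketch, not a procedure documented by Inferless. It assumes that transformers and a GPTQ backend are installed and that enough GPU memory is available. The names fits_context and generate are hypothetical helpers introduced only for illustration. The repository id and the 32768-token context length come from the listing.

```python
"""Sketch: load the GPTQ checkpoint and generate text with transformers."""

MODEL_ID = "Inferless/Mixtral-8x7B-v0.1-int8-GPTQ"  # repository id from the listing
CONTEXT_LENGTH = 32768  # "Context Length" from the listing


def fits_context(prompt_tokens: int, max_new_tokens: int) -> bool:
    """Return True if prompt plus generated tokens stay within the context window."""
    if prompt_tokens < 0 or max_new_tokens < 0:
        return False
    return prompt_tokens + max_new_tokens <= CONTEXT_LENGTH


def generate(prompt: str, max_new_tokens: int = 64) -> str:
    """Hypothetical helper: tokenize, check the length budget, then generate."""
    # Imported lazily so fits_context works without the heavy dependencies.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    inputs = tokenizer(prompt, return_tensors="pt")
    prompt_tokens = inputs["input_ids"].shape[1]
    if not fits_context(prompt_tokens, max_new_tokens):
        raise ValueError("prompt plus max_new_tokens exceeds the context length")

    # Assumption: a GPTQ backend is installed so the quantized weights can load;
    # device_map="auto" places the layers on the available devices.
    model = AutoModelForCausalLM.from_pretrained(MODEL_ID, device_map="auto")
    inputs = inputs.to(model.device)
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling generate("The capital of France is") would download the weights on first use. Since the listing describes a base model rather than an instruction-tuned one, plain text continuation prompts are the likelier fit.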

Original data from HuggingFace, OpenCompass and various public git repos.
Release v2024042801