Instruct Mixtral 8x7B V0.1 Dolly15K GPTQ by TheBloke

 ยป  All LLMs  ยป  TheBloke  ยป  Instruct Mixtral 8x7B V0.1 Dolly15K GPTQ   URL Share it on

  4-bit   Autotrain compatible Base model:brillibits/instruct... Base model:quantized:brillibit...   Conversational Dataset:databricks/databricks-...   Gptq   Instruct   Mixtral   Moe   Quantized   Region:us   Safetensors

Instruct Mixtral 8x7B V0.1 Dolly15K GPTQ Benchmarks

nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").
Instruct Mixtral 8x7B V0.1 Dolly15K GPTQ (TheBloke/Instruct_Mixtral-8x7B-v0.1_Dolly15K-GPTQ)

Instruct Mixtral 8x7B V0.1 Dolly15K GPTQ Parameters and Internals

Model Type 
mixtral, auto-regressive language model
Training Details 
Data Sources:
Dolly15K
Methodology:
Trained for 1.0 epochs using QLora with 1024 context window
Context Length:
1024
Model Architecture:
Llama 2 transformer
Input Output 
Input Format:
{prompt} Output:
LLM NameInstruct Mixtral 8x7B V0.1 Dolly15K GPTQ
Repository ๐Ÿค—https://huggingface.co/TheBloke/Instruct_Mixtral-8x7B-v0.1_Dolly15K-GPTQ 
Model NameInstruct Mixtral 8X7B V0.1 Dolly15K
Model CreatorBrillibits
Base Model(s)  Brillibits/Instruct_Mixtral-8x7B-v0.1_Dolly15K   Brillibits/Instruct_Mixtral-8x7B-v0.1_Dolly15K
Model Size6.1b
Required VRAM23.8 GB
Updated2025-02-05
MaintainerTheBloke
Model Typemixtral
Instruction-BasedYes
Model Files  23.8 GB
GPTQ QuantizationYes
Quantization Typegptq
Model ArchitectureMixtralForCausalLM
Licenseapache-2.0
Context Length32768
Model Max Length32768
Transformers Version4.36.2
Tokenizer ClassLlamaTokenizer
Padding Token<unk>
Vocabulary Size32000
Torch Data Typebfloat16

Best Alternatives to Instruct Mixtral 8x7B V0.1 Dolly15K GPTQ

Best Alternatives
Context / RAM
Downloads
Likes
...ixtral 8x7B Instruct V0.1 GPTQ32K / 23.8 GB55564135
Dolphin 2.5 Mixtral 8x7b GPTQ32K / 23.8 GB143109
...tLM Mixtral 8x7B Instruct GPTQ32K / 23.8 GB2433
...xtral Instruct 8x7b Zloss GPTQ32K / 23.8 GB252
....1 LimaRP ZLoss DARE TIES GPTQ32K / 23.8 GB156
Dolphin 2.7 Mixtral 8x7b GPTQ32K / 23.8 GB3919
...nstruct V0.1 LimaRP ZLoss GPTQ32K / 23.8 GB93
... Mixtral 8x7b Instruct V3 GPTQ32K / 23.8 GB252
Dolphin 2.6 Mixtral 8x7b GPTQ32K / 23.8 GB226
Note: green Score (e.g. "73.2") means that the model is better than TheBloke/Instruct_Mixtral-8x7B-v0.1_Dolly15K-GPTQ.

Rank the Instruct Mixtral 8x7B V0.1 Dolly15K GPTQ Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 42577 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227