LLM Name | Nous Hermes 2 Mixtral 8x7B DPO GPTQ |
Repository ๐ค | https://huggingface.co/TheBloke/Nous-Hermes-2-Mixtral-8x7B-DPO-GPTQ |
Model Name | Nous Hermes 2 Mixtral 8X7B DPO |
Model Creator | NousResearch |
Base Model(s) | |
Model Size | 6.1b |
Required VRAM | 23.8 GB |
Updated | 2025-03-12 |
Maintainer | TheBloke |
Model Type | mixtral |
Model Files | |
Supported Languages | en |
GPTQ Quantization | Yes |
Quantization Type | gptq |
Model Architecture | MixtralForCausalLM |
License | apache-2.0 |
Context Length | 32768 |
Model Max Length | 32768 |
Transformers Version | 4.37.0.dev0 |
Tokenizer Class | LlamaTokenizer |
Padding Token | </s> |
Vocabulary Size | 32002 |
Torch Data Type | bfloat16 |
Best Alternatives |
Context / RAM |
Downloads |
Likes |
---|---|---|---|
...ixtral 8x7B Instruct V0.1 GPTQ | 32K / 23.8 GB | 214020 | 136 |
Mixtral 8x7B V0.1 GPTQ | 32K / 23.8 GB | 699 | 127 |
Dolphin 2.7 Mixtral 8x7b GPTQ | 32K / 23.8 GB | 528 | 19 |
Dolphin 2.5 Mixtral 8x7b GPTQ | 32K / 23.8 GB | 269 | 111 |
...Hermes 2 Mixtral 8x7B SFT GPTQ | 32K / 23.8 GB | 106 | 11 |
Bagel DPO 8x7b V0.2 GPTQ | 32K / 23.8 GB | 32 | 2 |
Open Gpt4 8x7B V0.2 GPTQ | 32K / 23.8 GB | 36 | 6 |
...xtral Instruct 8x7b Zloss GPTQ | 32K / 23.8 GB | 45 | 2 |
Sensualize Mixtral GPTQ | 32K / 23.8 GB | 39 | 5 |
....1 LimaRP ZLoss DARE TIES GPTQ | 32K / 23.8 GB | 28 | 6 |
๐ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐