Model | Likes | Downloads | VRAM |
---|---|---|---|
...Hermes 2 Mixtral 8x7B DPO GGUF | 56 | 2968 | 17 GB |
...Hermes 2 Mixtral 8x7B DPO GPTQ | 25 | 11710 | 23 GB |
...Hermes 2 Mixtral 8x7B DPO AWQ | 20 | 452 | 24 GB |
...Hermes 2 Mixtral 8x7B DPO GGUF | 2 | 414 | 17 GB |
...Hermes 2 Mixtral 8x7B DPO GPTQ | 1 | 3 | 24 GB |

Best Alternatives | HF Rank | Context/RAM | Downloads | Likes |
---|---|---|---|---|
Mixtral 8x7B V0.1 | 77.95 | 32K / 93.6 GB | 1122606 | 1584 |
Mixtral 8x7B Instruct V0.1 | 77.75 | 32K / 93.6 GB | 501288 | 3937 |
...lQA Mixtral 8x7B Instruct V0.1 | — | 32K / 43.3 GB | 9 | 2 |
Mixtral 8x7B V0.1 Fp8 | — | 32K / 47 GB | 22 | 0 |
Mixtral 8x7B Instruct V0.1 FP8 | — | 32K / 47.1 GB | 226 | 1 |
...tral 8x7B Instruct V0.1 FP8 V2 | — | 32K / 47.1 GB | 112 | 0 |
...tral 8x7B Instruct V0.1 FP8 V3 | — | 32K / 47.1 GB | 35 | 0 |
...tral 8x7B Instruct V0.1 FP8 V1 | — | 32K / 47.1 GB | 7 | 0 |
Aldan Mix 8x7B | — | 32K / 89.4 GB | 1 | 1 |
Taiwan LLM 8x7B DPO | — | 32K / 90 GB | 722 | 18 |
LLM Name | Nous Hermes 2 Mixtral 8x7B DPO |
Repository | Open on 🤗 |
Base Model(s) | |
Model Size | 46.7b |
Required VRAM | 93.6 GB |
Updated | 2024-07-04 |
Maintainer | NousResearch |
Model Type | mixtral |
Model Files | |
Supported Languages | en |
Model Architecture | MixtralForCausalLM |
License | apache-2.0 |
Context Length | 32768 |
Model Max Length | 32768 |
Transformers Version | 4.37.0.dev0 |
Tokenizer Class | LlamaTokenizer |
Padding Token | </s> |
Vocabulary Size | 32002 |
Initializer Range | 0.02 |
Torch Data Type | bfloat16 |
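
The VRAM figures listed above appear roughly consistent with a weights-only estimate: parameter count multiplied by bytes per parameter. The sketch below illustrates that arithmetic for the 46.7B parameter count and the bfloat16 data type given in the table. The function name `estimate_weight_gb`, the bytes-per-parameter mapping, and the use of decimal gigabytes (1 GB = 1e9 bytes) are assumptions made for illustration, not the method the listing uses. The estimate ignores the KV cache, activations, and any per-format quantization overhead, so real requirements will be somewhat higher.

```python
# Rough weights-only memory estimate: parameters x bytes per parameter.
# The mapping below is an illustrative assumption; real quantized formats
# (GGUF, GPTQ, AWQ) carry extra metadata and may mix bit widths.
BYTES_PER_PARAM = {
    "float32": 4.0,
    "bfloat16": 2.0,
    "float16": 2.0,
    "int8": 1.0,
    "int4": 0.5,
}


def estimate_weight_gb(num_params: float, dtype: str) -> float:
    """Return approximate weight memory in decimal gigabytes (1 GB = 1e9 bytes)."""
    if dtype not in BYTES_PER_PARAM:
        raise ValueError(f"unknown dtype: {dtype}")
    return num_params * BYTES_PER_PARAM[dtype] / 1e9


# Parameter count taken from the "Model Size" row (46.7 billion).
NUM_PARAMS = 46.7e9

if __name__ == "__main__":
    for dtype in ("bfloat16", "int8", "int4"):
        print(f"{dtype}: about {estimate_weight_gb(NUM_PARAMS, dtype):.1f} GB")
```

Under these assumptions, the bfloat16 estimate lands close to the listed 93.6 GB, and the 4-bit estimate falls in the same range as the 23 to 24 GB shown for the GPTQ and AWQ variants.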