Nous Hermes 2 Mixtral 8x7B SFT by NousResearch


Tags: Autotrain compatible, Base model: mistralai/mixtral-8..., ChatML, Conversational, Dataset: teknium/openhermes-2.5, Distillation, En, Endpoints compatible, Finetuned, GPT4, Instruct, License: apache-2.0, Mixtral, MoE, Region: US, Safetensors, Sharded, Synthetic data, Tensorflow
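Since ChatML appears in the tag list, prompts to this model follow the ChatML turn format. Below is a minimal sketch of that format; the system and user message contents are illustrative placeholders, not taken from this page:

```python
# Minimal sketch of the ChatML prompt format indicated by the tags above.
# The message contents are illustrative placeholders.
prompt = (
    "<|im_start|>system\n"
    "You are a helpful assistant.<|im_end|>\n"
    "<|im_start|>user\n"
    "Summarize mixture-of-experts routing in one sentence.<|im_end|>\n"
    "<|im_start|>assistant\n"
)
```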

Rank the Nous Hermes 2 Mixtral 8x7B SFT Capabilities

🆘 Have you tried this model? Rate its performance. Your feedback helps the ML community identify the most suitable model for their needs. Your contribution really does make a difference! 🌟

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  
Nous Hermes 2 Mixtral 8x7B SFT (NousResearch/Nous-Hermes-2-Mixtral-8x7B-SFT)

Quantized Models of the Nous Hermes 2 Mixtral 8x7B SFT

Model                                  Likes   Downloads   VRAM
...Hermes 2 Mixtral 8x7B SFT GGUF      25      2588        17 GB
...Hermes 2 Mixtral 8x7B SFT GPTQ      10      29          23 GB
...Hermes 2 Mixtral 8x7B SFT GGUF      3       707         17 GB
...Hermes 2 Mixtral 8x7B SFT AWQ       2       55          24 GB
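One common way to run a GGUF quant like those listed above is llama-cpp-python. This is a hedged sketch only: the repo id and quant filename are assumptions based on common community naming, not taken from this page, and actual VRAM use depends on the quant you pick.

```python
# Hedged sketch: running a GGUF quant of this model with llama-cpp-python.
# The repo_id and filename pattern below are assumptions, not from this page.
from llama_cpp import Llama

llm = Llama.from_pretrained(
    repo_id="TheBloke/Nous-Hermes-2-Mixtral-8x7B-SFT-GGUF",  # assumed repo id
    filename="*Q4_K_M.gguf",   # assumed quant file; pick one that fits your VRAM
    n_ctx=32768,               # matches the model's 32K context length
    n_gpu_layers=-1,           # offload all layers to GPU if memory allows
)

# ChatML-formatted prompt, stopping at the end-of-turn token.
out = llm(
    "<|im_start|>user\nHello!<|im_end|>\n<|im_start|>assistant\n",
    max_tokens=128,
    stop=["<|im_end|>"],
)
print(out["choices"][0]["text"])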

Best Alternatives to Nous Hermes 2 Mixtral 8x7B SFT

Best Alternatives                      HF Rank   Context/RAM      Downloads   Likes
Mixtral 8x7B V0.1                      77.95     32K / 93.6 GB    1122606     1584
Mixtral 8x7B Instruct V0.1             77.75     32K / 93.6 GB    501288      3937
...lQA Mixtral 8x7B Instruct V0.1                32K / 43.3 GB    9           2
Mixtral 8x7B V0.1 Fp8                            32K / 47 GB      22          0
Mixtral 8x7B Instruct V0.1 FP8                   32K / 47.1 GB    226         1
...tral 8x7B Instruct V0.1 FP8 V2                32K / 47.1 GB    112         0
...tral 8x7B Instruct V0.1 FP8 V3                32K / 47.1 GB    35          0
...tral 8x7B Instruct V0.1 FP8 V1                32K / 47.1 GB    7           0
Aldan Mix 8x7B                                   32K / 89.4 GB    1           1
Taiwan LLM 8x7B DPO                              32K / 90 GB      722         18
Note: a green score (e.g. "73.2") means the model is better than NousResearch/Nous-Hermes-2-Mixtral-8x7B-SFT.

Nous Hermes 2 Mixtral 8x7B SFT Parameters and Internals

LLM Name: Nous Hermes 2 Mixtral 8x7B SFT
Repository: Open on 🤗
Base Model(s): Mixtral 8x7B V0.1 (mistralai/Mixtral-8x7B-v0.1)
Model Size: 46.7b
Required VRAM: 93.6 GB
Updated: 2024-07-04
Maintainer: NousResearch
Model Type: mixtral
Model Files: 19 sharded safetensors files (4.9 GB: 1-of-19, 5.0 GB: 2-of-19, 5.0 GB: 3-of-19, 4.9 GB: 4-of-19, 5.0 GB: 5-of-19, 5.0 GB: 6-of-19, 4.9 GB: 7-of-19, 5.0 GB: 8-of-19, 5.0 GB: 9-of-19, 4.9 GB: 10-of-19, 5.0 GB: 11-of-19, 5.0 GB: 12-of-19, 5.0 GB: 13-of-19, 4.9 GB: 14-of-19, 5.0 GB: 15-of-19, 5.0 GB: 16-of-19, 4.9 GB: 17-of-19, 5.0 GB: 18-of-19, 4.2 GB: 19-of-19)
Supported Languages: en
Model Architecture: MixtralForCausalLM
License: apache-2.0
Context Length: 32768
Model Max Length: 32768
Transformers Version: 4.36.0.dev0
Tokenizer Class: LlamaTokenizer
Padding Token: </s>
Vocabulary Size: 32002
Initializer Range: 0.02
Torch Data Type: bfloat16
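Putting the internals above together, here is a minimal sketch for loading the full-precision checkpoint with Transformers (MixtralForCausalLM, bfloat16, 32K context; ~93.6 GB of VRAM per the table). It assumes the tokenizer ships the ChatML chat template the tags advertise.

```python
# Minimal sketch: loading the checkpoint with the settings listed above.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "NousResearch/Nous-Hermes-2-Mixtral-8x7B-SFT"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # matches "Torch Data Type: bfloat16" above
    device_map="auto",           # shards the 19 safetensors files across devices
)

# Assumes the tokenizer config includes a ChatML chat template.
messages = [{"role": "user", "content": "Hello, who are you?"}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

output = model.generate(inputs, max_new_tokens=128)
print(tokenizer.decode(output[0][inputs.shape[-1]:], skip_special_tokens=True))
```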


Original data from HuggingFace, OpenCompass and various public git repos.
Release v2024042801