LLM Explorer: A Curated Large Language Model Directory and Analytics

Mistral 8x7b Chat by mattshumer

Which open-source LLMs or SLMs are you looking for? 18,857 in total.


Tags: Autotrain compatible · Custom code · Endpoints compatible · Has space · Mistral · MoE · PyTorch · Region: US · Sharded

Rank the Mistral 8x7b Chat Capabilities

🆘 Have you tried this model? Rate its performance. Your feedback helps the ML community identify the most suitable model for their needs. Your contribution really does make a difference! 🌟

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  
Mistral 8x7b Chat (mattshumer/mistral-8x7b-chat)

Best Alternatives to Mistral 8x7b Chat

Best Alternatives      | HF Rank | Context/RAM     | Downloads | Likes
Multimaster 7B V2      | 73.33   | 32K / 142.5 GB  | 323       | 0
Amadeus V0.1           | 71.42   | 8K / 48.3 GB    | 2047      | 6
TinyLlama ClownCar     |         | 32K / 3.7 GB    | 6         | 0
Calm2 7B Chat 7B MoE   |         | 32K / 22.7 GB   | 7         | 1
Evolorxa 13B           |         | 32K / 25.8 GB   | 20        | 0
BioMistral Prompted    |         | 32K / 25.8 GB   | 6         | 0
CrystalMistral 24B     |         | 32K / 48.3 GB   | 12        | 0
Mixtral 7B 8expert     |         | 32K / 93.6 GB   | 182291    | 255
Synthia MoE V3 CE      |         | 32K / 93.6 GB   | 41        | 5
Sonya 7B X8 MoE        |         | 8K / 93.5 GB    | 5         | 1
Note: a green score (e.g. "73.2") indicates that the model outperforms mattshumer/mistral-8x7b-chat.
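
The download and like counts in the table above are standard Hugging Face Hub metadata and can be pulled programmatically. A minimal sketch using the huggingface_hub client (attribute names follow its ModelInfo object; live counts will differ from the snapshot above):

```python
# Minimal sketch: fetch live download/like counts for a model listed above.
from huggingface_hub import HfApi

api = HfApi()
info = api.model_info("mattshumer/mistral-8x7b-chat")
print(f"downloads={info.downloads}, likes={info.likes}")
```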

Mistral 8x7b Chat Parameters and Internals

LLM Name: Mistral 8x7b Chat
Repository: mattshumer/mistral-8x7b-chat (open on 🤗 Hugging Face)
Model Size: 7b
Required VRAM: 93.6 GB
Updated: 2024-02-28
Maintainer: mattshumer
Model Type: mistral
Model Files: 19 shards: 4.9 GB (1-of-19), 5.0 GB (2-of-19), 5.0 GB (3-of-19), 4.9 GB (4-of-19), 5.0 GB (5-of-19), 5.0 GB (6-of-19), 4.9 GB (7-of-19), 5.0 GB (8-of-19), 5.0 GB (9-of-19), 4.9 GB (10-of-19), 5.0 GB (11-of-19), 5.0 GB (12-of-19), 5.0 GB (13-of-19), 4.9 GB (14-of-19), 5.0 GB (15-of-19), 5.0 GB (16-of-19), 4.9 GB (17-of-19), 5.0 GB (18-of-19), 4.2 GB (19-of-19)
Model Architecture: MixtralForCausalLM
Context Length: 32768
Model Max Length: 32768
Transformers Version: 4.36.0.dev0
Tokenizer Class: LlamaTokenizer
Padding Token: </s>
Vocabulary Size: 32001
Initializer Range: 0.02
Torch Data Type: bfloat16
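
The internals above map directly onto the standard Hugging Face transformers loading path. Below is a minimal sketch, assuming transformers >= 4.36 (the listing reports 4.36.0.dev0, the first line with Mixtral support) and the accelerate package for device_map; the "Custom code" tag above suggests trust_remote_code=True may also be needed:

```python
# Minimal loading sketch for mattshumer/mistral-8x7b-chat.
# Assumptions: transformers >= 4.36 (MixtralForCausalLM support), accelerate
# installed for device_map="auto", and enough memory for the bf16 weights
# (93.6 GB total; at 2 bytes/param in bfloat16, roughly 46.8B parameters).
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

repo_id = "mattshumer/mistral-8x7b-chat"

# LlamaTokenizer with a 32001-entry vocabulary and </s> as padding token,
# per the listing above.
tokenizer = AutoTokenizer.from_pretrained(repo_id)

model = AutoModelForCausalLM.from_pretrained(
    repo_id,
    torch_dtype=torch.bfloat16,  # matches the stored Torch data type
    device_map="auto",           # resolves the 19 shards across GPUs/CPU
    # trust_remote_code=True,    # may be required given the "Custom code" tag
)

prompt = "Briefly explain mixture-of-experts routing."
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=128)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```

The 32768 context length comes from the model config, so prompts up to that many tokens are accepted without further changes.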
Original data from Hugging Face, OpenCompass, and various public git repositories.
Release v2024022003