Hermes Mixtral Instruct Seqlen 4096 Bs 4 Optimum 0 0 23 by aws-neuron


Autotrain compatible, Conversational, Endpoints compatible, Instruct, Mixtral, Moe, Region:us, Safetensors

Hermes Mixtral Instruct Seqlen 4096 Bs 4 Optimum 0 0 23 (aws-neuron/hermes-mixtral-instruct-seqlen-4096-bs-4-optimum-0-0-23)

Best Alternatives to Hermes Mixtral Instruct Seqlen 4096 Bs 4 Optimum 0 0 23

Best Alternatives                  | HF Rank | Context / RAM | Downloads/Likes
-----------------------------------|---------|---------------|----------------
...es Mixtral 8x7B 2.4bpw H6 EXL2  | 68.2    | 32K / 14.3 GB | 22
...es Mixtral 8x7B 3.0bpw H6 EXL2  | 68.2    | 32K / 17.8 GB | 21
Notux 8x7b V1.3.5bpw EXL2          | 68.2    | 32K / 20.7 GB | 22
Notux 8x7b V1.3.5bpw H6 EXL2       | 68.2    | 32K / 20.7 GB | 21
...es Mixtral 8x7B 6.0bpw H6 EXL2  | 68.2    | 32K / 35.3 GB | 21
...ixtral 8x7B Instruct V0.1 GGUF  | 68.2    | 32K / 17.3 GB | 3585
... 8x22B V0.1 Instruct Sft En De  |         | 64K / 212 GB  | 21
...eqlen 4096 Bs 4 Optimum 0 0 23  |         | 32K           | 2431
...al 8x7B Instruct V0.1 GPT Fast  |         | 32K           | 31
Mixtral 8x7B Instruct V0.1         |         | 32K           | 50
Note: green Score (e.g. "73.2") means that the model is better than aws-neuron/hermes-mixtral-instruct-seqlen-4096-bs-4-optimum-0-0-23.

Hermes Mixtral Instruct Seqlen 4096 Bs 4 Optimum 0 0 23 Parameters and Internals

LLM Name: Hermes Mixtral Instruct Seqlen 4096 Bs 4 Optimum 0 0 23
Repository: aws-neuron/hermes-mixtral-instruct-seqlen-4096-bs-4-optimum-0-0-23 (Hugging Face)
Updated: 2024-07-07
Maintainer: aws-neuron
Model Type: mixtral
Instruction-Based: Yes
Model Architecture: MixtralForCausalLM
Context Length: 32768
Model Max Length: 32768
Transformers Version: 4.41.2
Tokenizer Class: LlamaTokenizer
Padding Token: </s>
Vocabulary Size: 32002
Initializer Range: 0.02
Torch Data Type: bfloat16
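
The repository name appears to encode export settings (sequence length, batch size, Optimum version) that the parameter list does not show separately. The sketch below pulls those values out of the slug so they can be compared with the listed context length of 32768. The naming pattern and the `parse_slug` helper are assumptions made for illustration, not a documented convention of the maintainer.

```python
import re

# Repository identifier as listed on this page.
REPO_ID = "aws-neuron/hermes-mixtral-instruct-seqlen-4096-bs-4-optimum-0-0-23"

# Context length as listed in the parameters above.
LISTED_CONTEXT_LENGTH = 32768

# Hypothetical pattern: assumes the slug ends with
# "seqlen-<int>-bs-<int>-optimum-<major>-<minor>-<patch>".
_SLUG_PATTERN = re.compile(
    r"seqlen-(?P<seqlen>\d+)-bs-(?P<bs>\d+)-optimum-(?P<version>\d+-\d+-\d+)$"
)


def parse_slug(repo_id: str) -> dict:
    """Extract the settings that the slug appears to encode."""
    match = _SLUG_PATTERN.search(repo_id)
    if match is None:
        raise ValueError(f"slug does not follow the assumed pattern: {repo_id}")
    return {
        "sequence_length": int(match["seqlen"]),
        "batch_size": int(match["bs"]),
        "optimum_version": match["version"].replace("-", "."),
    }


settings = parse_slug(REPO_ID)
print(settings)

# The slug's sequence length is smaller than the listed context length,
# so the two values should not be treated as interchangeable.
print(settings["sequence_length"] < LISTED_CONTEXT_LENGTH)
```

If the assumed pattern holds, the sequence length in the name (4096) is well below the listed context length (32768), so it is worth checking which limit applies before sizing prompts.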

Original data from HuggingFace, OpenCompass and various public git repos.
Release v2024042801