Hermes 3 Llama 3.1 8B By NousResearch: Benchmarks, Features and Detailed Analysis. Insights on Hermes 3 Llama 3.1 8B.

Arxiv:2408.11857 Autotrain compatible Axolotl Base model:finetune:meta-llama... Base model:meta-llama/llama-3.... Chat Chatml Conversational Distillation En Endpoints compatible Finetuned Function calling Gpt4 Instruct Json mode Llama Llama-3 Region:us Roleplaying Safetensors Sharded Synthetic data Tensorflow

Model Card on HF 🤗: https://huggingface.co/NousResearch/Hermes-3-Llama-3.1-8B

Hermes 3 Llama 3.1 8B Benchmarks

MMLU Pro: 23.77

GPQA: 6.38

MUSR: 13.62

BBH: 30.72

IFEval: 61.7 vs 88 (so35)^-29.9%

MATH Lvl 5: 4.76

LLME Score: 0.38966

^nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").

What is the LLM Explorer Rank (Score)

Hermes 3 Llama 3.1 8B (NousResearch/Hermes-3-Llama-3.1-8B)

Hermes 3 Llama 3.1 8B Parameters and Internals

Model Type

generalist language model

Use Cases

Areas:

Research, Commercial applications

Applications:

Generalist assistant tasks, Code generation

Primary Use Cases:

Function calling, Roleplaying, Multi-turn conversation

Additional Notes

Capable of structured outputs and function calling, enhances roleplaying and multi-turn interactions.

Training Details

Methodology:

Advanced techniques, including function calling and structured output capabilities.

Model Architecture:

Based on Llama-3.1 architecture with numerous enhancements for reasoning, multi-turn conversation, and code generation.

Input Output

Input Format:

ChatML prompt format

Accepted Modalities:

text

Output Format:

Text output, structured JSON outputs with system prompts for specific applications.

Performance Tips:

Familiarity with ChatGPT API format is beneficial for optimal model steering.

LLM Name	Hermes 3 Llama 3.1 8B
Repository 🤗	https://huggingface.co/NousResearch/Hermes-3-Llama-3.1-8B
Base Model(s)	meta-llama/Meta-Llama-3.1-8B meta-llama/Meta-Llama-3.1-8B
Model Size	8b
Required VRAM	16.1 GB
Updated	2025-02-22
Maintainer	NousResearch
Model Type	llama
Model Files	5.0 GB: 1-of-4 5.0 GB: 2-of-4 4.9 GB: 3-of-4 1.2 GB: 4-of-4
Supported Languages	en
Model Architecture	LlamaForCausalLM
License	llama3
Context Length	131072
Model Max Length	131072
Transformers Version	4.44.0.dev0
Tokenizer Class	PreTrainedTokenizerFast
Padding Token	<\|im_end\|>
Vocabulary Size	128256
Torch Data Type	bfloat16

Quantized Models of the Hermes 3 Llama 3.1 8B

Model	Likes	Downloads	VRAM
Hermes 3 Llama 3.1 8B Bnb 4bit	5	3597	5 GB
... Horizon AI Korean Advanced 8B	0	58	16 GB

Best Alternatives to Hermes 3 Llama 3.1 8B

Best Alternatives	Context / RAM	Downloads	Likes
...a 3 8B Instruct Gradient 1048K	1024K / 16.1 GB	3927	680
MrRoboto ProLong 8B V4i	1024K / 16.1 GB	66	1
...o ProLongBASE Pt8 Unaligned 8B	1024K / 16.1 GB	24	0
MrRoboto BASE V2 Unholy 8B 64K	1024K / 16.1 GB	27	1
Mpasila Viking 8B	1024K / 16.1 GB	84	0
Thor V1.4 8B DARK FICTION	1024K / 16.1 GB	941	2
4	1024K / 16.1 GB	322	0
Hel V2 8B DARK FICTION	1024K / 16.1 GB	22	0
16	1024K / 16.1 GB	169	0
...di95 LewdStorytellerMix 8B 64K	1024K / 16.1 GB	69	2

Rank the Hermes 3 Llama 3.1 8B Capabilities

🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟

Instruction Following and Task Automation
Factuality and Completeness of Knowledge
Censorship and Alignment
Data Analysis and Insight Generation
Text Generation
Text Summarization and Feature Extraction
Code Generation
Multi-Language Support and Translation

What open-source LLMs or SLMs are you in search of? 43470 in total.

Email us: info@extractum.io. Our Privacy Policy | Terms and Conditions | Suggest an improvement.

Our Social Media →

Original data from HuggingFace, OpenCompass and various public git repos.

Release v20241227

Support LLM Explorer