Llama 3.1 8B Instruct by meta-llama

 ยป  All LLMs  ยป  meta-llama  ยป  Llama 3.1 8B Instruct   URL Share it on

  Arxiv:2204.05149   Autotrain compatible Base model:finetune:meta-llama... Base model:meta-llama/llama-3....   Conversational   De   En   Endpoints compatible   Es   Facebook   Fr   Hi   Instruct   It   Llama   Llama-3   Meta   Pt   Pytorch   Region:us   Safetensors   Sharded   Tensorflow   Th

Llama 3.1 8B Instruct Benchmarks

nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").
Llama 3.1 8B Instruct (meta-llama/Llama-3.1-8B-Instruct)

Llama 3.1 8B Instruct Parameters and Internals

Model Type 
text generation, multilingual
Use Cases 
Areas:
Commercial, Research
Applications:
Assistant-like chat, Multilingual dialogue, Synthetic data generation
Primary Use Cases:
Instruction tuning for assistant-like chat
Limitations:
Use in unsupported languages without controls, Violations of applicable laws or the Acceptable Use Policy
Considerations:
Developers should fine-tune Llama 3.1 models for additional languages responsibly.
Additional Notes 
Developers can customize model deployment using available recipes and guidelines
Supported Languages 
en (English), de (German), fr (French), it (Italian), pt (Portuguese), hi (Hindi), es (Spanish), th (Thai)
Training Details 
Data Sources:
Publicly available online data
Data Volume:
~15 trillion tokens
Methodology:
supervised fine-tuning (SFT) and reinforcement learning with human feedback (RLHF)
Context Length:
128000
Training Time:
39.3M GPU hours
Hardware Used:
H100-80GB GPUs
Model Architecture:
Auto-regressive language model using an optimized transformer architecture
Safety Evaluation 
Methodologies:
Safety fine-tuning, Red teaming
Findings:
Model must be deployed with system-level safeguards
Risk Categories:
Misinformation, Bias, Child Safety, Cybersecurity risks
Ethical Considerations:
Avoid using in unsupported languages without fine-tuning and system controls.
Responsible Ai Considerations 
Fairness:
Focus on multilingual safety and fairness across different languages
Transparency:
Clear guidelines and resources provided for deployment
Accountability:
Developers must deploy safeguards when building with the model
Mitigation Strategies:
Incorporation of safety mitigations, domain-specific evaluations
Input Output 
Input Format:
Multilingual text and multilingual text with code
Accepted Modalities:
text
Output Format:
Text, including multilingual text and code
Performance Tips:
Use transformers or llama codebase for generation
Release Notes 
Version:
3.1
Date:
2024-07-23
Notes:
Introduction of multilingual support and longer context window.
LLM NameLlama 3.1 8B Instruct
Repository ๐Ÿค—https://huggingface.co/meta-llama/Llama-3.1-8B-Instruct 
Base Model(s)  meta-llama/Meta-Llama-3.1-8B   meta-llama/Meta-Llama-3.1-8B
Model Size8b
Required VRAM16.1 GB
Updated2025-02-21
Maintainermeta-llama
Model Typellama
Instruction-BasedYes
Model Files  5.0 GB: 1-of-4   5.0 GB: 2-of-4   4.9 GB: 3-of-4   1.2 GB: 4-of-4
Supported Languagesen de fr it pt hi es th
Model ArchitectureLlamaForCausalLM
Licensellama3.1
Context Length131072
Model Max Length131072
Transformers Version4.42.3
Tokenizer ClassPreTrainedTokenizerFast
Vocabulary Size128256
Torch Data Typebfloat16

Quantized Models of the Llama 3.1 8B Instruct

Model
Likes
Downloads
VRAM
...Llama 3.1 8B Instruct Bnb 4bit567708515 GB
ClimateLlama 8B TR22616 GB
...Llama 3.1 8B BG Reasoning V0.122816 GB
...ma 3.1 8B Instruct XMADai 4bit3210 GB

Best Alternatives to Llama 3.1 8B Instruct

Best Alternatives
Context / RAM
Downloads
Likes
...a 3 8B Instruct Gradient 1048K1024K / 16.1 GB3942680
Mpasila Viking 8B1024K / 16.1 GB840
Hel V2 8B DARK FICTION1024K / 16.1 GB220
161024K / 16.1 GB1690
...di95 LewdStorytellerMix 8B 64K1024K / 16.1 GB692
Because Im Bored Nsfw11024K / 16.1 GB541
121024K / 16.1 GB600
MrRoboto ProLong 8B V4b1024K / 16.1 GB1070
MrRoboto ProLong 8B V1a1024K / 16.1 GB1080
MrRoboto ProLong 8B V2a1024K / 16.1 GB1020

Rank the Llama 3.1 8B Instruct Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 43432 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227