Llama 3.1 8B Instruct by meta-llama

 ยป  All LLMs  ยป  meta-llama  ยป  Llama 3.1 8B Instruct   URL Share it on

  Arxiv:2204.05149   Autotrain compatible Base model:finetune:meta-llama... Base model:meta-llama/llama-3....   Conversational   De   En   Endpoints compatible   Es   Facebook   Fr   Hi   Instruct   It   Llama   Llama-3   Meta   Pt   Pytorch   Region:us   Safetensors   Sharded   Tensorflow   Th

Llama 3.1 8B Instruct Benchmarks

nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").

Llama 3.1 8B Instruct Parameters and Internals

Model Type 
text generation, multilingual
Use Cases 
Areas:
Commercial, Research
Applications:
Assistant-like chat, Multilingual dialogue, Synthetic data generation
Primary Use Cases:
Instruction tuning for assistant-like chat
Limitations:
Use in unsupported languages without controls, Violations of applicable laws or the Acceptable Use Policy
Considerations:
Developers should fine-tune Llama 3.1 models for additional languages responsibly.
Additional Notes 
Developers can customize model deployment using available recipes and guidelines
Supported Languages 
en (English), de (German), fr (French), it (Italian), pt (Portuguese), hi (Hindi), es (Spanish), th (Thai)
Training Details 
Data Sources:
Publicly available online data
Data Volume:
~15 trillion tokens
Methodology:
supervised fine-tuning (SFT) and reinforcement learning with human feedback (RLHF)
Context Length:
128000
Training Time:
39.3M GPU hours
Hardware Used:
H100-80GB GPUs
Model Architecture:
Auto-regressive language model using an optimized transformer architecture
Safety Evaluation 
Methodologies:
Safety fine-tuning, Red teaming
Findings:
Model must be deployed with system-level safeguards
Risk Categories:
Misinformation, Bias, Child Safety, Cybersecurity risks
Ethical Considerations:
Avoid using in unsupported languages without fine-tuning and system controls.
Responsible Ai Considerations 
Fairness:
Focus on multilingual safety and fairness across different languages
Transparency:
Clear guidelines and resources provided for deployment
Accountability:
Developers must deploy safeguards when building with the model
Mitigation Strategies:
Incorporation of safety mitigations, domain-specific evaluations
Input Output 
Input Format:
Multilingual text and multilingual text with code
Accepted Modalities:
text
Output Format:
Text, including multilingual text and code
Performance Tips:
Use transformers or llama codebase for generation
Release Notes 
Version:
3.1
Date:
2024-07-23
Notes:
Introduction of multilingual support and longer context window.
LLM NameLlama 3.1 8B Instruct
Repository ๐Ÿค—https://huggingface.co/meta-llama/Llama-3.1-8B-Instruct 
Base Model(s)  meta-llama/Meta-Llama-3.1-8B   meta-llama/Meta-Llama-3.1-8B
Model Size8b
Required VRAM16.1 GB
Updated2024-11-21
Maintainermeta-llama
Model Typellama
Instruction-BasedYes
Model Files  5.0 GB: 1-of-4   5.0 GB: 2-of-4   4.9 GB: 3-of-4   1.2 GB: 4-of-4
Supported Languagesen de fr it pt hi es th
Model ArchitectureLlamaForCausalLM
Licensellama3.1
Context Length131072
Model Max Length131072
Transformers Version4.42.3
Tokenizer ClassPreTrainedTokenizerFast
Vocabulary Size128256
Torch Data Typebfloat16
Llama 3.1 8B Instruct (meta-llama/Llama-3.1-8B-Instruct)

Quantized Models of the Llama 3.1 8B Instruct

Model
Likes
Downloads
VRAM
ProductLlama V2012416 GB
...ma 3.1 8B Instruct XMADai 4bit3210 GB

Best Alternatives to Llama 3.1 8B Instruct

Best Alternatives
Context / RAM
Downloads
Likes
...a 3 8B Instruct Gradient 1048K1024K / 16.1 GB19347675
L3.1 Gradient1024K / 16.1 GB90
...SLERP Gradient1048k OpenBioLLM1024K / 16.1 GB270
...lama3 8B Special Dark V3.1.2aa1024K / 16.1 GB130
Llama3 8B Special Dark V3.1.2B1024K / 16.1 GB120
...lama3 8B Special Dark V3.1.1yy1024K / 16.1 GB140
Loki1024K / 16.1 GB90
Unholy Thoth 8B V21024K / 16.1 GB120
...struct Gradient 1048K MAC Lora1024K / 5.9 GB162
... Instruct Gradient 1048K Agent1024K / 16.1 GB771

Rank the Llama 3.1 8B Instruct Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 38149 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241110