Llama3 Instruct 8B by AgentPublic

 ยป  All LLMs  ยป  AgentPublic  ยป  Llama3 Instruct 8B   URL Share it on

  Autotrain compatible   Conversational   En   Endpoints compatible   Facebook   Instruct   Llama   Llama-3   Meta   Pytorch   Region:us   Safetensors   Sharded   Tensorflow

Llama3 Instruct 8B Benchmarks

nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").
Llama3 Instruct 8B (AgentPublic/llama3-instruct-8b)

Llama3 Instruct 8B Parameters and Internals

Model Type 
text generation, instruction tuned
Use Cases 
Areas:
commercial, research
Applications:
instruction-tuned for chat applications
Primary Use Cases:
chat-oriented generative tasks
Limitations:
Not suitable for language other than English, restricted under Acceptable Use Policy
Considerations:
Developers encouraged to implement safety assessments for specific applications.
Additional Notes 
Model supports fine-tuning for languages beyond English under compliance with the license and use policy.
Supported Languages 
en (Native)
Training Details 
Data Sources:
publicly available online data
Data Volume:
15 trillion tokens
Methodology:
supervised fine-tuning (SFT) and reinforcement learning with human feedback (RLHF)
Context Length:
8000
Training Time:
7.7M GPU hours on H100-80GB
Hardware Used:
H100-80GB GPUs
Model Architecture:
optimized transformer architecture
Safety Evaluation 
Methodologies:
red teaming, adversarial evaluations
Findings:
residual risks remain, model refusals reduced
Risk Categories:
CBRNE, cybersecurity, child safety
Ethical Considerations:
Ethical considerations include avoiding misuse of AI in harmful areas.
Responsible Ai Considerations 
Fairness:
Model is optimized to balance helpfulness and alignment, with considerations for avoiding biases.
Transparency:
Open source release with detailed documentation and responsible use guidelines.
Accountability:
Users must comply with license terms and acceptable use policy.
Mitigation Strategies:
Safety tools like Meta Llama Guard and Code Shield provided.
Input Output 
Input Format:
text input
Accepted Modalities:
text
Output Format:
text and code output
Performance Tips:
Use appropriate hardware and fine-tuning methods for optimal performance.
Release Notes 
Version:
April 18, 2024
Date:
2024-04-18
Notes:
Initial release of Llama 3 models.
LLM NameLlama3 Instruct 8B
Repository ๐Ÿค—https://huggingface.co/AgentPublic/llama3-instruct-8b 
Model Size8b
Required VRAM16.1 GB
Updated2025-02-22
MaintainerAgentPublic
Model Typellama
Instruction-BasedYes
Model Files  5.0 GB: 1-of-4   5.0 GB: 2-of-4   4.9 GB: 3-of-4   1.2 GB: 4-of-4
Supported Languagesen
Model ArchitectureLlamaForCausalLM
Licenseother
Context Length8192
Model Max Length8192
Transformers Version4.40.0.dev0
Tokenizer ClassPreTrainedTokenizerFast
Vocabulary Size128256
Torch Data Typebfloat16

Quantized Models of the Llama3 Instruct 8B

Model
Likes
Downloads
VRAM
Llama3 Webinstruct 8bit059 GB

Best Alternatives to Llama3 Instruct 8B

Best Alternatives
Context / RAM
Downloads
Likes
...a 3 8B Instruct Gradient 1048K1024K / 16.1 GB3927680
Mpasila Viking 8B1024K / 16.1 GB840
Hel V2 8B DARK FICTION1024K / 16.1 GB220
161024K / 16.1 GB1690
...di95 LewdStorytellerMix 8B 64K1024K / 16.1 GB692
Because Im Bored Nsfw11024K / 16.1 GB361
121024K / 16.1 GB600
MrRoboto ProLong 8B V4b1024K / 16.1 GB1070
MrRoboto ProLong 8B V1a1024K / 16.1 GB1080
MrRoboto ProLong 8B V2a1024K / 16.1 GB1020
Note: green Score (e.g. "73.2") means that the model is better than AgentPublic/llama3-instruct-8b.

Rank the Llama3 Instruct 8B Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 43470 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227