Meta Llama 3 8B Instruct by rajkosto

 ยป  All LLMs  ยป  rajkosto  ยป  Meta Llama 3 8B Instruct   URL Share it on

  Autotrain compatible   Conversational   En   Endpoints compatible   Facebook   Gguf   Instruct   Llama   Llama-3   Meta   Pytorch   Quantized   Region:us   Safetensors

Meta Llama 3 8B Instruct Benchmarks

Meta Llama 3 8B Instruct (rajkosto/Meta-Llama-3-8B-Instruct)

Meta Llama 3 8B Instruct Parameters and Internals

Model Type 
text generation, instruction tuned
Use Cases 
Areas:
commercial, research
Applications:
instruction-tuned for chat applications
Primary Use Cases:
chat-oriented generative tasks
Limitations:
Not suitable for language other than English, restricted under Acceptable Use Policy
Considerations:
Developers encouraged to implement safety assessments for specific applications.
Additional Notes 
Model supports fine-tuning for languages beyond English under compliance with the license and use policy.
Supported Languages 
en (Native)
Training Details 
Data Sources:
publicly available online data
Data Volume:
15 trillion tokens
Methodology:
supervised fine-tuning (SFT) and reinforcement learning with human feedback (RLHF)
Context Length:
8000
Training Time:
7.7M GPU hours on H100-80GB
Hardware Used:
H100-80GB GPUs
Model Architecture:
optimized transformer architecture
Safety Evaluation 
Methodologies:
red teaming, adversarial evaluations
Findings:
residual risks remain, model refusals reduced
Risk Categories:
CBRNE, cybersecurity, child safety
Ethical Considerations:
Ethical considerations include avoiding misuse of AI in harmful areas.
Responsible Ai Considerations 
Fairness:
Model is optimized to balance helpfulness and alignment, with considerations for avoiding biases.
Transparency:
Open source release with detailed documentation and responsible use guidelines.
Accountability:
Users must comply with license terms and acceptable use policy.
Mitigation Strategies:
Safety tools like Meta Llama Guard and Code Shield provided.
Input Output 
Input Format:
text input
Accepted Modalities:
text
Output Format:
text and code output
Performance Tips:
Use appropriate hardware and fine-tuning methods for optimal performance.
Release Notes 
Version:
April 18, 2024
Date:
2024-04-18
Notes:
Initial release of Llama 3 models.
LLM NameMeta Llama 3 8B Instruct
Repository ๐Ÿค—https://huggingface.co/rajkosto/Meta-Llama-3-8B-Instruct 
Base Model(s)  mpasila/Meta-Llama-3-11.5B-Instruct   mpasila/Meta-Llama-3-11.5B-Instruct
Model Size8b
Required VRAM4.9 GB
Updated2025-02-22
Maintainerrajkosto
Model Typellama
Instruction-BasedYes
Model Files  4.9 GB   6.6 GB   16.1 GB
Supported Languagesen
GGUF QuantizationYes
Quantization Typegguf
Model ArchitectureLlamaForCausalLM
Licenseother
Context Length8192
Model Max Length8192
Transformers Version4.40.0.dev0
Tokenizer ClassPreTrainedTokenizerFast
Vocabulary Size128256
Torch Data Typebfloat16

Best Alternatives to Meta Llama 3 8B Instruct

Best Alternatives
Context / RAM
Downloads
Likes
...truct Gradient 1048K IMat GGUF1024K / 2 GB3506
...B Instruct Gradient 1048K GGUF1024K / 3.2 GB1523
Llama 3 8B Instruct 262K GGUF256K / 3.2 GB1042
... 8B Instruct Reasoner 1o1 V0.3128K / 16.1 GB4177
Nsfw Plz Gguf Me128K / 16.1 GB383
...lama 3.1 Cantonese 8B Instruct128K / 16.1 GB1165
Alpha R S V2 Q8 0 GGUF39K / 8.5 GB210
SmolTulu 1.7B Instruct8K / 3.4 GB19413
Llama 3 Cantonese 8B Instruct8K / 16.1 GB2566
...ama3 8B Chinese Chat GGUF 8bit8K / 8.5 GB928165
Note: green Score (e.g. "73.2") means that the model is better than rajkosto/Meta-Llama-3-8B-Instruct.

Rank the Meta Llama 3 8B Instruct Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 43470 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227