Llama 2 7B GGUF by TheBloke

 ยป  All LLMs  ยป  TheBloke  ยป  Llama 2 7B GGUF   URL Share it on

  Arxiv:2307.09288 Base model:meta-llama/llama-2-... Base model:quantized:meta-llam...   En   Facebook   Gguf   Llama   Llama2   Meta   Pytorch   Quantized   Region:us

Llama 2 7B GGUF Benchmarks

nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").

Llama 2 7B GGUF Parameters and Internals

Model Type 
text-generation
Use Cases 
Areas:
Commercial applications, Research
Applications:
Natural language generation tasks
Primary Use Cases:
Assistant-like chat
Limitations:
Use in languages other than English, Violation of applicable laws and regulations
Considerations:
Specific formatting required for expected performance in chat versions
Additional Notes 
Quantization methods include options like Q2_K, Q3_K, Q4_K, etc., for trade-offs between memory size and model accuracy.
Supported Languages 
English (unknown proficiency level)
Training Details 
Data Sources:
A new mix of publicly available online data
Data Volume:
2T tokens
Methodology:
Pretraining and fine-tuning with supervised fine-tuning and reinforcement learning with human feedback
Context Length:
4096
Training Time:
January 2023 to July 2023
Hardware Used:
A100-80GB GPUs
Model Architecture:
Auto-regressive language model with an optimized transformer architecture
Safety Evaluation 
Methodologies:
Internal evaluations library
Risk Categories:
Misinformation, Bias
Ethical Considerations:
Testing in languages other than English not conducted
Responsible Ai Considerations 
Fairness:
Testing conducted for fairness, but not exhaustive
Transparency:
Model card available with detailed information
Accountability:
Developers accountable for safe deployment of applications
Mitigation Strategies:
Use Responsible Use Guide for deployment
Input Output 
Input Format:
Text input with special token formatting
Accepted Modalities:
text
Output Format:
Text generation
Performance Tips:
Follow recommended formatting with special tokens for chat models.
Release Notes 
Version:
GGUF format introduced on August 21st 2023
Date:
2023-08-21
Notes:
New format for improved tokenization and metadata support.
Version:
Macro-scaling models with parameter variations (7B, 13B, 70B)
Notes:
Pretrained and fine-tuned generative text models available.
LLM NameLlama 2 7B GGUF
Repository ๐Ÿค—https://huggingface.co/TheBloke/Llama-2-7B-GGUF 
Model NameLlama 2 7B
Model CreatorMeta
Base Model(s)  Llama 2 7B Hf   meta-llama/Llama-2-7b-hf
Model Size7b
Required VRAM2.8 GB
Updated2024-11-19
MaintainerTheBloke
Model Typellama
Model Files  2.8 GB   3.6 GB   3.3 GB   3.0 GB   3.8 GB   4.1 GB   3.9 GB   4.7 GB   4.8 GB   4.7 GB   5.5 GB   7.2 GB
Supported Languagesen
GGUF QuantizationYes
Quantization Typegguf
Model ArchitectureAutoModel
Licensellama2
Llama 2 7B GGUF (TheBloke/Llama-2-7B-GGUF)

Best Alternatives to Llama 2 7B GGUF

Best Alternatives
Context / RAM
Downloads
Likes
Pixel8K / 4.4 GB150
Mistral 7B Instruct V0.3 GGUF0K / 1.6 GB191046262
Qwen2 7B Instruct GGUF0K / 1.9 GB20862858
WizardLM 2 7B GGUF0K / 2.7 GB185465374
Mistral 7B Instruct V0.2 GGUF0K / 3.1 GB137973398
Mistral 7B Instruct V0.3 GGUF0K / 2.7 GB571407
Qwen2 7B Instruct V0.6 GGUF0K / 4.5 GB135220
Mistral 7B Instruct V0.1 GGUF0K / 3.1 GB188536510
Qwen2 7B Instruct V0.1 GGUF0K / 4.5 GB97140
Qwen2 7B Instruct V0.7 GGUF0K / 4.5 GB95300
Note: green Score (e.g. "73.2") means that the model is better than TheBloke/Llama-2-7B-GGUF.

Rank the Llama 2 7B GGUF Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 38100 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241110