CodeLlama 13B AWQ by TheBloke

 ยป  All LLMs  ยป  TheBloke  ยป  CodeLlama 13B AWQ   URL Share it on

  Arxiv:2308.12950   4-bit   Autotrain compatible   Awq Base model:codellama/codellama... Base model:quantized:codellama...   Code   Codegen   Llama   Llama2   Quantized   Region:us   Safetensors

CodeLlama 13B AWQ Benchmarks

nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").
CodeLlama 13B AWQ (TheBloke/CodeLlama-13B-AWQ)

CodeLlama 13B AWQ Parameters and Internals

Model Type 
text-generation, code synthesis
Use Cases 
Areas:
commercial, research
Applications:
code synthesis, code understanding
Primary Use Cases:
general code generation
Limitations:
Use only in English, Violations of applicable laws
Considerations:
Perform safety testing tailored to specific applications.
Supported Languages 
English (general proficiency)
Training Details 
Data Sources:
Llama 2 data with different weights
Data Volume:
See Section 2 and Table 1 in the research paper
Methodology:
Auto-regressive transformer architecture
Context Length:
4096
Training Time:
400K GPU hours of computation
Hardware Used:
A100-80GB GPUs
Model Architecture:
Optimized transformer architecture
Safety Evaluation 
Methodologies:
English testing
Findings:
Model may produce inaccurate or objectionable responses
Risk Categories:
misinformation, objectionable content
Ethical Considerations:
Responsible Use Guide is available.
Responsible Ai Considerations 
Fairness:
Testing in English, broader scenarios not covered.
Transparency:
Model details are public via research paper.
Accountability:
Developers should perform safety testing tailored to specific applications.
Mitigation Strategies:
Strong emphasis on safety evaluations and responsible use.
Input Output 
Input Format:
Text
Accepted Modalities:
text
Output Format:
Text
Performance Tips:
Utilize optimized transformer architecture for faster inference.
LLM NameCodeLlama 13B AWQ
Repository ๐Ÿค—https://huggingface.co/TheBloke/CodeLlama-13B-AWQ 
Model NameCodeLlama 13B
Model CreatorMeta
Base Model(s)  CodeLlama 13B Hf   codellama/CodeLlama-13b-hf
Model Size13b
Required VRAM7.2 GB
Updated2024-12-22
MaintainerTheBloke
Model Typellama
Model Files  7.2 GB
Supported Languagescode
AWQ QuantizationYes
Quantization Typeawq
Generates CodeYes
Model ArchitectureLlamaForCausalLM
Licensellama2
Context Length16384
Model Max Length16384
Transformers Version4.32.0.dev0
Tokenizer ClassCodeLlamaTokenizer
Beginning of Sentence Token<s>
End of Sentence Token</s>
Unk Token<unk>
Vocabulary Size32016
Torch Data Typebfloat16

Best Alternatives to CodeLlama 13B AWQ

Best Alternatives
Context / RAM
Downloads
Likes
...th CodeLlama 13B Python Hf AWQ16K / 7.5 GB70
Ramgpt 13B AWQ Gemm16K / 7.2 GB01
NexusRaven 13B AWQ16K / 7.2 GB354
CodeLlama 13B Instruct AWQ16K / 7.2 GB639
MAmmoTH Coder 13B AWQ16K / 7.2 GB371
...odeLlama 13B Oasst Sft V10 AWQ16K / 7.2 GB251
CodeLlama 13B Python AWQ16K / 7.2 GB252
...ma 13B Instruct Hf W4 G128 AWQ16K / 7.2 GB330
...lama 13B Python Hf W4 G128 AWQ16K / 7.2 GB270
WhiteRabbitNeo 13B V116K / 26 GB1767404
Note: green Score (e.g. "73.2") means that the model is better than TheBloke/CodeLlama-13B-AWQ.

Rank the CodeLlama 13B AWQ Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 40066 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241217