CodeFuse CodeLlama 34B 4bits by codefuse-ai


Tags: Merged Model · arXiv:2311.02303 · 4-bit · AutoTrain compatible · Codegen · Endpoints compatible · GPTQ · Llama · PyTorch · Quantized · Region: US


CodeFuse CodeLlama 34B 4bits Parameters and Internals

Model Type: text-generation

Additional Notes: The model can be loaded on a single A10 (24 GB VRAM) or an RTX 4090 (24 GB VRAM) with only a small accuracy drop relative to the full-precision model (73.8% vs. 74.4% pass@1); see the loading sketch below.
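The following is a minimal loading sketch for one 24 GB GPU, assuming the auto-gptq and transformers packages; the model ID comes from the repository listed below, but the loading flags are illustrative assumptions rather than the official recipe.

```python
# Minimal loading sketch: pre-quantized 4-bit GPTQ checkpoint on one 24 GB GPU.
# Flag choices are assumptions for illustration, not the official recipe.
from transformers import AutoTokenizer
from auto_gptq import AutoGPTQForCausalLM

model_id = "codefuse-ai/CodeFuse-CodeLlama-34B-4bits"

tokenizer = AutoTokenizer.from_pretrained(model_id, use_fast=False)

# from_quantized restores the 4-bit GPTQ weights as-is; device_map="auto"
# places the ~19 GB of model files onto the available GPU.
model = AutoGPTQForCausalLM.from_quantized(model_id, device_map="auto")
model.eval()
```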
Training Details
Methodology: Fine-tuned on multiple code-related tasks using 600k instruction/answer pairs.
Model Architecture: 4-bit quantization (GPTQ).
Input Output
Input Format: A single string formed by concatenating the conversation turns (human and bot contents) in the training-data format.
Performance Tips: Make sure the input string ends with '<|role_start|>bot<|role_end|>' so the model knows it should generate an answer; the example below shows this.
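Continuing from the loading sketch above, a single-turn prompt in this format might look as follows. The role tags come from the documented format; the quicksort question and the generation settings are illustrative choices.

```python
# Hypothetical single-turn prompt in the documented conversation format.
HUMAN_ROLE_START_TAG = "<|role_start|>human<|role_end|>"
BOT_ROLE_START_TAG = "<|role_start|>bot<|role_end|>"

# Ending the prompt with the bot tag signals the model to produce the answer.
prompt = f"{HUMAN_ROLE_START_TAG}Write a quicksort function in Python.{BOT_ROLE_START_TAG}"

inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=512, do_sample=False)

# Drop the prompt tokens and keep only the newly generated answer.
answer = tokenizer.decode(
    outputs[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True
)
print(answer)
```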
Release Notes
Version 2023.9 (2023-09-26): 4-bit quantized version of CodeFuse-CodeLlama-34B, reaching 73.8% pass@1 accuracy.
Version 2023.9 (2023-09-11): CodeFuse-CodeLlama-34B achieved 74.4% pass@1, a state-of-the-art result for open-source LLMs at the time of release.
LLM Name: CodeFuse CodeLlama 34B 4bits
Repository: https://huggingface.co/codefuse-ai/CodeFuse-CodeLlama-34B-4bits
Base Model(s): CodeFuse CodeLlama 34B (codefuse-ai/CodeFuse-CodeLlama-34B)
Merged Model: Yes
Model Size: 34b
Required VRAM: 19 GB
Updated: 2024-12-22
Maintainer: codefuse-ai
Model Type: llama
Model Files: 19.0 GB
GPTQ Quantization: Yes
Quantization Type: gptq|4bit
Generates Code: Yes
Model Architecture: LlamaForCausalLM
License: other
Context Length: 16384
Model Max Length: 16384
Transformers Version: 4.32.0
Tokenizer Class: LlamaTokenizer
Beginning of Sentence Token: <s>
End of Sentence Token: </s>
Padding Token: <unk>
Unk Token: <unk>
Vocabulary Size: 32000
Torch Data Type: float16
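The metadata above (context length, data type, vocabulary, special tokens) can be sanity-checked straight from the Hub; a small sketch using only transformers, with the expected values taken from the listing:

```python
# Read the listed config and tokenizer metadata directly from the Hub.
from transformers import AutoConfig, AutoTokenizer

model_id = "codefuse-ai/CodeFuse-CodeLlama-34B-4bits"

config = AutoConfig.from_pretrained(model_id)
tok = AutoTokenizer.from_pretrained(model_id, use_fast=False)

print(config.max_position_embeddings)  # expected: 16384
print(config.torch_dtype)              # expected: torch.float16
print(config.vocab_size)               # expected: 32000
print(tok.bos_token, tok.eos_token)    # expected: <s> </s>
print(tok.pad_token, tok.unk_token)    # expected: <unk> <unk>
```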

Best Alternatives to CodeFuse CodeLlama 34B 4bits

Best Alternatives | Context / RAM | Downloads | Likes
CodeLlama 34B Guanaco GPTQ | 16K / 18.3 GB | 737 | 5
Phind CodeLlama 34B V2 GPTQ | 16K / 17.7 GB | 266 | 89
...chless Codellama 34B V2.0 GPTQ | 16K / 17.7 GB | 22 | 4
CodeLlama 34B Instruct GPTQ | 16K / 18.3 GB | 139 | 74
...nbuddy Coder 34B V11 Bf16 GPTQ | 16K / 17.9 GB | 36 | 0
...zardCoder Python 34B V1.0 GPTQ | 16K / 17.7 GB | 27 | 61
MAmmoTH Coder 34B GPTQ | 16K / 17.7 GB | 19 | 2
Synthia 34B V1.2 GPTQ | 16K / 17.7 GB | 27 | 7
CodeFuse CodeLlama 34B GPTQ | 16K / 17.7 GB | 31 | 9
CodeLlama 34B GPTQ | 16K / 18.3 GB | 30 | 20


Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241217