BigCodeLlama 92B by nisten: Benchmarks, Features and Detailed Analysis

Tags: Autotrain compatible, Base model:codellama/codellama..., Base model:finetune:codellama/..., Code, Codegen, Conversational, Endpoints compatible, Instruct, Llama, Merge, Mergekit, Region:us, Safetensors, Sharded, Tensorflow

Model Card on HF 🤗: https://huggingface.co/nisten/BigCodeLlama-92b

BigCodeLlama 92B Benchmarks

ARC: 54.78 vs 96.7 (so35)^-43.4%

HellaSwag: 77.84 vs 95.3 (gpt4)^-18.3%

MMLU: 55.4 vs 88.3 (so35)^-37.3%

TruthfulQA: 51.34 vs 59 (gpt4)^-13%

WinoGrande: 73.09 vs 87.5 (gpt4)^-16.5%

GSM8K: 44.96 vs 96.4 (so35)^-53.4%

LLME Score: 0.17875

^nn.n%: the percentage difference between the model's score and the reference model's score on that benchmark. The reference models are Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") and GPT-4 ("gpt4").
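
The percentages in the benchmark list appear to be plain relative differences, (score - reference) / reference. The short sketch below recomputes them from the scores listed above. The function name `relative_gap` and the dictionary layout are illustrative choices, not part of the original page.

```python
def relative_gap(score: float, reference: float) -> float:
    """Percentage difference of `score` relative to `reference`, to one decimal."""
    return round((score - reference) / reference * 100, 1)


# (model score, reference score) pairs copied from the benchmark list above.
benchmarks = {
    "ARC": (54.78, 96.7),
    "HellaSwag": (77.84, 95.3),
    "MMLU": (55.4, 88.3),
    "TruthfulQA": (51.34, 59.0),
    "WinoGrande": (73.09, 87.5),
    "GSM8K": (44.96, 96.4),
}

for name, (score, reference) in benchmarks.items():
    print(f"{name}: {relative_gap(score, reference):+.1f}%")
```

Running it reproduces the listed gaps, for example -43.4% for ARC and -53.4% for GSM8K.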


BigCodeLlama 92B (nisten/BigCodeLlama-92b)

BigCodeLlama 92B Parameters and Internals

Model Type

conversational

Additional Notes

This is an experimental 92B-parameter model built by merging the `../CodeLlama-70b-Python-hf` and `../CodeLlama-70b-Instruct-hf` models according to a specific YAML configuration. The model type is conversational.

LLM Name	BigCodeLlama 92B
Repository 🤗	https://huggingface.co/nisten/BigCodeLlama-92b
Base Model(s)	CodeLlama 70B Instruct Hf codellama/CodeLlama-70b-Instruct-hf
Model Size	92b
Required VRAM	184 GB
Updated	2025-03-12
Maintainer	nisten
Model Type	llama
Instruction-Based	Yes
Model Files	9.8 GB: 1-of-19; 9.8 GB: 2-of-19; 9.6 GB: 3-of-19; 9.8 GB: 4-of-19; 9.9 GB: 5-of-19; 9.7 GB: 6-of-19; 10.0 GB: 7-of-19; 9.9 GB: 8-of-19; 9.6 GB: 9-of-19; 10.0 GB: 10-of-19; 9.8 GB: 11-of-19; 9.9 GB: 12-of-19; 9.8 GB: 13-of-19; 9.8 GB: 14-of-19; 10.0 GB: 15-of-19; 9.9 GB: 16-of-19; 9.6 GB: 17-of-19; 9.7 GB: 18-of-19; 7.4 GB: 19-of-19
Generates Code	Yes
Model Architecture	LlamaForCausalLM
License	mit
Context Length	2048
Model Max Length	2048
Transformers Version	4.37.2
Tokenizer Class	LlamaTokenizer
Vocabulary Size	32016
Torch Data Type	bfloat16
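
The Required VRAM figure can be cross-checked against the Model Files row. The sketch below sums the 19 shard sizes and, assuming 2 bytes per parameter for bfloat16 and 1 GB = 10^9 bytes, estimates the parameter count. The variable names are illustrative, and the result covers the weights only: activations and the KV cache need additional memory at inference time.

```python
# Shard sizes in GB, copied in order from the "Model Files" row above.
shard_sizes_gb = [
    9.8, 9.8, 9.6, 9.8, 9.9, 9.7, 10.0, 9.9, 9.6, 10.0,
    9.8, 9.9, 9.8, 9.8, 10.0, 9.9, 9.6, 9.7, 7.4,
]

# Assumption: bfloat16 stores each parameter in 2 bytes.
BYTES_PER_PARAM = 2

total_gb = sum(shard_sizes_gb)
# With 1 GB taken as 1e9 bytes, GB divided by bytes-per-parameter gives billions of parameters.
approx_params_billion = total_gb / BYTES_PER_PARAM

print(f"Total weight size: {total_gb:.1f} GB")
print(f"Approximate parameters: {approx_params_billion:.1f}B")
```

The shards add up to about 184 GB, which matches the Required VRAM row and is consistent with the 92B in the model name.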

Best Alternatives to BigCodeLlama 92B

Best Alternatives	Context / RAM	Downloads	Likes
CodeLlama 70B Instruct Hf	4K / 72.3 GB	6005	207
CodeLlama 70B Instruct Hf	4K / 72.3 GB	11844	16
Code Llama 70B Python Instruct	4K / 138.1 GB	39	1
CodeLlama 70B Instruct Neuron	4K / GB	185	1
...Llama 70B Instruct Hf 4bit MLX	4K / 39.1 GB	35	25
...70B Instruct Nf4 Fp16 Upscaled	4K / 138.7 GB	24	2
...70B Instruct Hf 5.0bpw H6 EXL2	2K / 43.6 GB	19	6
...0B Instruct Hf 2.65bpw H6 EXL2	2K / 23.4 GB	18	3
...70B Instruct Hf 2.4bpw H6 EXL2	2K / 21.3 GB	15	1
...70B Instruct Hf 4.0bpw H6 EXL2	2K / 35.1 GB	13	1



Original data from HuggingFace, OpenCompass and various public git repos.
