Llama 3 8B Instruct 64K By MaziyarPanahi: Benchmarks, Features and Detailed Analysis. Insights on Llama 3 8B Instruct 64K.

Arxiv:2309.10400 64k Autotrain compatible Axolotl Base model:finetune:winglian/l... Base model:winglian/llama-3-8b... Conversational Dataset:intel/orca dpo pairs Dpo En Facebook Finetuned Instruct Llama Llama-3 Meta Pose Pytorch Region:us Safetensors Sharded Tensorflow

Model Card on HF 🤗: https://huggingface.co/MaziyarPanahi/Llama-3-8B-Instruct-64k

Llama 3 8B Instruct 64K Benchmarks

LLME Score: 0.18344

^nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").

What is the LLM Explorer Rank (Score)

Llama 3 8B Instruct 64K (MaziyarPanahi/Llama-3-8B-Instruct-64k)

Llama 3 8B Instruct 64K Parameters and Internals

Model Type

text generation

Additional Notes

This model uses PoSE to extend Llama's context length from 8k to 64k.

Supported Languages

en (proficient)

Training Details

Data Sources:

RedPajama V1 dataset

Data Volume:

300M tokens

Methodology:

rank stabilized LoRA of rank 256

Context Length:

64000

Model Architecture:

Llama-3

LLM Name	Llama 3 8B Instruct 64K
Repository 🤗	https://huggingface.co/MaziyarPanahi/Llama-3-8B-Instruct-64k
Model Name	Llama-3-8B-Instruct-64k
Model Creator	MaziyarPanahi
Base Model(s)	Llama 3 8B 64K PoSE winglian/Llama-3-8b-64k-PoSE
Model Size	8b
Required VRAM	16.1 GB
Updated	2025-03-12
Maintainer	MaziyarPanahi
Model Type	llama
Instruction-Based	Yes
Model Files	5.0 GB: 1-of-4 5.0 GB: 2-of-4 4.9 GB: 3-of-4 1.2 GB: 4-of-4
Supported Languages	en
Model Architecture	LlamaForCausalLM
License	llama3
Context Length	8192
Model Max Length	8192
Transformers Version	4.40.0.dev0
Tokenizer Class	PreTrainedTokenizerFast
Vocabulary Size	128256
Torch Data Type	float16

Quantized Models of the Llama 3 8B Instruct 64K

Model	Likes	Downloads	VRAM
Llama 3 8B Instruct 64K GGUF	13	2250776	3 GB
... Instruct 64K HQQ 1bit Smashed	1	35	3 GB
Llama 3 8B Instruct 64K AWQ	0	20	5 GB

Best Alternatives to Llama 3 8B Instruct 64K

Best Alternatives	Context / RAM	Downloads	Likes
...a 3 8B Instruct Gradient 1048K	1024K / 16.1 GB	5133	681
P	1024K / 16.1 GB	119	0
93	1024K / 16.1 GB	84	0
117	1024K / 16.1 GB	33	0
Mpasila Viking 8B	1024K / 16.1 GB	84	0
93a	1024K / 16.1 GB	20	0
126	1024K / 16.1 GB	13	0
136	1024K / 16.1 GB	13	0
133	1024K / 16.1 GB	12	0
16	1024K / 16.1 GB	169	0

Note: green Score (e.g. "73.2") means that the model is better than MaziyarPanahi/Llama-3-8B-Instruct-64k.

Rank the Llama 3 8B Instruct 64K Capabilities

🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟

Instruction Following and Task Automation
Factuality and Completeness of Knowledge
Censorship and Alignment
Data Analysis and Insight Generation
Text Generation
Text Summarization and Feature Extraction
Code Generation
Multi-Language Support and Translation

What open-source LLMs or SLMs are you in search of? 44902 in total.

Email us: info@extractum.io. Our Privacy Policy | Terms and Conditions | Suggest an improvement.

Our Social Media →

Original data from HuggingFace, OpenCompass and various public git repos.

Release v20241227

Support LLM Explorer