Qwen2 1.5B Instruct By Qwen: Benchmarks, Features and Detailed Analysis. Insights on Qwen2 1.5B Instruct.

Autotrain compatible Chat Conversational En Endpoints compatible Instruct Qwen2 Region:us Safetensors

Model Card on HF 🤗: https://huggingface.co/Qwen/Qwen2-1.5B-Instruct

Qwen2 1.5B Instruct Benchmarks

MMLU Pro: 16.68

GPQA: 1.57

MUSR: 12.03

BBH: 13.7

IFEval: 33.71 vs 88 (so35)^-61.7%

MATH Lvl 5: 6.27

LLME Score: 0.37312

^nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").

What is the LLM Explorer Rank (Score)

Qwen2 1.5B Instruct (Qwen/Qwen2-1.5B-Instruct)

Qwen2 1.5B Instruct Parameters and Internals

Model Type

text-generation

Additional Notes

Qwen2 features an improved tokenizer adaptive to multiple natural languages and codes.

Training Details

Methodology:

Pretrained with large amounts of data and post-trained with supervised finetuning and direct preference optimization.

Model Architecture:

Transformer architecture with SwiGLU activation, attention QKV bias, group query attention.

LLM Name	Qwen2 1.5B Instruct
Repository 🤗	https://huggingface.co/Qwen/Qwen2-1.5B-Instruct
Model Size	1.5b
Required VRAM	3.1 GB
Updated	2024-12-21
Maintainer	Qwen
Model Type	qwen2
Instruction-Based	Yes
Model Files	3.1 GB
Supported Languages	en
Model Architecture	Qwen2ForCausalLM
License	apache-2.0
Context Length	32768
Model Max Length	32768
Transformers Version	4.40.1
Tokenizer Class	Qwen2Tokenizer
Padding Token	<\|endoftext\|>
Vocabulary Size	151936
Torch Data Type	bfloat16
Errors	replace

Quantized Models of the Qwen2 1.5B Instruct

Model	Likes	Downloads	VRAM
Qwen2 1.5B Instruct GGUF	9	1953289	0 GB
Qwen2 1.5B Instruct AWQ	7	8332	1 GB
Qwen2 1.5B Instruct GPTQ Int4	3	7169	2 GB
... 1.5B Instruct Replete Adapted	2	686	3 GB
Qwen2 1.5B Instruct GGUF	0	554	0 GB
Qwen2 1.5B Instruct GPTQ Int8	3	57	3 GB
Hyper X Qwen2 1.5B It Python	0	17	1 GB
Qwen2 1.5B Instruct GGUF	0	47	0 GB

Best Alternatives to Qwen2 1.5B Instruct

Best Alternatives	Context / RAM	Downloads	Likes
Saba1 1.8B	128K / 3.6 GB	315	0
Gte Qwen2 1.5B Instruct	128K / 7.1 GB	54016	144
Miniclaus Qw1.5B UNAMGS	128K / 3.5 GB	185	8
Replete Coder Qwen2 1.5B	128K / 3.1 GB	233	23
Qwen2 1.5B Instruct Refine	128K / 3.1 GB	21	0
Samantha Qwen2 1.5B	128K / 3.1 GB	34	0
Qwen2.5 1.5B Instruct	32K / 3.1 GB	586994	252
Qwen2.5 Coder 1.5B Instruct	32K / 3.1 GB	21272	51
...struct W8a8 Int Dynamic Weight	32K / 3.1 GB	2005	0
...5 1.5B Instruct Abliterated V1	32K / 3.5 GB	404	4

Note: green Score (e.g. "73.2") means that the model is better than Qwen/Qwen2-1.5B-Instruct.

Rank the Qwen2 1.5B Instruct Capabilities

🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟

Instruction Following and Task Automation
Factuality and Completeness of Knowledge
Censorship and Alignment
Data Analysis and Insight Generation
Text Generation
Text Summarization and Feature Extraction
Code Generation
Multi-Language Support and Translation

What open-source LLMs or SLMs are you in search of? 40013 in total.

Email us: info@extractum.io. Our Privacy Policy | Terms and Conditions | Suggest an improvement.

Our Social Media →

Original data from HuggingFace, OpenCompass and various public git repos.

Release v20241217

Support LLM Explorer