Qwen2.5 4B By qingy2024: Benchmarks, Features and Detailed Analysis. Insights on Qwen2.5 4B.

Merged Model Autotrain compatible Base model:finetune:qwen/qwen2... Base model:qwen/qwen2.5-3b Conversational Endpoints compatible Qwen2 Region:us Safetensors Sharded Tensorflow

Model Card on HF 🤗: https://huggingface.co/qingy2024/Qwen2.5-4B

Qwen2.5 4B Benchmarks

MMLU Pro: 16.94

GPQA: 5.48

MUSR: 16.53

BBH: 19.98

IFEval: 21.58 vs 88 (so35)^-75.5%

MATH Lvl 5: 3.32

LLME Score: 0.31101

^nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").

What is the LLM Explorer Rank (Score)

Qwen2.5 4B Parameters and Internals

LLM Name	Qwen2.5 4B
Repository 🤗	https://huggingface.co/qingy2024/Qwen2.5-4B
Base Model(s)	Qwen/Qwen2.5-3B Qwen/Qwen2.5-3B
Merged Model	Yes
Model Size	3b
Required VRAM	8.4 GB
Updated	2025-02-05
Maintainer	qingy2024
Model Type	qwen2
Model Files	5.0 GB: 1-of-2 3.4 GB: 2-of-2
Model Architecture	Qwen2ForCausalLM
Context Length	32768
Model Max Length	32768
Transformers Version	4.46.2
Tokenizer Class	Qwen2Tokenizer
Padding Token	<\|endoftext\|>
Vocabulary Size	151936
Torch Data Type	bfloat16
Errors	replace

Best Alternatives to Qwen2.5 4B

Best Alternatives	Context / RAM	Downloads	Likes
Saba2 3B	128K / 6.2 GB	330	0
Saba1.5 Pro 3B	128K / 5.8 GB	137	1
Light 1.1 3B	128K / 6.2 GB	62	1
SmallThinker 3B Preview	32K / 6.8 GB	130211	379
Qwen2.5 3B Instruct	32K / 6.2 GB	326544	158
Qwen2.5 3B	32K / 6.2 GB	224641	54
Light 3B	32K / 6.9 GB	347	2
Calme 3.2 Instruct 3B	32K / 11 GB	3022	1
Calme 3.1 Baguette 3B	32K / 6.2 GB	3008	1
Calme 3.2 Baguette 3B	32K / 11 GB	2910	1

Note: green Score (e.g. "73.2") means that the model is better than qingy2024/Qwen2.5-4B.

Rank the Qwen2.5 4B Capabilities

🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟

Instruction Following and Task Automation
Factuality and Completeness of Knowledge
Censorship and Alignment
Data Analysis and Insight Generation
Text Generation
Text Summarization and Feature Extraction
Code Generation
Multi-Language Support and Translation

What open-source LLMs or SLMs are you in search of? 42577 in total.

Email us: info@extractum.io. Our Privacy Policy | Terms and Conditions | Suggest an improvement.

Our Social Media →

Original data from HuggingFace, OpenCompass and various public git repos.

Release v20241227

Support LLM Explorer

Qwen2.5 4B by qingy2024

» All LLMs » qingy2024 » Qwen2.5 4B URL Share it on

Qwen2.5 4B Benchmarks

Qwen2.5 4B Parameters and Internals

Best Alternatives to Qwen2.5 4B

Rank the Qwen2.5 4B Capabilities

What open-source LLMs or SLMs are you in search of? 42577 in total.