MilkyLoong Qwen2.5 1.5B Base1 by Lunzima


Tags: Merged Model · Arxiv: 2408.07990 · Autotrain compatible · Conversational · Endpoints compatible · Instruct · Qwen2 · Region: US · Safetensors · Base models: see the Parameters and Internals listing below
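The tags above identify this as a merged model built from several upstream repositories (listed in full under Parameters and Internals below), but the page does not state the merge method or weights the maintainer used. As a purely illustrative sketch, a mergekit-style configuration over a few of those repositories could look like the following; the merge_method, the choice of anchor base_model, and the dtype are assumptions, not the actual recipe.

```python
# Illustrative only: writes a hypothetical mergekit configuration covering some
# of the base models listed on this page. The merge method ("model_stock"),
# the anchor base model, and the dtype are assumptions; the maintainer's
# actual merge recipe is not documented here.
import yaml

config = {
    "merge_method": "model_stock",  # assumed; could equally be ties, dare_ties, linear, ...
    "base_model": "huihui-ai/Qwen2.5-1.5B-Instruct-abliterated",  # assumed anchor model
    "models": [
        {"model": "bond005/meno-tiny-0.1"},
        {"model": "Youlln/ECE-PRYMMAL-YL-1B-SLERP-V2"},
        {"model": "Youlln/ECE-PRYMMAL-YL-1B-SLERP-V1"},
        {"model": "Sakalti/Saba1.5-1.5B"},
        {"model": "fblgit/miniclaus-qw1.5B-UNAMGS-GRPO"},
    ],
    "dtype": "bfloat16",  # matches the Torch Data Type reported below
}

with open("merge-config.yaml", "w") as f:
    yaml.safe_dump(config, f, sort_keys=False)

# The file could then be passed to the mergekit CLI, e.g.:
#   mergekit-yaml merge-config.yaml ./MilkyLoong-Qwen2.5-1.5B-base1
```

Two of the listed entries (godlikehhd/alpaca_data_ifd_min_2600 and godlikehhd/alpaca_data_score_max_2500) appear to be instruction datasets rather than checkpoints, so they are omitted from the sketch.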

MilkyLoong Qwen2.5 1.5B Base1 Benchmarks

Benchmark scores (shown as nn.n%) indicate how MilkyLoong Qwen2.5 1.5B Base1 (Lunzima/MilkyLoong-Qwen2.5-1.5B-base1) compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o"), or GPT-4 ("gpt4").

MilkyLoong Qwen2.5 1.5B Base1 Parameters and Internals

LLM Name: MilkyLoong Qwen2.5 1.5B Base1
Repository 🤗: https://huggingface.co/Lunzima/MilkyLoong-Qwen2.5-1.5B-base1
Base Model(s): huihui-ai/Qwen2.5-1.5B-Instruct-abliterated, bond005/meno-tiny-0.1, Youlln/ECE-PRYMMAL-YL-1B-SLERP-V2, Youlln/ECE-PRYMMAL-YL-1B-SLERP-V1, goulue5/merging_LLM, godlikehhd/alpaca_data_ifd_min_2600, godlikehhd/alpaca_data_score_max_2500, Sakalti/Saba1.5-1.5B, fblgit/miniclaus-qw1.5B-UNAMGS-GRPO
Merged Model: Yes
Model Size: 1.5b
Required VRAM: 3.1 GB
Updated: 2025-05-31
Maintainer: Lunzima
Model Type: qwen2
Instruction-Based: Yes
Model Files: 3.1 GB
Model Architecture: Qwen2ForCausalLM
Context Length: 32768
Model Max Length: 32768
Transformers Version: 4.51.1
Tokenizer Class: Qwen2Tokenizer
Padding Token: <|im_end|>
Vocabulary Size: 151936
Torch Data Type: bfloat16
Errors: replace
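The internals above (Qwen2ForCausalLM architecture, Qwen2Tokenizer, bfloat16 weights, 32768-token context, <|im_end|> padding) map onto a standard Transformers loading path. A minimal sketch, assuming the repository id listed above and a recent transformers release (the page reports 4.51.1); the prompt text is an arbitrary placeholder.

```python
# Minimal sketch: loads the merged checkpoint with the settings reported above
# (bfloat16 weights, Qwen2 tokenizer and chat template, 32768-token context).
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

repo_id = "Lunzima/MilkyLoong-Qwen2.5-1.5B-base1"

tokenizer = AutoTokenizer.from_pretrained(repo_id)  # Qwen2Tokenizer
model = AutoModelForCausalLM.from_pretrained(
    repo_id,
    torch_dtype=torch.bfloat16,   # matches the reported Torch Data Type
    device_map="auto",            # roughly 3.1 GB of VRAM per the table above
)

# The model is instruction-tuned, so format requests with the chat template.
messages = [{"role": "user", "content": "Explain what a merged model is in one sentence."}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

outputs = model.generate(inputs, max_new_tokens=128)
print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))
```

For batched generation, the padding token reported above (<|im_end|>) can be assigned to tokenizer.pad_token before tokenizing.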

Best Alternatives to MilkyLoong Qwen2.5 1.5B Base1

Best Alternatives             Context / RAM    Downloads   Likes
PRYMMAL ECE 1B SLERP V1       32K / 9.7 GB     25          0
Qwen2.5 1B Instruct           32K / 2 GB       234         0
ECE 1B Merge PRYMMAL          32K / 3.6 GB     50          0
ECE PRYMMAL1B FT V1           32K / 6.2 GB     26          0
Sailor2 1B Chat               4K / 2 GB        679         16
NanoLM 1B Instruct V2         4K / 2.1 GB      876         1
NanoLM 1B Instruct V1.1       4K / 2.1 GB      52          1
Note: a green score (e.g. "73.2") indicates that the alternative model performs better than Lunzima/MilkyLoong-Qwen2.5-1.5B-base1.

Rank the MilkyLoong Qwen2.5 1.5B Base1 Capabilities

🆘 Have you tried this model? Rate its performance. Your feedback will greatly assist the ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  


Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227