Qwen1.5 110B 4bit By mlx-community: Benchmarks, Features and Detailed Analysis. Insights on Qwen1.5 110B 4bit.

4bit Conversational En Mlx Pretrained Quantized Qwen2 Region:us Safetensors Sharded Tensorflow

Model Card on HF 🤗: https://huggingface.co/mlx-community/Qwen1.5-110B-4bit

Qwen1.5 110B 4bit Benchmarks

^nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").

What is the LLM Explorer Rank (Score)

Qwen1.5 110B 4bit (mlx-community/Qwen1.5-110B-4bit)

Qwen1.5 110B 4bit Parameters and Internals

Model Type

text-generation

Additional Notes

This model was converted to MLX format.

Supported Languages

en (high)

LLM Name	Qwen1.5 110B 4bit
Repository 🤗	https://huggingface.co/mlx-community/Qwen1.5-110B-4bit
Model Size	110b
Required VRAM	62.2 GB
Updated	2025-06-01
Maintainer	mlx-community
Model Type	qwen2
Model Files	5.4 GB: 1-of-12 5.4 GB: 2-of-12 5.3 GB: 3-of-12 5.3 GB: 4-of-12 5.3 GB: 5-of-12 5.3 GB: 6-of-12 5.3 GB: 7-of-12 5.3 GB: 8-of-12 5.3 GB: 9-of-12 5.3 GB: 10-of-12 5.3 GB: 11-of-12 3.7 GB: 12-of-12
Supported Languages	en
Quantization Type	4bit
Model Architecture	Qwen2ForCausalLM
License	other
Context Length	8192
Model Max Length	8192
Transformers Version	4.37.0
Tokenizer Class	Qwen2Tokenizer
Padding Token	<\|endoftext\|>
Vocabulary Size	152064
Torch Data Type	bfloat16
Errors	replace

Best Alternatives to Qwen1.5 110B 4bit

Best Alternatives	Context / RAM	Downloads	Likes
Qwen1.5 110B Chat 4bit	32K / 62.2 GB	22	5
Qwen1.5 110B Chat 8bit	32K / 179.8 GB	12	1
...n1.5 110B Chat 3.35bpw H6 EXL2	32K / 49 GB	8	1
...n1.5 110B Chat 3.25bpw H6 EXL2	32K / 47.7 GB	9	1
Qwen1.5 110B Chat	32K / 158.3 GB	4351	127
Qwen1.5 110B	32K / 221.7 GB	323	99
Qwen1.5 110B Chat AWQ	32K / 61.7 GB	34	9
Qwen1.5 110B Chat AWQ	32K / 61.1 GB	16	0
Dolphin 2.9.1 Qwen 110B	32K / 193.4 GB	26	28
Qwen1.5 110B Chat GGUF	32K / 3.1 GB	49	2

Note: green Score (e.g. "73.2") means that the model is better than mlx-community/Qwen1.5-110B-4bit.

Rank the Qwen1.5 110B 4bit Capabilities

🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟

Instruction Following and Task Automation
Factuality and Completeness of Knowledge
Censorship and Alignment
Data Analysis and Insight Generation
Text Generation
Text Summarization and Feature Extraction
Code Generation
Multi-Language Support and Translation

What open-source LLMs or SLMs are you in search of? 47753 in total.

Email us: info@extractum.io. Our Privacy Policy | Terms and Conditions | Suggest an improvement.

Our Social Media →

Original data from HuggingFace, OpenCompass and various public git repos.

Release v20241227

Support LLM Explorer

Qwen1.5 110B 4bit by mlx-community

» All LLMs » mlx-community » Qwen1.5 110B 4bit URL Share it on

Qwen1.5 110B 4bit Benchmarks

Qwen1.5 110B 4bit Parameters and Internals

Best Alternatives to Qwen1.5 110B 4bit

Rank the Qwen1.5 110B 4bit Capabilities

What open-source LLMs or SLMs are you in search of? 47753 in total.