Microsoft Phi 3 Mini 128K Instruct HQQ 4bit Smashed by PrunaAI: Benchmarks, Features and Detailed Analysis

Tags: 4bit, Autotrain compatible, Conversational, Custom code, Endpoints compatible, Hqq, Instruct, Phi3, Pruna-ai, Quantized, Region:us

Model Card on HF 🤗: https://huggingface.co/PrunaAI/microsoft-Phi-3-mini-128k-instruct-HQQ-4bit-smashed

Microsoft Phi 3 Mini 128K Instruct HQQ 4bit Smashed Benchmarks

LLME Score: 0.16713

Values shown as ^nn.n% indicate how the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o"), or GPT-4 ("gpt4").


Microsoft Phi 3 Mini 128K Instruct HQQ 4bit Smashed Parameters and Internals

Additional Notes

Results mentioning 'first' are obtained after the first run of the model... 'Sync' metrics are obtained by syncing all GPU processes and stopping the measurement once all of them have finished executing. 'Async' metrics are obtained without syncing all GPU processes; the measurement stops as soon as the model output can be used by the CPU.
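The difference between 'sync' and 'async' timing can be illustrated with a small sketch. This is not PrunaAI's benchmarking code; the helper names `time_call` and `benchmark` are hypothetical, and the sketch assumes the workload is wrapped in a zero-argument callable that returns once its output is usable by the CPU. On a CUDA setup, `torch.cuda.synchronize` would be a natural choice for the `sync` argument.

```python
import time
from typing import Callable, Dict, Optional


def time_call(fn: Callable[[], object],
              sync: Optional[Callable[[], None]] = None) -> float:
    """Return wall-clock seconds for a single call of fn.

    With sync: drain queued device work before starting and before
    stopping the timer ('sync'-style measurement).
    Without sync: stop as soon as fn returns ('async'-style measurement).
    """
    if sync is not None:
        sync()  # do not bill previously queued work to this call
    start = time.perf_counter()
    fn()
    if sync is not None:
        sync()  # wait until all device work launched by fn has finished
    return time.perf_counter() - start


def benchmark(fn: Callable[[], object],
              sync: Optional[Callable[[], None]] = None,
              runs: int = 3) -> Dict[str, float]:
    """Time the first call separately, then average the following runs."""
    first = time_call(fn, sync)
    later = [time_call(fn, sync) for _ in range(runs)]
    return {"first": first, "mean_after_first": sum(later) / len(later)}


if __name__ == "__main__":
    # Stand-in workload; replace with a real generation call.
    stats = benchmark(lambda: sum(range(100_000)))
    print(stats)
```

Reporting the first call separately matters because warm-up costs (kernel compilation, cache population, lazy initialization) can dominate it and would otherwise skew the average.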

Input Output

Performance Tips:

Test the efficiency gains directly in your use cases.

LLM Name	Microsoft Phi 3 Mini 128K Instruct HQQ 4bit Smashed
Repository 🤗	https://huggingface.co/PrunaAI/microsoft-Phi-3-mini-128k-instruct-HQQ-4bit-smashed
Base Model(s)	ORIGINAL_REPO_NAME /ORIGINAL_REPO_NAME
Required VRAM	2.9 GB
Updated	2025-06-01
Maintainer	PrunaAI
Model Type	phi3
Instruction-Based	Yes
Model Files	2.9 GB
Quantization Type	4bit
Model Architecture	Phi3ForCausalLM
Context Length	131072
Model Max Length	131072
Transformers Version	4.48.2
Tokenizer Class	LlamaTokenizer
Padding Token	<|endoftext|>
Vocabulary Size	32064
Torch Data Type	bfloat16
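As a rough sanity check on the sizes in the table, the raw weight storage can be estimated from the parameter count and the bits per weight. The sketch below assumes about 3.8 billion parameters for Phi 3 Mini, a figure that does not appear in this listing; `weight_bytes` is a hypothetical helper, and the estimate ignores quantization metadata and any layers kept at higher precision.

```python
def weight_bytes(n_params: int, bits_per_weight: float) -> float:
    """Raw storage for the weights alone, in bytes."""
    return n_params * bits_per_weight / 8


# Assumption: ~3.8B parameters (not stated in the listing).
N_PARAMS = 3_800_000_000

bf16_gb = weight_bytes(N_PARAMS, 16) / 1e9  # unquantized bfloat16 weights
int4_gb = weight_bytes(N_PARAMS, 4) / 1e9   # idealized 4-bit weights

print(f"bfloat16: {bf16_gb:.1f} GB")
print(f"4-bit:    {int4_gb:.1f} GB")
```

Under these assumptions the idealized 4-bit figure comes out below the 2.9 GB listed for the model files. That gap would be consistent with per-group scales and zero-points plus some tensors stored unquantized, though the listing does not break the size down.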

Best Alternatives to Microsoft Phi 3 Mini 128K Instruct HQQ 4bit Smashed

Best Alternatives	Context / RAM	Downloads	Likes
...dium 128K Instruct 8 0bpw EXL2	128K / 13.4 GB	30	1
...m 128K Instruct 8.0bpw H8 EXL2	128K / 13.4 GB	15	4
...m 128K Instruct 6.0bpw H6 EXL2	128K / 10.7 GB	15	3
...m 128K Instruct 3.0bpw H6 EXL2	128K / 5.6 GB	15	0
...m 128K Instruct 5.0bpw H6 EXL2	128K / 8.9 GB	15	0
...128K Instruct HQQ 2bit Smashed	128K / 1.4 GB	25	0
...28K Instruct Ov Fp16 Int4 Asym	128K / 2.5 GB	17	0
NuExtract Bpw6 EXL2	4K / 3 GB	16	1
Phi 3 Mini 4K Instruct Fp16	4K / GB	461	5
...Mini 4K Geminified 3 0bpw EXL2	4K / 1.6 GB	20	0



Original data from HuggingFace, OpenCompass and various public git repos.

Release v20241227
