Model Type: text generation, instruction following

Use Cases
Areas:
Applications: general-purpose AI systems; memory- and compute-constrained environments; latency-bound scenarios; tasks requiring strong reasoning (code, math, logic)
Primary Use Cases: intended for broad commercial and research use
Limitations: not designed or evaluated for all possible downstream purposes; limited language support outside English
Considerations: evaluate and mitigate for accuracy, safety, and fairness before deployment

Supported Languages: primarily English; roughly 10% of the training data is multilingual

Training Details
Data Sources: publicly available documents; newly created synthetic data; high-quality chat-format supervised data
Data Volume:
Methodology: supervised fine-tuning (SFT) and Direct Preference Optimization (DPO)
Context Length:
Training Time:
Hardware Used:
Model Architecture: dense decoder-only Transformer

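To make the DPO step of the methodology above concrete, here is a minimal sketch of the DPO loss for a single preference pair. The log-probability values and the `beta` hyperparameter are hypothetical stand-ins, not values from this model's training run:

```python
import math

def dpo_loss(policy_chosen, ref_chosen, policy_rejected, ref_rejected, beta=0.1):
    """DPO loss for one (chosen, rejected) preference pair.

    Each argument is the summed log-probability of a response under the
    trainable policy or the frozen reference model; beta scales the
    implicit reward derived from the policy/reference gap.
    """
    # Implicit rewards: how far the policy has moved from the reference
    reward_chosen = beta * (policy_chosen - ref_chosen)
    reward_rejected = beta * (policy_rejected - ref_rejected)
    # -log sigmoid(margin): shrinks as the chosen response is preferred more
    margin = reward_chosen - reward_rejected
    return -math.log(1.0 / (1.0 + math.exp(-margin)))

# Hypothetical log-probabilities for one preference pair
neutral = dpo_loss(-10.0, -10.0, -12.0, -12.0)   # zero margin -> loss = ln 2
preferred = dpo_loss(-8.0, -10.0, -14.0, -12.0)  # policy favors the chosen response
```

In practice this loss is averaged over batches of preference pairs and minimized with gradient descent on the policy only; the reference model stays frozen.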
Responsible AI Considerations
Fairness: performance may vary across English language varieties; outputs may reinforce representation harms or stereotypes
Transparency: disclose the model's potential behaviors and limitations to end users
Accountability: developers are responsible for their specific use cases and deployments
Mitigation Strategies: follow responsible-AI best practices and implement additional mitigations for sensitive deployment contexts

Input/Output
Input Format:
Accepted Modalities:
Output Format:
Performance Tips: include the specific tokens the model expects in prompts for improved reliability
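One way to read the tip above: chat-tuned models are usually most reliable when prompts are wrapped in the exact special tokens used during training. The `<|user|>`, `<|assistant|>`, and `<|end|>` markers below are purely illustrative assumptions; substitute whatever tokens your model's documentation specifies:

```python
# Hypothetical chat-format tokens; replace with the tokens your model documents.
USER, ASSISTANT, END = "<|user|>", "<|assistant|>", "<|end|>"

def build_prompt(turns):
    """Render (role, text) turns into one chat-formatted prompt string,
    ending with the assistant token so the model continues as the assistant."""
    parts = [
        f"{USER if role == 'user' else ASSISTANT}\n{text}{END}\n"
        for role, text in turns
    ]
    return "".join(parts) + ASSISTANT + "\n"

prompt = build_prompt([("user", "How do I sort a list in Python?")])
```

Hand-building strings like this is fragile; when a tokenizer ships a built-in chat template, prefer that over manual formatting.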
|
|