Phi 4 Deepseek R1K RL EZO by AXCXEPT


Tags: Autotrain compatible, Base model:finetune:microsoft/..., Base model:microsoft/phi-4, Conversational, Custom code, Dataset:ai-mo/numinamath-tir, Dataset:bespokelabs/bespoke-st..., Dataset:meta-math/metamathqa, En, Endpoints compatible, Ja, Phi3, Region:us, Safetensors, Sharded, Tensorflow

Phi 4 Deepseek R1K RL EZO Benchmarks

Scores (nn.n%) show how the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o"), or GPT-4 ("gpt4").
Phi 4 Deepseek R1K RL EZO (AXCXEPT/phi-4-deepseek-R1K-RL-EZO)

Phi 4 Deepseek R1K RL EZO Parameters and Internals

LLM Name: Phi 4 Deepseek R1K RL EZO
Repository: https://huggingface.co/AXCXEPT/phi-4-deepseek-R1K-RL-EZO
Base Model(s): Phi 4 (microsoft/phi-4)
Model Size: 32b
Required VRAM: 29.4 GB
Updated: 2025-05-12
Maintainer: AXCXEPT
Model Type: phi3
Model Files: 4.9 GB (1-of-6), 5.0 GB (2-of-6), 4.9 GB (3-of-6), 4.8 GB (4-of-6), 4.8 GB (5-of-6), 5.0 GB (6-of-6)
Supported Languages: en, ja
Model Architecture: Phi3ForCausalLM
License: mit
Context Length: 16384
Model Max Length: 16384
Transformers Version: 4.48.1
Tokenizer Class: GPT2Tokenizer
Padding Token: <|endoftext|>
Vocabulary Size: 100352
Torch Data Type: bfloat16
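
The Required VRAM figure appears to be the sum of the six model file sizes. The short Python sketch below checks that arithmetic and derives a rough parameter count from the bfloat16 data type. It rests on assumptions not stated in the listing: each parameter takes 2 bytes, 1 GB means 10^9 bytes, and the files contain only weights. The variable names are illustrative.

```python
# Shard sizes in GB, copied from the "Model Files" entry in the listing.
shard_sizes_gb = [4.9, 5.0, 4.9, 4.8, 4.8, 5.0]

# Assumption: bfloat16 stores each parameter in 2 bytes.
BYTES_PER_PARAM_BF16 = 2

# Sum the shards; rounding hides floating-point noise in the total.
total_gb = round(sum(shard_sizes_gb), 1)

# Assumption: 1 GB = 10**9 bytes and the shards hold only weights,
# so GB divided by bytes-per-parameter gives billions of parameters.
approx_params_billion = round(total_gb / BYTES_PER_PARAM_BF16, 2)

print(f"Total shard size: {total_gb} GB")
print(f"Approximate parameters: {approx_params_billion} billion")
```

Under these assumptions the shards add up to the listed 29.4 GB, which corresponds to roughly 14.7 billion parameters. That estimate does not match the listed Model Size of 32b, so the size field may deserve a check against the repository.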

Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227