Phi 2 Orange by rhysjones

 ยป  All LLMs  ยป  rhysjones  ยป  Phi 2 Orange   URL Share it on

  Autotrain compatible   Custom code Dataset:argilla/ultrafeedback-...   Dataset:intel/orca dpo pairs   Dataset:ldjnr/capybara   Dataset:ldjnr/pure-dove   Dataset:ldjnr/verified-camel   Dataset:meta-math/metamathqa Dataset:migtissera/synthia-v1.... Dataset:open-orca/slimorca-ded...   Endpoints compatible   Phi-msft   Region:us   Safetensors   Sharded   Tensorflow
Model Card on HF ๐Ÿค—: https://huggingface.co/rhysjones/phi-2-orange 

Phi 2 Orange Benchmarks

nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").
Phi 2 Orange (rhysjones/phi-2-orange)

Phi 2 Orange Parameters and Internals

Model Type 
text generation
Additional Notes 
An updated model is available at [rhysjones/phi-2-orange-v2](https://huggingface.co/rhysjones/phi-2-orange-v2) with higher evaluations.
Training Details 
Data Sources:
Open-Orca/SlimOrca-Dedup, migtissera/Synthia-v1.3, LDJnr/Verified-Camel, LDJnr/Pure-Dove, LDJnr/Capybara, meta-math/MetaMathQA, Intel/orca_dpo_pairs, argilla/ultrafeedback-binarized-preferences-cleaned
Methodology:
Two-step fine-tuning: a collection of broad training data followed by DPO fine-tuning
Input Output 
Input Format:
ChatML
Accepted Modalities:
text
Output Format:
Markdown format for Python outputs
LLM NamePhi 2 Orange
Repository ๐Ÿค—https://huggingface.co/rhysjones/phi-2-orange 
Model Size2.8b
Required VRAM5.6 GB
Updated2025-02-05
Maintainerrhysjones
Model Typephi-msft
Model Files  5.0 GB: 1-of-2   0.6 GB: 2-of-2
Model ArchitecturePhiForCausalLM
Licensemit
Model Max Length2048
Transformers Version4.37.0.dev0
Tokenizer ClassCodeGenTokenizer
Padding Token<|endoftext|>
Vocabulary Size51200
Torch Data Typebfloat16
Activation Functiongelu_new

Quantized Models of the Phi 2 Orange

Model
Likes
Downloads
VRAM
Phi 2 Orange GGUF203361 GB
Phi 2 Orange GPTQ4191 GB

Best Alternatives to Phi 2 Orange

Best Alternatives
Context / RAM
Downloads
Likes
MFANN3bv0.24128K / 11.1 GB50
MFANN3b128K / 11.1 GB1160
MFANN3bv1.3128K / 11.1 GB130
MFANN3bv1.1128K / 11.1 GB160
MFANN3bv0.23128K / 11.1 GB60
MFANN3b SFT128K / 5.6 GB1690
MFANN3b Rebase128K / 11.1 GB100
MFANN3bv1.2126K / 11.1 GB320
MFANN Phigments Slerp V232K / 5.6 GB1340
MFANN3bv0.2232K / 11.1 GB50
Note: green Score (e.g. "73.2") means that the model is better than rhysjones/phi-2-orange.

Rank the Phi 2 Orange Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 42577 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227