Phi 2 Dolly Instruction Polish by s3nh


Tags: Base model:adapter:microsoft/p..., Base model:microsoft/phi-2, Custom code, Generated from trainer, Instruct, Peft, Phi-msft, Region:us, Safetensors, Sharded, Tensorflow

Phi 2 Dolly Instruction Polish Parameters and Internals

Additional Notes 
The model is a fine-tuned version of microsoft/phi-2. It was trained for 4 epochs on an undisclosed dataset, with a learning rate of 3e-06, a batch size of 1, and the Adam optimizer (the optimizer's own parameters are not given here).
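The hyperparameters named above can be collected in a small config object. This is a sketch only: the class and field names are hypothetical, the Adam parameters are left unset because the listing does not state them, and the dataset size in the usage line is an arbitrary number used purely to show the step arithmetic.

```python
import math
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass(frozen=True)
class FineTuneConfig:
    """Hypothetical container for the hyperparameters stated in the listing."""

    base_model: str = "microsoft/phi-2"
    learning_rate: float = 3e-06
    batch_size: int = 1
    num_epochs: int = 4
    optimizer: str = "Adam"
    # Not stated in the listing, so deliberately left unset.
    adam_betas: Optional[Tuple[float, float]] = None

    def total_steps(self, num_examples: int) -> int:
        """Optimizer steps, assuming no gradient accumulation."""
        return math.ceil(num_examples / self.batch_size) * self.num_epochs


config = FineTuneConfig()
# 1000 is an arbitrary example size; the real dataset is undisclosed.
steps = config.total_steps(1000)
print(steps)
```

With a batch size of 1, every training example costs one optimizer step per epoch, so the step count scales directly with dataset size.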
LLM Name: Phi 2 Dolly Instruction Polish
Repository: https://huggingface.co/s3nh/phi-2_dolly_instruction_polish
Base Model(s): Phi 2 (microsoft/phi-2)
Model Size: 2.8b
Required VRAM: 5.6 GB
Updated: 2024-12-21
Maintainer: s3nh
Model Type: phi-msft
Instruction-Based: Yes
Model Files: 5.0 GB (1-of-2), 0.6 GB (2-of-2)
Model Architecture: PhiForCausalLM
License: other
Model Max Length: 2048
Transformers Version: 4.37.0.dev0
Tokenizer Class: CodeGenTokenizer
Vocabulary Size: 51200
Torch Data Type: float16
Activation Function: gelu_new
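
The Required VRAM figure is consistent with a simple weights-only estimate: parameter count times bytes per parameter. A minimal sketch of that arithmetic, assuming decimal gigabytes (1e9 bytes) and ignoring activations, KV cache, and framework overhead; the function name and the dtype table are illustrative and not part of the listing.

```python
# Bytes per parameter for a few common dtypes (illustrative table).
BYTES_PER_PARAM = {"float32": 4, "float16": 2, "bfloat16": 2, "int8": 1}


def weights_only_gb(num_params: float, dtype: str) -> float:
    """Estimate memory for the weights alone, in decimal GB (1e9 bytes)."""
    return num_params * BYTES_PER_PARAM[dtype] / 1e9


# Values taken from the listing: 2.8b parameters, float16 weights.
estimate = weights_only_gb(2.8e9, "float16")
print(f"{estimate:.1f} GB")
```

The same total appears when the two listed model files are added together (5.0 GB + 0.6 GB). Actual memory use at inference time will likely be higher than this weights-only figure.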

Best Alternatives to Phi 2 Dolly Instruction Polish

Best Alternatives                          Context / RAM    Downloads, Likes
Phi 2 Instruct V0.1                        2K / 5.6 GB      3502
Phi 2 Instruct Apo                         2K / 5.6 GB      330
EEVE Korean Instruct 2.8B V1.0             2K / 5.7 GB      355120
Att Model                                  2K / 5.7 GB      90
Eeve2.8 Base                               2K / 5.7 GB      50
Eeve2.8 Ko                                 2K / 5.7 GB      180
... Instruct 2.8B V1.0 20240430 2          2K / 2.9 GB      120
Phi 2 Code Instruct                        2K / 5.6 GB      204
Dolphin 2 6 Phi 2                          0K / 5.6 GB      658192
Phi 2 Evol Instruct Chinese                0K / 5.6 GB      04

Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241217