Phi 1 by microsoft


Tags: arXiv:2306.11644, Autotrain compatible, Code, En, Endpoints compatible, Phi, Region:us, Safetensors

Phi 1 Parameters and Internals

Model Type 
text-generation
Use Cases 
Areas:
research, not suitable for production coding tasks
Applications:
basic Python coding
Primary Use Cases:
code generation, coding assistance
Limitations:
limited to training data packages, may replicate scripts, can generate inaccurate code, unreliable with non-code formats, limited natural language comprehension
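One way to act on the package limitation is to list which modules a generated snippet imports and flag anything outside a trusted set. The sketch below uses only the standard-library `ast` module. The `TRUSTED_MODULES` set and the `untrusted_imports` helper are hypothetical: the listing does not name the packages seen in training, so the set is a placeholder to adjust.

```python
import ast

# Hypothetical allowlist: a placeholder, not the model's actual training packages.
TRUSTED_MODULES = {"math", "random", "itertools", "collections", "typing", "datetime"}


def untrusted_imports(source: str, trusted=TRUSTED_MODULES) -> set[str]:
    """Return top-level module names imported by `source` that are not in `trusted`."""
    found = set()
    for node in ast.walk(ast.parse(source)):
        if isinstance(node, ast.Import):
            # `import a.b.c` -> record the top-level package `a`
            for alias in node.names:
                found.add(alias.name.split(".")[0])
        elif isinstance(node, ast.ImportFrom):
            # Skip relative imports (`from . import x`), which have level > 0
            if node.module and node.level == 0:
                found.add(node.module.split(".")[0])
    return found - set(trusted)


generated = "import math\nimport requests\nfrom os.path import join\n"
print(sorted(untrusted_imports(generated)))  # ['os', 'requests']
```

Any module this reports is a cue to check the generated calls against that package's documentation. The check only parses the code and never runs it.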
Additional Notes 
Security risks include directory traversal, injection attacks, misunderstanding requirements, lack of input validation, insecure defaults, and failure in error handling.
Supported Languages 
en (high)
Training Details 
Data Sources:
The Stack v1.2, StackOverflow, code_contests, synthetic Python textbooks and exercises
Data Volume:
54B tokens (7B unique tokens)
Training Time:
6 days
Hardware Used:
8 A100 GPUs
Model Architecture:
Transformer-based model with next-word prediction objective
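The training figures above can be related with some quick arithmetic. This is a rough sketch only: it assumes the 6 days cover all 54B tokens and that all 8 GPUs ran for the whole period, neither of which the listing states. The function name `training_summary` is illustrative.

```python
def training_summary(total_tokens=54e9, unique_tokens=7e9, days=6, gpus=8):
    """Derive rough training statistics from the listed figures."""
    gpu_hours = days * 24 * gpus
    return {
        # How many times each unique token is seen on average
        "passes_over_unique_data": total_tokens / unique_tokens,
        "gpu_hours": gpu_hours,
        # Assumes every GPU was busy for the full duration
        "tokens_per_gpu_second": total_tokens / (gpu_hours * 3600),
    }


for name, value in training_summary().items():
    print(f"{name}: {value:,.1f}")
```

Under those assumptions the figures work out to about 7.7 passes over the unique data, 1,152 GPU-hours, and roughly 13,000 tokens per GPU per second.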
Input Output 
Input Format:
Python code, with comments that describe the code to generate
Accepted Modalities:
code
Output Format:
Python code
Performance Tips:
Manually verify every API call when the generated code uses packages that were not part of the training set.
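The input format described above can be illustrated with a prompt made of a function signature followed by a docstring, which the model then completes. The sketch below is a minimal example, assuming the `transformers` and `torch` packages are installed. The helper names `build_prompt` and `complete` are illustrative and not part of any library. Calling `complete` downloads the model weights.

```python
def build_prompt(signature: str, description: str) -> str:
    """Format a function signature plus docstring for the model to complete."""
    return f'{signature}\n    """\n    {description}\n    """\n'


def complete(prompt: str, max_length: int = 200) -> str:
    """Generate a completion with microsoft/phi-1 (downloads weights on first use)."""
    # Imported lazily so build_prompt works without these packages installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained("microsoft/phi-1")
    model = AutoModelForCausalLM.from_pretrained("microsoft/phi-1")
    inputs = tokenizer(prompt, return_tensors="pt")
    # max_length counts prompt plus generated tokens; cap it at the listed
    # context length of 2048.
    outputs = model.generate(**inputs, max_length=min(max_length, 2048))
    return tokenizer.decode(outputs[0], skip_special_tokens=True)


prompt = build_prompt("def print_prime(n):", "Print all primes between 1 and n")
print(prompt)
# To run the model: print(complete(prompt))
```

The returned text contains the prompt followed by the generated body, so it should be reviewed before use, as the limitations above suggest.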
LLM Name: Phi 1
Repository: https://huggingface.co/microsoft/phi-1
Model Size: 1.4b
Required VRAM: 2.8 GB
Updated: 2024-11-21
Maintainer: microsoft
Model Type: phi
Model Files: 2.8 GB
Supported Languages: en
Model Architecture: PhiForCausalLM
License: mit
Context Length: 2048
Model Max Length: 2048
Transformers Version: 4.37.0
Tokenizer Class: CodeGenTokenizer
Vocabulary Size: 51200
Torch Data Type: float32
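The size figures above can be cross-checked with simple arithmetic: weight storage is roughly the parameter count times the bytes per parameter. The sketch below is an estimate for weights only, ignoring activations and the generation cache. The helper `weight_memory_gb` is illustrative.

```python
# Bytes needed to store one parameter in each data type
BYTES_PER_PARAM = {"float32": 4, "float16": 2, "bfloat16": 2}


def weight_memory_gb(num_params: float, dtype: str) -> float:
    """Approximate weight storage in GB (10**9 bytes), weights only."""
    return num_params * BYTES_PER_PARAM[dtype] / 1e9


for dtype in ("float32", "float16"):
    print(dtype, weight_memory_gb(1.4e9, dtype), "GB")
```

For 1.4 billion parameters this gives about 5.6 GB at float32 and about 2.8 GB at 16-bit precision. The listed 2.8 GB therefore matches 2 bytes per parameter. This is an inference from the numbers, not something the listing states.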

Best Alternatives to Phi 1

Model | Context / RAM | Downloads Likes
Phi 1 5 | 2K / 2.8 GB | 1321451315
Phi 1 5 Instruct V0.1 | 2K / 2.8 GB | 4970
Phi 1 5 Tldr Sft | 2K / 5.7 GB | 150
...i 1 5 Hinglish Text Pretrained | 2K / 5.7 GB | 2900
...tic Curriculum Subjects 1 To 5 | 2K / 0 GB | 130
...1 5 FULL Arithmetic Curriculum | 2K / 0 GB | 130
...ath Phi 1 5 FULL Arithmetic 2K | 2K / 0 GB | 130
Phibode 1 5 Ultraalpaca | 2K / 5.7 GB | 443
Phi Sentiment Analysis Model | 2K / 2.8 GB | 301
BitLinear Phi 1.5 | 2K / 5.7 GB | 271

Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241110