Model Type | |
Use Cases |
Areas: | Research, Commercial applications |
|
Applications: | General purpose AI systems, Applications requiring memory/compute constrained environments, Latency bound scenarios, Strong reasoning applications (e.g., code, math, and logic) |
|
Primary Use Cases: | Accelerating research on language and multimodal models, Building block for generative AI powered features |
|
Limitations: | Not designed or evaluated for all downstream purposes, Performance disparities across languages, Potentially generate inaccurate information |
|
Considerations: | Evaluate and mitigate for accuracy, safety, and fairness. Adhere to applicable laws or regulations. |
|
|
Additional Notes | This model was trained 2x faster using Unsloth and Huggingface's TRL library. |
|
Supported Languages | Arabic (Supported), Chinese (Supported), Czech (Supported), Danish (Supported), Dutch (Supported), English (Supported), Finnish (Supported), French (Supported), German (Supported), Hebrew (Supported), Hungarian (Supported), Italian (Supported), Japanese (Supported), Korean (Supported), Norwegian (Supported), Polish (Supported), Portuguese (Supported), Russian (Supported), Spanish (Supported), Swedish (Supported), Thai (Supported), Turkish (Supported), Ukrainian (Supported) |
|
Training Details |
Data Sources: | Phi-3 synthetic data, filtered publicly available websites |
|
Data Volume: | |
Methodology: | supervised fine-tuning, proximal policy optimization, direct preference optimization |
|
Context Length: | |
Training Time: | |
Hardware Used: | |
Model Architecture: | dense decoder-only Transformer model |
|
|
Safety Evaluation |
Methodologies: | |
Findings: | Multilingual performance and safety gaps, Representation of Harms & Perpetuation of Stereotypes, Inappropriate or Offensive Content, Information Reliability, Limited Scope for Code, Long Conversation |
|
Risk Categories: | Quality of Service, Multilingual performance and safety gaps, Representation of Harms & Perpetuation of Stereotypes, Inappropriate or Offensive Content, Information Reliability, Limited Scope for Code, Long Conversation |
|
Ethical Considerations: | Developers should implement additional safeguards at the application level and deploy models with appropriate mitigation measures to address potential biases and risks. |
|
|
Responsible Ai Considerations |
Fairness: | Models may over- or under-represent groups of people, erase representation of some groups, or reinforce negative stereotypes. |
|
Transparency: | Developers should inform end-users they are interacting with an AI system and follow transparency best practices. |
|
Accountability: | Developers are accountable for the model's outputs within their specific use case and cultural, linguistic context. |
|
Mitigation Strategies: | Fine-tune models for the specific use case, leverage language-specific safeguards, and perform regular assessments of high-risk scenarios. |
|
|
Input Output |
Input Format: | |
Accepted Modalities: | |
Output Format: | Generated text in response to input |
|
Performance Tips: | Use the chat format for best prompt outputs. |
|
|
Release Notes |
Date: | |
Notes: | Update over the June 2024 instruction-tuned Phi-3 Mini release, based on user feedback, focused on multilingual, multi-turn conversation quality, and reasoning capability improvements. |
|
|
|