Model Type | Dense decoder-only Transformer, Text generation |
|
Use Cases |
Areas: | Research, Commercial applications |
|
Applications: | Memory/compute constrained environments, Latency bound scenarios, Strong reasoning in code, math and logic |
|
Primary Use Cases: | Language model building, Generative AI features |
|
Limitations: | Limited language support outside English, Misrepresentation of groups, Inappropriate responses possible, Potential for misinformation |
|
Considerations: | Assess outputs for context, legality and relevance of use. Utilize safety classifiers or custom solutions. |
|
|
Additional Notes | Cross-platform support via ONNX runtime for various devices. |
|
Supported Languages | |
Training Details |
Data Sources: | Publicly available documents, High-quality educational data, Synthetic data |
|
Data Volume: | |
Methodology: | Supervised fine-tuning (SFT), Direct Preference Optimization (DPO) |
|
Context Length: | |
Training Time: | |
Hardware Used: | |
Model Architecture: | Dense decoder-only Transformer |
|
|
Safety Evaluation |
Methodologies: | Post-training supervised fine-tuning, Direct Preference Optimization |
|
Findings: | Potential for bias in representation of groups, Possible generation of inappropriate content, Potential for misinformation |
|
Risk Categories: | Misinformation, Bias, Inappropriate content |
|
Ethical Considerations: | Evaluate suitability for high-risk scenarios. Ensure legality of use. |
|
|
Responsible Ai Considerations |
Fairness: | Recognize the potential for unfair or biased outputs. Evaluate and mitigate these risks before using in sensitive applications. |
|
Transparency: | Follow transparency best practices by informing users they are interacting with AI. Use known techniques to ground responses in use-case specific information. |
|
Accountability: | Developers are responsible for compliance with relevant laws and regulations. Implement necessary safeguards in high-risk scenarios. |
|
Mitigation Strategies: | Apply responsible AI best practices and debiasing techniques for high-stakes scenarios. |
|
|
Input Output |
Input Format: | Text, chat format prompts |
|
Accepted Modalities: | |
Output Format: | Generated responses to input |
|
Performance Tips: | Use few-shot prompting and ensure context is within 4K tokens. |
|
|