Model Type | text generation, multilingual |
|
Use Cases |
Areas: | |
Applications: | assistant-like chat, natural language generation, synthetic data generation, distillation |
|
Primary Use Cases: | Instruction tuned text models for multilingual dialogues |
|
Limitations: | Prohibited in violation of laws or regulations, Non-supported languages without fine-tuning and controls |
|
Considerations: | Developers responsible for ensuring responsible use in non-supported languages. |
|
|
Additional Notes | Static model trained on an offline dataset. Future tuned versions will incorporate safety improvements via community feedback. |
|
Supported Languages | en (native), de (high), fr (high), it (high), pt (high), hi (high), es (high), th (high) |
|
Training Details |
Data Sources: | A new mix of publicly available online data |
|
Data Volume: | |
Methodology: | |
Context Length: | |
Model Architecture: | Auto-regressive, optimized transformer |
|
|
Safety Evaluation |
Methodologies: | red teaming, adversarial tests |
|
Risk Categories: | CBRNE helpfulness, Child Safety, Cyber attack enablement |
|
Ethical Considerations: | Developers expected to use system safeguards to tailor safety for specific use cases. |
|
|
Responsible Ai Considerations |
Transparency: | Developers responsible for integrating safeguards with third-party tools. |
|
Mitigation Strategies: | Included safety fine-tuning; emphasis on refusals and tone guidance. |
|
|
Input Output |
Input Format: | |
Accepted Modalities: | |
Output Format: | Multilingual text and code |
|
|
Release Notes |
Version: | |
Date: | |
Notes: | Initial launch with longer context window, multilingual support, and fine-tuning capabilities. |
|
|
|