Model Type | text-generation, multimodal |
|
Use Cases |
Areas: | Commercial applications, Research use |
|
Applications: | Assistant-like chat, Natural language generation, Multilingual dialogue |
|
Primary Use Cases: | Assistant-like chat, Text completion, Code generation |
|
Limitations: | Unsuitable for unsupported languages without additional fine-tuning |
|
Considerations: | Encourages developers to responsibly deploy and use safeguards. |
|
|
Additional Notes | Model was trained 2x faster using Unsloth and Huggingface's TRL library. |
|
Supported Languages | English (High), German (High), French (High), Italian (High), Portuguese (High), Hindi (High), Spanish (High), Thai (High) |
|
Training Details |
Data Sources: | Publicly available online data |
|
Methodology: | Supervised fine-tuning (SFT) and reinforcement learning with human feedback (RLHF) |
|
Context Length: | |
Model Architecture: | Auto-regressive language model using an optimized transformer architecture with Grouped-Query Attention (GQA) |
|
|
Safety Evaluation |
Methodologies: | Red teaming, Adversarial testing |
|
Findings: | Potential vulnerabilities in multilingual capabilities, Inherent risks in capabilities such as coding and tool calls |
|
Risk Categories: | Misinformation, Bias, Cyber attack enablement |
|
Ethical Considerations: | Alignment with human preferences for safety through fine-tuning and reinforced learning with feedback. |
|
|
Responsible Ai Considerations |
Fairness: | Model designed to serve a wide range of use cases and backgrounds. |
|
Transparency: | Openly shares guidelines and system safeguards. |
|
Accountability: | Developers are expected to ensure responsible deployment and system safeguards. |
|
Mitigation Strategies: | Employs high-quality data selection and safety tuning datasets. |
|
|
Input Output |
Input Format: | ChatML or Alpaca templates for prompts. |
|
Accepted Modalities: | |
Output Format: | Multilingual text and code outputs. |
|
Performance Tips: | Use recommended prompts and ensure system safeguards are in place. |
|
|
Release Notes |
Version: | |
Date: | |
Notes: | Improved multilingual capabilities and released longer context window. |
|
|
|