Model Type | text generation, decoder-only, large language model |
|
Use Cases |
Areas: | Content Creation and Communication, Research and Education |
|
Applications: | Text Generation, Chatbots and Conversational AI, Text Summarization, NLP Research, Language Learning Tools, Knowledge Exploration |
|
Primary Use Cases: | Text generation tasks such as question answering, summarization, reasoning |
|
Limitations: | Biases or gaps in data, open-ended tasks may be challenging, accuracy on factual information |
|
Considerations: | Developers should be mindful of content safety and privacy issues. |
|
|
Additional Notes | Models require adequate safety safeguards based on application use cases. |
|
Supported Languages | |
Training Details |
Data Sources: | Web Documents, Code, Mathematics |
|
Data Volume: | 9B model with 8 trillion tokens |
|
Hardware Used: | |
Model Architecture: | text-to-text, decoder-only |
|
|
Safety Evaluation |
Methodologies: | internal red-teaming, structured evaluations |
|
Findings: | Meets internal policies for child safety, content safety, representational harms, memorization, large-scale harms |
|
Risk Categories: | child sexual abuse and exploitation, harassment, violence and gore, hate speech |
|
|
Responsible Ai Considerations |
Fairness: | Models undergo scrutiny for socio-cultural biases. |
|
Transparency: | Model details are shared in the model card. |
|
Accountability: | Guidelines for responsible use are provided. |
|
Mitigation Strategies: | Continuous monitoring and de-biasing encouraged. |
|
|
Input Output |
Input Format: | |
Accepted Modalities: | |
Output Format: | Generated English-language text |
|
|