Model Type | text-to-text, decoder-only, large language model |
|
Use Cases |
Areas: | Research, Commercial applications |
|
Applications: | Content Creation and Communication, Research and Education |
|
Primary Use Cases: | Text Generation, Chatbots and Conversational AI, Text Summarization |
|
Limitations: | Open-ended or highly complex tasks, Language ambiguity and nuance, Factual inaccuracy, Common sense reasoning |
|
|
Additional Notes | Trained using JAX and ML Pathways. |
|
Supported Languages | |
Training Details |
Data Sources: | Web Documents, Code, Mathematics |
|
Data Volume: | |
Hardware Used: | |
|
Safety Evaluation |
Methodologies: | Red-teaming, Internal evaluations |
|
Findings: | Within acceptable thresholds for internal policies |
|
Risk Categories: | Text-to-Text Content Safety, Text-to-Text Representational Harms, Memorization, Large-scale harm |
|
|
Responsible Ai Considerations |
Fairness: | Careful scrutiny and pre-processing of input data |
|
Transparency: | Model card summarizes the model's details |
|
Accountability: | Model creators accountable for evaluation processes |
|
Mitigation Strategies: | Continuous monitoring and exploration of de-biasing techniques |
|
|
Input Output |
Input Format: | |
Accepted Modalities: | |
Output Format: | Generated English-language text |
|
Performance Tips: | Use bfloat16 or float16 for better performance on compatible hardware. |
|
|