Model Type | |
Use Cases |
Areas: | research, commercial applications |
|
|
Additional Notes | Model includes a comprehensive 150,000-token vocabulary optimized for multi-languages. |
|
Supported Languages | zh (high), en (high), multilingual (medium) |
|
Training Details |
Data Sources: | |
Data Volume: | |
Context Length: | |
Model Architecture: | |
|
Input Output |
Input Format: | Standard transformer inputs |
|
Accepted Modalities: | |
Output Format: | |
Performance Tips: | Installation of optional flash-attention for increased efficiency and reduced memory. |
|
|