Model Type | |
Use Cases |
Areas: | text rewriting, summarization, function calling |
|
Applications: | research, commercial applications |
|
Limitations: | Primarily understands and generates content in English., Generated content may not be factually accurate, logically consistent, or free from biases., Should be used as assistive tools rather than definitive sources of information. |
|
Considerations: | Users should verify important information and critically evaluate any generated content. |
|
|
Additional Notes | The instruct version of the model is tuned to support tasks beyond standard language modeling. |
|
Supported Languages | |
Training Details |
Data Sources: | FineWeb-Edu, DCLM, The Stack, new mathematics dataset, coding dataset |
|
Data Volume: | |
Methodology: | Supervised fine-tuning (SFT), Direct Preference Optimization (DPO) |
|
Hardware Used: | |
Model Architecture: | |
|
Input Output |
Accepted Modalities: | |
Performance Tips: | |
|