Model Type | |
Use Cases | |
Additional Notes | OLMo is a series of open language models. |
|
Supported Languages | |
Training Details |
Data Sources: | allenai/tulu-3-sft-olmo-2-mixture, allenai/olmo-2-1124-13b-preference-mix, allenai/RLVR-GSM-MATH-IF-Mixed-Constraints |
|
Methodology: | supervised finetuning on TΓΌlu 3 dataset, DPO training, RLVR training |
|
|
Safety Evaluation |
Risk Categories: | |
Ethical Considerations: | Limited safety training, potential for problematic outputs. |
|
|
Input Output |
Accepted Modalities: | |
Output Format: | |
|
Release Notes |
Date: | |
Notes: | Post-trained variant with RLVR training. |
|
|
|