Training Details |
Data Sources: | jondurbin/airoboros-3.2, bluemoon-fandom-1-1-rp-cleaned, boolq, LDJnr/Capybara, jondurbin/cinematika-v0.1, glaiveai/glaive-function-calling-v2, grimulkan/LimaRP-augmented, piqa, Vezora/Tested-22k-Python-Alpaca, mattpscott/airoboros-summarization, unalignment/toxic-dpo-v0.2, allenai/ultrafeedback_binarized_cleaned, argilla/distilabel-intel-orca-dpo-pairs, jondurbin/contextual-dpo-v0.1, jondurbin/gutenberg-dpo-v0.1, jondurbin/py-dpo-v0.1, jondurbin/truthy-dpo-v0.1, lmsys/lmsys-chat-1m |
|
Model Architecture: | llama-3 instruct chat template |
|
|