Training Details |
Data Sources: | Riddle Joker (Private), Café Stella and the Reaper's Butterflies (Private), Senren*Banka (Private), roleplay4fun/aesir-v1.1, kalomaze/Opus_Instruct_3k, Gryphe/Sonnet3.5-SlimOrcaDedupCleaned, Aratako/Synthetic-JP-EN-Coding-Dataset-567k, Aratako/Synthetic-Japanese-Roleplay-gpt-4o-mini-39.6k-formatted, Aratako/Synthetic-Japanese-Roleplay-NSFW-Claude-3.5s-15.3k-formatted, Aratako_Rosebleu_1on1_Dialogues_RP, SkunkworksAI/reasoning-0.01, jondurbin_gutenberg_dpo, nbeerbower_gutenberg2_dpo, jondurbi_py_dpo, jondurbin_truthy_dpo, flammenai_character_roleplay_DPO, kyujinpy_orca_math_dpo, argilla_Capybara_Preferences, antiven0m_physical_reasoning_dpo, aixsatoshi_Swallow_MX_chatbot_DPO |
|
|