Training Details |
Data Sources: | Finnish-NLP/CulturaX_fi_cleaned, Finnish-NLP/HPLT_1.2_fi_cleaned, Finnish-NLP/wikipedia_20231101_fi_cleaned, Finnish-NLP/Reddit_fi_2006_2022, Yle Finnish News Archive 2011-2018, Yle Finnish News Archive 2019-2020, Finnish News Agency Archive (STT), The Suomi24 Sentences Corpus, Project LΓΆnnrot, Finnish parliament speeches, multilingual_cc_news, fi-news-corpus, Finnish higher education public theses, Finnish single-turn instruction-following datasets |
|
Data Volume: | |
Methodology: | Pretrained on Finnish language, resampling and filtering techniques used, included instruction-following examples mixed in. |
|
Context Length: | |
Training Time: | |
Hardware Used: | |
Model Architecture: | 3B parameter, decoder-only transformer |
|
|