Gpt3 Finnish 3B by TurkuNLP


Tags: Arxiv:2203.02155, Bloom, Endpoints compatible, Feature-extraction, Fi, Pytorch, Region:us


Gpt3 Finnish 3B Parameters and Internals

Model Type 
pretrained, language model, text generation
Additional Notes 
Note that the models are pure language models, meaning that they are not instruction finetuned for dialogue or answering questions. The intention is to use them as foundational models that can be instruction finetuned.
Supported Languages 
Finnish (native)
Training Details 
Data Sources:
Finnish Internet Parsebank, mC4, Common Crawl Finnish, Finnish Wikipedia, Lönnrot Projekti, ePub National library, National library 'lehdet' collection, Suomi24, Reddit r/Suomi, STT Finnish News Agency Archive, Yle Finnish News Archive
Data Volume:
300B tokens
Model Architecture:
BLOOM-architecture
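
The note above says the model is a pure language model with no instruction finetuning, so in practice a task has to be framed as text for the model to continue. The following is a minimal sketch of that idea using the Hugging Face transformers library. The `Kysymys`/`Vastaus` (question/answer) prompt layout, the helper names, and the sampling settings are illustrative assumptions, not anything TurkuNLP recommends. It also assumes the checkpoint loads through `AutoModelForCausalLM`.

```python
# Sketch only: prompt format and sampling settings are illustrative assumptions.
MODEL_ID = "TurkuNLP/gpt3-finnish-3B"  # repository id listed on this page


def build_few_shot_prompt(examples, query):
    """Frame a task as text to continue, since the model is not instruction tuned.

    `examples` is a list of (question, answer) pairs. The "Kysymys"/"Vastaus"
    labels (Finnish for question/answer) are a hypothetical format.
    """
    blocks = [f"Kysymys: {q}\nVastaus: {a}" for q, a in examples]
    # End with an open "Vastaus:" so the most likely continuation is an answer.
    blocks.append(f"Kysymys: {query}\nVastaus:")
    return "\n\n".join(blocks)


def complete(prompt, max_new_tokens=50):
    """Continue `prompt` with the model. The first call downloads the weights."""
    # Imported lazily so the prompt helper works without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(MODEL_ID)
    inputs = tokenizer(prompt, return_tensors="pt")
    output_ids = model.generate(
        **inputs, max_new_tokens=max_new_tokens, do_sample=True, top_p=0.9
    )
    # Decode only the newly generated tokens, not the echoed prompt.
    new_tokens = output_ids[0][inputs["input_ids"].shape[1]:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True)
```

Calling `complete(build_few_shot_prompt(pairs, question))` would then return the model's continuation. Because nothing stops the model from continuing past the answer, callers may want to cut the output at the first blank line.
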
LLM Name: Gpt3 Finnish 3B
Repository: https://huggingface.co/TurkuNLP/gpt3-finnish-3B
Model Size: 3b
Required VRAM: 11.4 GB
Updated: 2025-02-22
Maintainer: TurkuNLP
Model Type: bloom
Model Files: 11.4 GB
Supported Languages: fi
Model Architecture: BloomModel
License: apache-2.0
Model Max Length: 2048
Transformers Version: 4.26.0.dev0
Tokenizer Class: BloomTokenizer
Padding Token: <pad>
Vocabulary Size: 131072
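
The figures above can be sanity-checked with simple arithmetic. The sketch below estimates the size of the weights for a given parameter count and data type, and checks whether a prompt plus its generated tokens fit in the 2048-token maximum length. The nominal count of 3 billion parameters is an assumption read from the "3b" label, and the function names are hypothetical. At 4 bytes per parameter the estimate is about 12 GB, close to the listed 11.4 GB, which suggests (though the page does not state it) that the files are stored in 32-bit floats.

```python
# Bytes needed to store one parameter in common data types.
BYTES_PER_PARAM = {"float32": 4, "float16": 2, "bfloat16": 2, "int8": 1}


def weight_size_gb(num_params, dtype="float32"):
    """Approximate size of the weights alone, in decimal gigabytes (1e9 bytes).

    Activations and the attention cache are not included, so actual memory
    use during generation will be higher.
    """
    return num_params * BYTES_PER_PARAM[dtype] / 1e9


def fits_context(prompt_tokens, max_new_tokens, max_length=2048):
    """True if the prompt plus generated tokens stay within the max length."""
    return prompt_tokens + max_new_tokens <= max_length


nominal_params = 3e9  # assumption: "3b" taken as exactly 3 billion
print(weight_size_gb(nominal_params, "float32"))
print(weight_size_gb(nominal_params, "float16"))
```

Under the same assumption, loading the weights in a 16-bit type would need roughly half the memory.
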

Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227