GEITje 7B by Rijgersberg

 ยป  All LLMs  ยป  Rijgersberg  ยป  GEITje 7B   URL Share it on

  Autotrain compatible Base model:finetune:mistralai/... Base model:mistralai/mistral-7...   Conversational Dataset:rijgersberg/geitje-pre...   Endpoints compatible   Geitje   Generated from trainer   Mistral   Nl   Region:us   Safetensors   Tensorboard
Model Card on HF ๐Ÿค—: https://huggingface.co/Rijgersberg/GEITje-7B 

GEITje 7B Benchmarks

GEITje 7B (Rijgersberg/GEITje-7B)

GEITje 7B Parameters and Internals

Model Type 
language model, text generation
Additional Notes 
Further trained on Dutch language data to enhance Dutch language skills and knowledge.
Supported Languages 
nl (high)
Training Details 
Data Sources:
Dutch Gigacorpus, MADLAD-400
Data Volume:
10 billion tokens
Methodology:
full-parameter finetune
Context Length:
8192
Hardware Used:
8 GPUs
LLM NameGEITje 7B
Repository ๐Ÿค—https://huggingface.co/Rijgersberg/GEITje-7B 
Base Model(s)  mistralai/Mistral-7B-v0.1   mistralai/Mistral-7B-v0.1
Model Size7b
Required VRAM0 GB
Updated2025-05-16
MaintainerRijgersberg
Model Typemistral
Model Files  0.0 GB
Supported Languagesnl
Model ArchitectureMistralForCausalLM
Licenseapache-2.0
Context Length32768
Model Max Length32768
Transformers Version4.36.0.dev0
Tokenizer ClassLlamaTokenizer
Padding Token</s>
Vocabulary Size32000
Torch Data Typebfloat16

Best Alternatives to GEITje 7B

Best Alternatives
Context / RAM
Downloads
Likes
...Nemo Instruct 2407 Abliterated1000K / 24.5 GB208115
MegaBeam Mistral 7B 512K512K / 14.4 GB141450
SpydazWeb AI HumanAI RP512K / 14.4 GB51
SpydazWeb AI HumanAI 002512K / 14.4 GB181
...daz Web AI ChatML 512K Project512K / 14.5 GB120
MegaBeam Mistral 7B 300K282K / 14.4 GB377916
MegaBeam Mistral 7B 300K282K / 14.4 GB110316
Hebrew Mistral 7B 200K256K / 30 GB306915
Astral 256K 7B V2250K / 14.4 GB130
Astral 256K 7B250K / 14.4 GB120
Note: green Score (e.g. "73.2") means that the model is better than Rijgersberg/GEITje-7B.

Rank the GEITje 7B Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 47377 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227