Gogpt 3B Bloom by golaxy

 ยป  All LLMs  ยป  golaxy  ยป  Gogpt 3B Bloom   URL Share it on

  Autotrain compatible   Bloom Dataset:bellegroup/school math... Dataset:bellegroup/train 0.5m ...   Dataset:bellegroup/train 1m cn   Dataset:bellegroup/train 2m cn Dataset:bellegroup/train 3.5m ...   Endpoints compatible   Pytorch   Region:us   Sharded   Tensorboard   Zh
Model Card on HF ๐Ÿค—: https://huggingface.co/golaxy/gogpt-3b-bloom 

Gogpt 3B Bloom Benchmarks

Gogpt 3B Bloom (golaxy/gogpt-3b-bloom)

Gogpt 3B Bloom Parameters and Internals

Model Type 
text generation
Use Cases 
Areas:
research, commercial applications
Additional Notes 
Development of GoGPT is ongoing, with future plans to incorporate RLFH training and English-Chinese parallel corpora.
Supported Languages 
zh (high)
Training Details 
Data Sources:
BelleGroup/train_2M_CN, BelleGroup/train_3.5M_CN, BelleGroup/train_1M_CN, BelleGroup/train_0.5M_CN, BelleGroup/school_math_0.25M
Methodology:
Fine-tuning using diverse Chinese instruction data.
Model Architecture:
BLOOM-based model
LLM NameGogpt 3B Bloom
Repository ๐Ÿค—https://huggingface.co/golaxy/gogpt-3b-bloom 
Model Size3b
Required VRAM14.6 GB
Updated2024-12-26
Maintainergolaxy
Model Typebloom
Model Files  9.9 GB: 1-of-2   4.7 GB: 2-of-2   0.0 GB
Supported Languageszh
Model ArchitectureBloomForCausalLM
Licenseapache-2.0
Model Max Length2048
Transformers Version4.29.1
Tokenizer ClassBloomTokenizer
Padding Token<pad>
Vocabulary Size250880
Torch Data Typefloat32

Best Alternatives to Gogpt 3B Bloom

Best Alternatives
Context / RAM
Downloads
Likes
Bloom 3B Conversational0K / 6.1 GB3541
Bloomz 3B Sft Chat0K / 6 GB114712
Gpt3 Finnish 3B Instruct0K / 12.7 GB231
... Bloom 3B Conversational 4bits0K / 2.6 GB120
Blossom V2 3B0K / 6 GB11930
Blossom V1 3B0K / 6 GB11951
Deer 3B0K / 14.6 GB11612
Bloom Zh 3B Chat0K / 6 GB122611
...oom 3B Bangla Hasib Pretrained0K / 3.6 GB410
...lish Dental Raw New Pretrained0K / 3.6 GB200
Note: green Score (e.g. "73.2") means that the model is better than golaxy/gogpt-3b-bloom.

Rank the Gogpt 3B Bloom Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 40248 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241217