Bloom 7b1 By bigscience: Benchmarks, Features and Detailed Analysis. Insights on Bloom 7b1.

ak (Akan), ar (Arabic), as (Assamese), bm (Bambara), bn (Bengali), ca (Catalan), code (Programming languages), en (English), es (Spanish), eu (Basque), fon (Fon), fr (French), gu (Gujarati), hi (Hindi), id (Indonesian), ig (Igbo), ki (Kikuyu), kn (Kannada), lg (Luganda), ln (Lingala), ml (Malayalam), mr (Marathi), ne (Nepali), nso (Northern Sotho), ny (Nyanja), or (Odia), pa (Punjabi), pt (Portuguese), rn (Kirundi), rw (Kinyarwanda), sn (Shona), st (Southern Sotho), sw (Swahili), ta (Tamil), te (Telugu), tn (Tswana), ts (Tsonga), tum (Tumbuka), tw (Twi), ur (Urdu), vi (Vietnamese), wo (Wolof), xh (Xhosa), yo (Yoruba), zh (Chinese), zhs (Simplified Chinese), zht (Traditional Chinese), zu (Zulu)

Training Details

Data Sources:

45 natural languages, 12 programming languages

Data Volume:

1.5TB (350B unique tokens)

Methodology:

Modified from Megatron-LM GPT2

Context Length:

2048

Training Time:

11th March 2022 to 5th July 2022

Hardware Used:

Jean Zay Supercomputer with 384 A100 80GB GPUs

Model Architecture:

Decoder-only architecture with layer normalization applied to word embeddings, 30 layers, 32 attention heads, ALiBI positional encodings with GeLU activation functions

Responsible Ai Considerations

Fairness:

Model may overrepresent some viewpoints and underrepresent others.

Input Output

Input Format:

Byte-level Byte Pair Encoding (BPE) algorithm

Accepted Modalities:

text

Output Format:

Standard natural language outputs

Performance Tips:

None specified

Release Notes

Version:

1.0.0

Date:

11 July 2022

Notes:

Initial release for public research on large language models.

LLM Name	Bloom 7b1
Repository 🤗	https://huggingface.co/bigscience/bloom-7b1
Model Size	7.1b
Required VRAM	14.2 GB
Updated	2025-06-01
Maintainer	bigscience
Model Type	bloom
Model Files	10.0 GB: 1-of-2 4.2 GB: 2-of-2 10.0 GB: 1-of-2 4.2 GB: 2-of-2
Supported Languages	ak ar as bm bn ca code en es eu fr gu hi id ig ki kn lg ln ml mr ne ny or pa pt rn rw sn st sw ta te tn ts tw ur vi wo xh yo zh zu
Model Architecture	BloomForCausalLM
License	bigscience-bloom-rail-1.0
Transformers Version	4.22.2
Tokenizer Class	BloomTokenizerFast
Padding Token	<pad>
Vocabulary Size	250880
Torch Data Type	float16

Quantized Models of the Bloom 7b1

Model	Likes	Downloads	VRAM
Bloom 7b1 GPTQ 4bit G128	2	11	7 GB
Bloom 7b1 Gptq 4bit	2	27	7 GB

Best Alternatives to Bloom 7b1

Best Alternatives	Context / RAM	Downloads	Likes
...ft Fpft Multilingual Bloom 7b1	0K / 28.2 GB	18	1
Bloomz 7b1 Mt Sft Chat	0K / 14.2 GB	23	16
Bloomz 7b1 Mt	0K / 14.1 GB	25759	140
Bloomz 7b1	0K / 14.1 GB	6139	144
Bloomz 7b1 P3	0K / 14.1 GB	39	6

Note: green Score (e.g. "73.2") means that the model is better than bigscience/bloom-7b1.

Rank the Bloom 7b1 Capabilities

🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟

Instruction Following and Task Automation
Factuality and Completeness of Knowledge
Censorship and Alignment
Data Analysis and Insight Generation
Text Generation
Text Summarization and Feature Extraction
Code Generation
Multi-Language Support and Translation

What open-source LLMs or SLMs are you in search of? 47753 in total.

Email us: info@extractum.io. Our Privacy Policy | Terms and Conditions | Suggest an improvement.

Our Social Media →

Original data from HuggingFace, OpenCompass and various public git repos.

Release v20241227

Support LLM Explorer

Bloom 7b1 by bigscience

» All LLMs » bigscience » Bloom 7b1 URL Share it on

Bloom 7b1 Benchmarks

Bloom 7b1 Parameters and Internals

Quantized Models of the Bloom 7b1

Best Alternatives to Bloom 7b1

Rank the Bloom 7b1 Capabilities

What open-source LLMs or SLMs are you in search of? 47753 in total.