Granite 3.0 3B A800m Instruct By ibm-granite: Benchmarks, Features and Detailed Analysis. Insights on Granite 3.0 3B A800m Instruct.

Summarization, Text classification, Text extraction, Question-answering, Retrieval Augmented Generation (RAG), Code related tasks, Function-calling tasks, Multilingual dialog use cases

Limitations:

Might not perform equally across all languages as in English., Potential for inaccurate, biased, or unsafe responses without proper safety testing.

Considerations:

Proper safety testing and example tuning tailored for specific tasks.

Additional Notes

The model infrastructure is environmentally friendly, leveraging 100% renewable energy.

Supported Languages

English (supported), German (supported), Spanish (supported), French (supported), Japanese (supported), Portuguese (supported), Arabic (supported), Czech (supported), Italian (supported), Korean (supported), Dutch (supported), Chinese (supported)

Training Details

Data Sources:

publicly available datasets with permissive license, internal synthetic data, human-curated data

Methodology:

supervised finetuning, model alignment using reinforcement learning, and model merging

Context Length:

4096

Hardware Used:

IBM's supercomputing cluster, Blue Vela with NVIDIA H100 GPUs

Model Architecture:

decoder-only sparse Mixture of Experts (MoE) transformer architecture

Responsible Ai Considerations

Fairness:

multilingual data, but primary tuning on English instruction-response pairs.

Transparency:

Model developed by Granite Team, IBM. See accompanying technical documentation.

Mitigation Strategies:

Introducing few-shot learning for improved accuracy on multilingual tasks.

Input Output

Input Format:

chat template with role, content fields

Accepted Modalities:

text

Output Format:

text

Performance Tips:

Adjust sequence length as required.

Release Notes

Date:

October 21st, 2024

Notes:

Initial release with instruction tuning and multilingual capabilities.

LLM Name	Granite 3.0 3B A800m Instruct
Repository 🤗	https://huggingface.co/ibm-granite/granite-3.0-3b-a800m-instruct
Base Model(s)	ibm-granite/granite-3.0-3b-a800m-base ibm-granite/granite-3.0-3b-a800m-base
Model Size	3b
Required VRAM	6.8 GB
Updated	2025-06-01
Maintainer	ibm-granite
Model Type	granitemoe
Instruction-Based	Yes
Model Files	5.0 GB: 1-of-2 1.8 GB: 2-of-2
Model Architecture	GraniteMoeForCausalLM
License	apache-2.0
Context Length	4096
Model Max Length	4096
Transformers Version	4.46.0.dev0
Tokenizer Class	GPT2Tokenizer
Padding Token	<\|end_of_text\|>
Vocabulary Size	49155
Torch Data Type	bfloat16
Errors	replace

Best Alternatives to Granite 3.0 3B A800m Instruct

Best Alternatives	Context / RAM	Downloads	Likes
Granite 3.1 3B A800m Instruct	128K / 6.6 GB	61603	24

Note: green Score (e.g. "73.2") means that the model is better than ibm-granite/granite-3.0-3b-a800m-instruct.

Rank the Granite 3.0 3B A800m Instruct Capabilities

🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟

Instruction Following and Task Automation
Factuality and Completeness of Knowledge
Censorship and Alignment
Data Analysis and Insight Generation
Text Generation
Text Summarization and Feature Extraction
Code Generation
Multi-Language Support and Translation

What open-source LLMs or SLMs are you in search of? 47770 in total.

Email us: info@extractum.io. Our Privacy Policy | Terms and Conditions | Suggest an improvement.

Our Social Media →

Original data from HuggingFace, OpenCompass and various public git repos.

Release v20241227

Support LLM Explorer

Granite 3.0 3B A800m Instruct by ibm-granite

» All LLMs » ibm-granite » Granite 3.0 3B A800m Instruct URL Share it on

Granite 3.0 3B A800m Instruct Benchmarks

Granite 3.0 3B A800m Instruct Parameters and Internals

Best Alternatives to Granite 3.0 3B A800m Instruct

Rank the Granite 3.0 3B A800m Instruct Capabilities

What open-source LLMs or SLMs are you in search of? 47770 in total.