Fireball 12B by EpistemeAI: Benchmarks, Features, and Detailed Analysis

Tags: Autotrain compatible, Base model:epistemeai/fireball..., Base model:finetune:epistemeai..., Dataset:candenizkocak/code-alp..., Dataset:reciperesearch/dolphin..., Dataset:yahma/alpaca-cleaned, En, Endpoints compatible, Mistral, Model-index, Region:us, Safetensors, Sharded, Tensorflow, Trl, Unsloth

Model Card on HF 🤗: https://huggingface.co/EpistemeAI/Fireball-12B

Fireball 12B Benchmarks

MMLU Pro: 26.04

GPQA: 1.57

MUSR: 12.52

BBH: 30.67

IFEval: 18.34 vs 88 (so35) ^-79.2%

MATH Lvl 5: 4.08

LLM Explorer (LLME) Score: 0.2345

^nn.n% shows how the model compares to the reference model: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o"), or GPT-4 ("gpt4").
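
The percentage after the caret appears to be the relative difference between the model's score and the reference score. A minimal sketch under that assumption, using the IFEval figures above (the function name `relative_diff_pct` is illustrative and not part of LLM Explorer):

```python
def relative_diff_pct(score: float, reference: float) -> float:
    """Relative difference of `score` against `reference`, in percent."""
    return (score - reference) / reference * 100.0


# IFEval: Fireball 12B scores 18.34, the "so35" reference scores 88.
ifeval_delta = relative_diff_pct(18.34, 88.0)
print(f"{ifeval_delta:+.1f}%")  # -79.2%
```

A negative value means the model scores below the reference model on that benchmark.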

Fireball 12B Parameters and Internals

Model Type

text-generation-inference, transformers, unsloth, mistral, trl

Additional Notes

The model is a pretrained base model and has no moderation mechanisms.

Training Details

Data Sources:

candenizkocak/code-alpaca-297k, yahma/alpaca-cleaned, reciperesearch/dolphin-sft-v0.1-preference

Methodology:

Supervised fine-tuning

Context Length:

128000

Model Architecture:

Transformer with 40 layers, model dimension 5,120, head dimension 128, hidden (feed-forward) dimension 14,436, SwiGLU activation, 32 attention heads, and 8 KV heads
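
As a rough cross-check of the "12B" label, the parameter count can be estimated from the architecture figures above. This is a back-of-the-envelope sketch, not the official count: it assumes no bias terms, untied input and output embeddings, and ignores normalization weights. The vocabulary size is taken from the "Vocabulary Size" field on this page.

```python
# Architecture values as listed on this page.
N_LAYERS = 40
D_MODEL = 5120
HEAD_DIM = 128
HIDDEN_DIM = 14436
N_HEADS = 32
N_KV_HEADS = 8
VOCAB_SIZE = 131072


def estimate_params() -> int:
    """Rough parameter count (assumes no biases, untied embeddings, norms ignored)."""
    q_out = N_HEADS * HEAD_DIM       # query/output projection width
    kv_out = N_KV_HEADS * HEAD_DIM   # key and value projection width
    # Attention: Q, K, V, and output projections.
    attn = D_MODEL * q_out + 2 * D_MODEL * kv_out + q_out * D_MODEL
    # SwiGLU feed-forward: gate, up, and down projections.
    mlp = 3 * D_MODEL * HIDDEN_DIM
    # Input embedding plus a separate output head (assumption).
    embeddings = 2 * VOCAB_SIZE * D_MODEL
    return N_LAYERS * (attn + mlp) + embeddings


total = estimate_params()
print(f"~{total / 1e9:.1f}B parameters")
```

Under these assumptions the estimate lands a little above 12 billion, in line with the model's name.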

LLM Name	Fireball 12B
Repository 🤗	https://huggingface.co/EpistemeAI/Fireball-12B
Base Model(s)	EpistemeAI/Fireball-Mistral-Nemo-Base-2407-sft-v2.2a
Model Size	12B
Required VRAM	24.5 GB
Updated	2025-06-02
Maintainer	EpistemeAI
Model Type	mistral
Model Files	5 shards of 4.9 GB each (1-of-5 to 5-of-5)
Supported Languages	en
Model Architecture	MistralForCausalLM
License	apache-2.0
Context Length	1024000
Model Max Length	1024000
Transformers Version	4.44.0
Tokenizer Class	PreTrainedTokenizerFast
Padding Token	<pad>
Vocabulary Size	131072
Torch Data Type	bfloat16
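
The memory figures can be sanity-checked with simple arithmetic. The sketch below sums the listed shard sizes and estimates the key/value cache size from the architecture values given under Training Details. It assumes the cache is stored in bfloat16 at batch size 1 with no cache quantization or sliding window; actual usage depends on the inference runtime.

```python
SHARD_GB = 4.9
N_SHARDS = 5
N_LAYERS, N_KV_HEADS, HEAD_DIM = 40, 8, 128
BYTES_PER_VALUE = 2  # bfloat16

# Weights: total size of the listed model files.
weights_gb = SHARD_GB * N_SHARDS


def kv_cache_bytes(n_tokens: int) -> int:
    """KV cache size: one key and one value vector per KV head, per layer, per token."""
    return 2 * N_LAYERS * N_KV_HEADS * HEAD_DIM * BYTES_PER_VALUE * n_tokens


print(f"weights: {weights_gb:.1f} GB")
print(f"KV cache per token: {kv_cache_bytes(1)} bytes")
print(f"KV cache at 128000 tokens: {kv_cache_bytes(128_000) / 1e9:.1f} GB")
```

The shard total matches the 24.5 GB listed as required VRAM, which suggests that figure covers the weights only. Long contexts need additional memory for the cache.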

Best Alternatives to Fireball 12B

Best Alternatives	Context / RAM	Downloads	Likes
...r Nemo 12B Instruct R 21 09 24	1000K / 24.5 GB	432631	119
Captain Eris Violet V0.420 12B	1000K / 24.5 GB	5269	47
MN 12B Mag Mell R1	1000K / 24.5 GB	12299	168
PLLuM 12B Nc Chat	1000K / 24.5 GB	5515	6
Captain Eris BMO Violent 12B	1000K / 24.5 GB	305	2
...s PersonalityEngine V1.1.0 12B	1000K / 24.5 GB	481	39
MISCHIEVOUS 12B Mix III Ex V	1000K / 24.5 GB	1549	0
Magnum V2 12B	1000K / 24.5 GB	25295	88
Dans SakuraKaze V1.0.0 12B	1000K / 24.5 GB	23	19
...n Eris BMO Violent GRPO V0.420	1000K / 24.5 GB	92	3

Original data from HuggingFace, OpenCompass and various public git repos.

Release v20241227
