Bagel 8x7b V0.2 GGUF By TheBloke: Benchmarks, Features and Detailed Analysis. Insights on Bagel 8x7b V0.2 GGUF.

Base model:jondurbin/bagel-8x7... Base model:quantized:jondurbin... Conversational Dataset:ai2 arc Dataset:allenai/ultrafeedback ... Dataset:boolq Dataset:cais/mmlu Dataset:cakiki/rosetta-code Dataset:codeparrot/apps Dataset:datasets/winogrande Dataset:drop Dataset:facebook/belebele Dataset:intel/orca dpo pairs Dataset:jondurbin/airoboros-3.... Dataset:jondurbin/cinematika-v... Dataset:jondurbin/truthy-dpo-v... Dataset:julielab/emobank Dataset:kingbri/pippa-sharegpt Dataset:ldjnr/capybara Dataset:lmsys/lmsys-chat-1m Dataset:migtissera/synthia-v1.... Dataset:muennighoff/natural-in... Dataset:nvidia/helpsteer Dataset:open-orca/slimorca Dataset:openbookqa Dataset:piqa Dataset:spider Dataset:squad v2 Dataset:squish42/bluemoon-fand... Dataset:tiger-lab/mathinstruct Dataset:unalignment/toxic-dpo-... Dataset:vezora/tested-22k-pyth... Gguf Mixtral Moe Quantized Region:us

Model Card on HF 🤗: https://huggingface.co/TheBloke/bagel-8x7b-v0.2-GGUF

Bagel 8x7b V0.2 GGUF Benchmarks

LLME Score: 0.1461

^nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").

What is the LLM Explorer Rank (Score)

Bagel 8x7b V0.2 GGUF (TheBloke/bagel-8x7b-v0.2-GGUF)

Bagel 8x7b V0.2 GGUF Parameters and Internals

Model Type

mixtral

Additional Notes

The model includes examples of function/args generation and execution planning output format is JSON or YAML.

Training Details

Data Sources:

ai2_arc, Jondurbin/airoboros-3.2, codeparrot/apps, facebook/belebele, boolq, jondurbin/cinematika-v0.1, drop, lmsys/lmsys-chat-1m, TIGER-Lab/MathInstruct, cais/mmlu, Muennighoff/natural-instructions, openbookqa, piqa, Vezora/Tested-22k-Python-Alpaca, cakiki/rosetta-code, Open-Orca/SlimOrca, spider, squad_v2, migtissera/Synthia-v1.3, datasets/winogrande, nvidia/HelpSteer, Intel/orca_dpo_pairs, unalignment/toxic-dpo-v0.1, jondurbin/truthy-dpo-v0.1, allenai/ultrafeedback_binarized_cleaned, Squish42/bluemoon-fandom-1-1-rp-cleaned, LDJnr/Capybara, JULIELab/EmoBank, kingbri/PIPPA-shareGPT

Methodology:

Experimental fine-tune using bagel. After SFT phase, before DPO has been applied.

Context Length:

32768

Training Time:

4 days, 15 hours, 6 minutes, and 42 seconds

Hardware Used:

8x A6000 GPUs

Model Architecture:

Mixtral-8x7b

Input Output

Input Format:

Prompt format varies. Includes Alpaca, Vicuna, chat-ML, and Llama-2 formats.

Accepted Modalities:

text

Output Format:

As per prompt instructions for various formats.

Performance Tips:

Use a very low temperature for specific tasks (e.g., RA(G)/contextual question answering).

Release Notes

Version:

0.2

Notes:

Experimental fine-tune using bagel. Supports multiple prompt formats.

LLM Name	Bagel 8x7b V0.2 GGUF
Repository 🤗	https://huggingface.co/TheBloke/bagel-8x7b-v0.2-GGUF
Model Name	Bagel 8X7B v0.2
Model Creator	Jon Durbin
Base Model(s)	jondurbin/bagel-8x7b-v0.2 jondurbin/bagel-8x7b-v0.2
Model Size	1m
Required VRAM	15.6 GB
Updated	2025-06-01
Maintainer	TheBloke
Model Type	mixtral
Model Files	15.6 GB 20.4 GB 26.4 GB 26.4 GB 32.2 GB 32.2 GB 38.4 GB 49.6 GB
GGUF Quantization	Yes
Quantization Type	gguf
Model Architecture	AutoModel
License	apache-2.0

Best Alternatives to Bagel 8x7b V0.2 GGUF

Best Alternatives	Context / RAM	Downloads	Likes
Triangulum 1B GGUF	0K / 0.8 GB	60	1
Llama Deepsync 1B GGUF	0K / 0.8 GB	33	2
... SmolTalk 3.2 1B Instruct GGUF	0K / 0.8 GB	79	4
Mergekit Slerp Ozbsicw GGUF	0K / 2.5 GB	11	0
Tinyllama 1B Alpaca Gguf	0K / 0.7 GB	7	0
Prem 1B Chat GGUF	0K / 0.7 GB	7	1
Bagel DPO 8x7b V0.2 GGUF	0K / 17.3 GB	146	6
...inyllama 2 1B Miniguanaco GGUF	0K / 0.5 GB	437	16
Llama 3.2 1B Creative Lora	0K / 0 GB	0	3
Prem 1B Chat ORPO	0K / 0.3 GB	0	1

Note: green Score (e.g. "73.2") means that the model is better than TheBloke/bagel-8x7b-v0.2-GGUF.

Rank the Bagel 8x7b V0.2 GGUF Capabilities

🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟

Instruction Following and Task Automation
Factuality and Completeness of Knowledge
Censorship and Alignment
Data Analysis and Insight Generation
Text Generation
Text Summarization and Feature Extraction
Code Generation
Multi-Language Support and Translation

What open-source LLMs or SLMs are you in search of? 47753 in total.

Email us: info@extractum.io. Our Privacy Policy | Terms and Conditions | Suggest an improvement.

Our Social Media →

Original data from HuggingFace, OpenCompass and various public git repos.

Release v20241227

Support LLM Explorer

Bagel 8x7b V0.2 GGUF by TheBloke

» All LLMs » TheBloke » Bagel 8x7b V0.2 GGUF URL Share it on

Bagel 8x7b V0.2 GGUF Benchmarks

Bagel 8x7b V0.2 GGUF Parameters and Internals

Best Alternatives to Bagel 8x7b V0.2 GGUF

Rank the Bagel 8x7b V0.2 GGUF Capabilities

What open-source LLMs or SLMs are you in search of? 47753 in total.