MAmmoTH2 8x7B Plus By TIGER-Lab: Benchmarks, Features and Detailed Analysis. Insights on MAmmoTH2 8x7B Plus.

Merged Model Arxiv:2405.03548 Autotrain compatible Conversational Dataset:tiger-lab/webinstructs... En Endpoints compatible Mixtral Moe Region:us Safetensors Sharded Tensorflow

Model Card on HF 🤗: https://huggingface.co/TIGER-Lab/MAmmoTH2-8x7B-Plus

MAmmoTH2 8x7B Plus Benchmarks

LLME Score: 0.22039

^nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").

What is the LLM Explorer Rank (Score)

MAmmoTH2 8x7B Plus (TIGER-Lab/MAmmoTH2-8x7B-Plus)

MAmmoTH2 8x7B Plus Parameters and Internals

Model Type

text generation, instruction tuning

Use Cases

Areas:

research, commercial applications

Applications:

reasoning benchmarks, chatbot benchmarks

Primary Use Cases:

enhancing reasoning abilities of LLMs, instruction tuning

Limitations:

Performance may vary based on the complexity and specifics of the math problem., Not all mathematical fields are covered comprehensively.

Additional Notes

Models show improvement in performance after fine-tuning on WEBINSTRUCT.

Supported Languages

en (English - High proficiency)

Training Details

Data Sources:

https://huggingface.co/datasets/TIGER-Lab/WebInstructSub

Methodology:

Fine-tuning with the WEBINSTRUCT dataset.

Input Output

Input Format:

Prompting with instructions formatted using special tokens `~~[INST]` and `[/INST]`.

Accepted Modalities:

text

Output Format:

Generated text responses

Performance Tips:

Model is not very sensitive to the chat template.

Release Notes

Version:

MAmmoTH2-7B

Notes:

Achieves performance of 36.7% on MATH and 68.4% on GSM8K.

Version:

MAmmoTH2-8B

Notes:

Achieves performance of 35.8% on MATH and 70.4% on GSM8K.

Version:

MAmmoTH2-8x7B

Notes:

Achieves outstanding performance with a variety of datasets.

LLM Name	MAmmoTH2 8x7B Plus
Repository 🤗	https://huggingface.co/TIGER-Lab/MAmmoTH2-8x7B-Plus
Merged Model	Yes
Model Size	46.7b
Required VRAM	93.6 GB
Updated	2025-02-22
Maintainer	TIGER-Lab
Model Type	mixtral
Model Files	4.9 GB: 1-of-19 5.0 GB: 2-of-19 5.0 GB: 3-of-19 4.9 GB: 4-of-19 5.0 GB: 5-of-19 5.0 GB: 6-of-19 4.9 GB: 7-of-19 5.0 GB: 8-of-19 5.0 GB: 9-of-19 4.9 GB: 10-of-19 5.0 GB: 11-of-19 5.0 GB: 12-of-19 5.0 GB: 13-of-19 4.9 GB: 14-of-19 5.0 GB: 15-of-19 5.0 GB: 16-of-19 4.9 GB: 17-of-19 5.0 GB: 18-of-19 4.2 GB: 19-of-19 0.0 GB
Supported Languages	en
Model Architecture	MixtralForCausalLM
License	mit
Context Length	32768
Model Max Length	32768
Transformers Version	4.40.0
Tokenizer Class	LlamaTokenizer
Padding Token	</s>
Vocabulary Size	32000
Torch Data Type	bfloat16

Best Alternatives to MAmmoTH2 8x7B Plus

Best Alternatives	Context / RAM	Downloads	Likes
Mixtral 8x7B Instruct V0.1	32K / 93.6 GB	529368	4311
Nous Hermes 2 Mixtral 8x7B DPO	32K / 93.6 GB	2915	424
Mixtral 8x7B V0.1	32K / 93.6 GB	30874	1677
GritLM 8x7B KTO	32K / 93.6 GB	3667	3
Smaug Mixtral V0.1	32K / 187.7 GB	3627	12
Sensualize Mixtral Bf16	32K / 93.6 GB	0	0
Skadi Mixtral V1	32K / 93.5 GB	0	0
Franziska Mixtral V1	32K / 93.5 GB	0	0
Typhon Mixtral V1	32K / 93.4 GB	0	0
Merge Mixtral Prometheus 8x7B	32K / 91.9 GB	12	2

Note: green Score (e.g. "73.2") means that the model is better than TIGER-Lab/MAmmoTH2-8x7B-Plus.

Rank the MAmmoTH2 8x7B Plus Capabilities

🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟

Instruction Following and Task Automation
Factuality and Completeness of Knowledge
Censorship and Alignment
Data Analysis and Insight Generation
Text Generation
Text Summarization and Feature Extraction
Code Generation
Multi-Language Support and Translation

What open-source LLMs or SLMs are you in search of? 43470 in total.

Email us: info@extractum.io. Our Privacy Policy | Terms and Conditions | Suggest an improvement.

Our Social Media →

Original data from HuggingFace, OpenCompass and various public git repos.

Release v20241227

Support LLM Explorer

MAmmoTH2 8x7B Plus by TIGER-Lab

» All LLMs » TIGER-Lab » MAmmoTH2 8x7B Plus URL Share it on

MAmmoTH2 8x7B Plus Benchmarks

MAmmoTH2 8x7B Plus Parameters and Internals

Best Alternatives to MAmmoTH2 8x7B Plus

Rank the MAmmoTH2 8x7B Plus Capabilities

What open-source LLMs or SLMs are you in search of? 43470 in total.