Self Correct Ministral 8B Instruct 2410 MetaMathQA DPO Iter1 By RyanYr: Benchmarks, Features and Detailed Analysis. Insights on Self Correct Ministral 8B Instruct 2410 MetaMathQA DPO Iter1.

Arxiv:2305.18290 Autotrain compatible Base model:finetune:mistralai/... Base model:mistralai/mistral-s... Conversational Dpo Endpoints compatible Generated from trainer Instruct Mistral Region:us Safetensors Sharded Tensorflow Trl

Model Card on HF 🤗: https://huggingface.co/RyanYr/self-correct_Ministral-8B-Instruct-2410_metaMathQA_dpo_iter1

Self Correct Ministral 8B Instruct 2410 MetaMathQA DPO Iter1 Benchmarks

LLME Score: 0.21839

^nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").

What is the LLM Explorer Rank (Score)

Self Correct Ministral 8B Instruct 2410 MetaMathQA DPO Iter1 (RyanYr/self-correct_Ministral-8B-Instruct-2410_metaMathQA_dpo_iter1)

Self Correct Ministral 8B Instruct 2410 MetaMathQA DPO Iter1 Parameters and Internals

LLM Name	Self Correct Ministral 8B Instruct 2410 MetaMathQA DPO Iter1
Repository 🤗	https://huggingface.co/RyanYr/self-correct_Ministral-8B-Instruct-2410_metaMathQA_dpo_iter1
Model Name	self-correct_mistral-small-it_mMQA_dpo_iter1
Base Model(s)	Mistral Small Instruct 2409 mistralai/Mistral-Small-Instruct-2409
Model Size	8b
Required VRAM	16.1 GB
Updated	2024-11-15
Maintainer	RyanYr
Model Type	mistral
Instruction-Based	Yes
Model Files	5.0 GB: 1-of-4 5.0 GB: 2-of-4 5.0 GB: 3-of-4 1.1 GB: 4-of-4 0.0 GB
Model Architecture	MistralForCausalLM
Context Length	32768
Model Max Length	32768
Transformers Version	4.45.2
Tokenizer Class	LlamaTokenizer
Padding Token	[PAD]
Vocabulary Size	131073
Torch Data Type	bfloat16

Best Alternatives to Self Correct Ministral 8B Instruct 2410 MetaMathQA DPO Iter1

Best Alternatives	Context / RAM	Downloads	Likes
Ministral 8B Instruct 2410 HF	32K / 32 GB	82	10
Ministrations 8B V1	32K / 16.1 GB	28	20
Ministral 8B Slerp	32K / 29.2 GB	7	0
DeepOpus 1 8B Preview	32K / 16.1 GB	75	2
DeepNeo 1 8B Preview	32K / 16.1 GB	63	2
Bigger Body 8B	32K / 16.1 GB	11	5
Forgotten Safeword 8B V2.2	32K / 16.1 GB	6	1
Reflect Single Mini8B SftT12	32K / 16.1 GB	8	0
...ct Mini8B MistlrgOrcl460kSftT1	32K / 16.1 GB	9	0
...ct Mini8B MistlrgOrcl460kSftT2	32K / 16.1 GB	7	0

Note: green Score (e.g. "73.2") means that the model is better than RyanYr/self-correct_Ministral-8B-Instruct-2410_metaMathQA_dpo_iter1.

Rank the Self Correct Ministral 8B Instruct 2410 MetaMathQA DPO Iter1 Capabilities

🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟

Instruction Following and Task Automation
Factuality and Completeness of Knowledge
Censorship and Alignment
Data Analysis and Insight Generation
Text Generation
Text Summarization and Feature Extraction
Code Generation
Multi-Language Support and Translation

What open-source LLMs or SLMs are you in search of? 47753 in total.

Email us: info@extractum.io. Our Privacy Policy | Terms and Conditions | Suggest an improvement.

Our Social Media →

Original data from HuggingFace, OpenCompass and various public git repos.

Release v20241227

Support LLM Explorer

Self Correct Ministral 8B Instruct 2410 MetaMathQA DPO Iter1 by RyanYr

» All LLMs » RyanYr » Self Correct Ministral 8B Instruct 2410 MetaMathQA DPO Iter1 URL Share it on

Self Correct Ministral 8B Instruct 2410 MetaMathQA DPO Iter1 Benchmarks

Self Correct Ministral 8B Instruct 2410 MetaMathQA DPO Iter1 Parameters and Internals

Best Alternatives to Self Correct Ministral 8B Instruct 2410 MetaMathQA DPO Iter1

Rank the Self Correct Ministral 8B Instruct 2410 MetaMathQA DPO Iter1 Capabilities

What open-source LLMs or SLMs are you in search of? 47753 in total.