Reflect Mini8Bit Om2 460K Sft DPO T1 By RyanYr: Benchmarks, Features and Detailed Analysis. Insights on Reflect Mini8Bit Om2 460K Sft DPO T1.

Arxiv:2305.18290 Autotrain compatible Base model:finetune:ryanyr/ref... Base model:ryanyr/reflect mini... Conversational Dpo Endpoints compatible Generated from trainer Mistral Region:us Safetensors Sharded Tensorflow Trl

Model Card on HF 🤗: https://huggingface.co/RyanYr/reflect_mini8Bit_om2-460k_sft-dpo-t1

Reflect Mini8Bit Om2 460k Sft DPO T1 Benchmarks

LLME Score: 0.28891

^nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").

What is the LLM Explorer Rank (Score)

Reflect Mini8Bit Om2 460K Sft DPO T1 (RyanYr/reflect_mini8Bit_om2-460k_sft-dpo-t1)

Reflect Mini8Bit Om2 460K Sft DPO T1 Parameters and Internals

LLM Name	Reflect Mini8Bit Om2 460k Sft DPO T1
Repository 🤗	https://huggingface.co/RyanYr/reflect_mini8Bit_om2-460k_sft-dpo-t1
Model Name	reflect_mini8Bit_om2-460k_sft-dpo-t1
Base Model(s)	...flect Mini8Bit Om2 460k Sft T1 RyanYr/reflect_mini8Bit_om2-460k_sft-t1
Model Size	8b
Required VRAM	16.1 GB
Updated	2024-12-21
Maintainer	RyanYr
Model Type	mistral
Model Files	5.0 GB: 1-of-4 5.0 GB: 2-of-4 5.0 GB: 3-of-4 1.1 GB: 4-of-4 0.0 GB
Model Architecture	MistralForCausalLM
Context Length	32768
Model Max Length	32768
Transformers Version	4.45.2
Tokenizer Class	LlamaTokenizer
Padding Token	[PAD]
Vocabulary Size	131073
Torch Data Type	bfloat16

Best Alternatives to Reflect Mini8Bit Om2 460K Sft DPO T1

Best Alternatives	Context / RAM	Downloads	Likes
Ministral 8B Instruct 2410 HF	32K / 32 GB	50097	10
Ministrations 8B V1	32K / 16.1 GB	143	15
...inistral8Bit Om2 Sft T2 Lr.5 6	32K / 16.1 GB	938	0
Ministral 8B Slerp	32K / 29.2 GB	27	0
...flect Mini8Bit Om2 460k Sft T1	32K / 16.1 GB	130	0
...ButDuctTapeIsSilver Slurpee 7B	32K / 14.5 GB	22	0
...ruct 2410 MetaMathQA DPO Iter1	32K / 16.1 GB	387	0
Mistral Pro 8B V0.1	32K / 17.9 GB	654	66
...t Ministral8Bit MMQA Mix Iter2	32K / 16.1 GB	132	0
...t Ministral8Bit MMQA DPO Iter1	32K / 60.8 GB	89	0

Note: green Score (e.g. "73.2") means that the model is better than RyanYr/reflect_mini8Bit_om2-460k_sft-dpo-t1.

Rank the Reflect Mini8Bit Om2 460K Sft DPO T1 Capabilities

🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟

Instruction Following and Task Automation
Factuality and Completeness of Knowledge
Censorship and Alignment
Data Analysis and Insight Generation
Text Generation
Text Summarization and Feature Extraction
Code Generation
Multi-Language Support and Translation

What open-source LLMs or SLMs are you in search of? 40013 in total.

Email us: info@extractum.io. Our Privacy Policy | Terms and Conditions | Suggest an improvement.

Our Social Media →

Original data from HuggingFace, OpenCompass and various public git repos.

Release v20241217

Support LLM Explorer

Reflect Mini8Bit Om2 460K Sft DPO T1 by RyanYr

» All LLMs » RyanYr » Reflect Mini8Bit Om2 460K Sft DPO T1 URL Share it on

Reflect Mini8Bit Om2 460k Sft DPO T1 Benchmarks

Reflect Mini8Bit Om2 460K Sft DPO T1 Parameters and Internals

Best Alternatives to Reflect Mini8Bit Om2 460K Sft DPO T1

Rank the Reflect Mini8Bit Om2 460K Sft DPO T1 Capabilities

What open-source LLMs or SLMs are you in search of? 40013 in total.