OLMoE 1B 7B 0924 SFT By allenai: Benchmarks, Features and Detailed Analysis. Insights on OLMoE 1B 7B 0924 SFT.

Arxiv:2409.02060 Base model:allenai/olmoe-1b-7b... Base model:finetune:allenai/ol... Co2 eq emissions Dataset:allenai/tulu-v3.1-mix-... En Moe Olmo Olmoe Region:us Safetensors Sharded Tensorflow

Model Card on HF 🤗: https://huggingface.co/allenai/OLMoE-1B-7B-0924-SFT

OLMoE 1B 7B 0924 SFT Benchmarks

LLME Score: 0.22239

^nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").

What is the LLM Explorer Rank (Score)

OLMoE 1B 7B 0924 SFT (allenai/OLMoE-1B-7B-0924-SFT)

OLMoE 1B 7B 0924 SFT Parameters and Internals

Model Type

moe, olmo, olmoe

Additional Notes

This model is an intermediate training checkpoint during post-training, after the Supervised Fine-Tuning (SFT) step. Recommended for best performance: Use the OLMoE-Instruct version.

Supported Languages

en (primary)

Training Details

Data Sources:

allenai/tulu-v3.1-mix-preview-4096-OLMoE

Methodology:

Intermediate training checkpoint post-Supervised Fine-Tuning (SFT), Direct Preference Optimization/Kahneman-Tversky Optimization (DPO/KTO)

Model Architecture:

Mixture-of-Experts

LLM Name	OLMoE 1B 7B 0924 SFT
Repository 🤗	https://huggingface.co/allenai/OLMoE-1B-7B-0924-SFT
Base Model(s)	OLMoE 1B 7B 0924 allenai/OLMoE-1B-7B-0924
Model Size	1b
Required VRAM	13.8 GB
Updated	2025-02-22
Maintainer	allenai
Model Type	olmoe
Model Files	5.0 GB: 1-of-3 5.0 GB: 2-of-3 3.8 GB: 3-of-3
Supported Languages	en
Model Architecture	OlmoeForCausalLM
License	apache-2.0
Context Length	4096
Model Max Length	4096
Transformers Version	4.44.0.dev0
Tokenizer Class	GPTNeoXTokenizer
Padding Token	<\|padding\|>
Vocabulary Size	50304
Torch Data Type	bfloat16

Best Alternatives to OLMoE 1B 7B 0924 SFT

Best Alternatives	Context / RAM	Downloads	Likes
OLMoE 1B 7B 0125 Instruct	4K / 13.8 GB	1925	32
OLMoE 1B 7B 0924 Instruct	4K / 13.8 GB	6628	88
OLMoE 1B 7B 0924	4K / 13.8 GB	24449	110
OLMoE 1B 7B 0125	4K / 27.7 GB	475	13
OLMoE 1B 7B 0125 SFT	4K / 13.8 GB	155	1
OLMoE 1B 7B 0125 DPO	4K / 13.8 GB	126	0

Note: green Score (e.g. "73.2") means that the model is better than allenai/OLMoE-1B-7B-0924-SFT.

Rank the OLMoE 1B 7B 0924 SFT Capabilities

🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟

Instruction Following and Task Automation
Factuality and Completeness of Knowledge
Censorship and Alignment
Data Analysis and Insight Generation
Text Generation
Text Summarization and Feature Extraction
Code Generation
Multi-Language Support and Translation

What open-source LLMs or SLMs are you in search of? 43470 in total.

Email us: info@extractum.io. Our Privacy Policy | Terms and Conditions | Suggest an improvement.

Our Social Media →

Original data from HuggingFace, OpenCompass and various public git repos.

Release v20241227

Support LLM Explorer

OLMoE 1B 7B 0924 SFT by allenai

» All LLMs » allenai » OLMoE 1B 7B 0924 SFT URL Share it on

OLMoE 1B 7B 0924 SFT Benchmarks

OLMoE 1B 7B 0924 SFT Parameters and Internals

Best Alternatives to OLMoE 1B 7B 0924 SFT

Rank the OLMoE 1B 7B 0924 SFT Capabilities

What open-source LLMs or SLMs are you in search of? 43470 in total.