Reglu 100B by SparseLLM: Benchmarks, Features and Detailed Analysis

Tags: arXiv:2402.03804, AutoTrain compatible, en, Endpoints compatible, llama, PyTorch, region:us, safetensors

Model Card on HF 🤗: https://huggingface.co/SparseLLM/reglu-100B

Reglu 100B Parameters and Internals

Additional Notes

The accompanying study (arXiv:2402.03804) compares ReLU, SwiGLU, ReGLU, and Squared ReLU to identify activation functions that enable efficient sparse computation in large language models.
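
As a rough illustration of why the choice of activation matters for sparse computation, the sketch below implements the four functions named above on plain Python lists. The gated forms follow the standard GLU-variant definitions (activation of a gate projection multiplied elementwise by an "up" projection); the helper names and the `zero_fraction` sparsity proxy are assumptions made for this example, not code or metrics taken from the study.

```python
import math


def relu(x: float) -> float:
    """ReLU: negative inputs become exactly zero."""
    return max(0.0, x)


def squared_relu(x: float) -> float:
    """Squared ReLU: ReLU followed by squaring."""
    return relu(x) ** 2


def silu(x: float) -> float:
    """SiLU/Swish, the gate activation in SwiGLU (fine for moderate |x|)."""
    return x / (1.0 + math.exp(-x))


def reglu(gate: list[float], up: list[float]) -> list[float]:
    """ReGLU hidden activation: ReLU(gate) * up, elementwise."""
    return [relu(g) * u for g, u in zip(gate, up)]


def swiglu(gate: list[float], up: list[float]) -> list[float]:
    """SwiGLU hidden activation: SiLU(gate) * up, elementwise."""
    return [silu(g) * u for g, u in zip(gate, up)]


def zero_fraction(values: list[float]) -> float:
    """Hypothetical sparsity proxy: share of entries that are exactly zero."""
    return sum(1 for v in values if v == 0.0) / len(values)


# Toy pre-activations standing in for the gate and up projections.
gate = [-2.0, -0.5, 0.0, 1.0, 3.0]
up = [1.0, 2.0, 3.0, 4.0, 5.0]

reglu_out = reglu(gate, up)
swiglu_out = swiglu(gate, up)

if __name__ == "__main__":
    print("ReGLU :", reglu_out, "zero fraction:", zero_fraction(reglu_out))
    print("SwiGLU:", swiglu_out, "zero fraction:", zero_fraction(swiglu_out))
```

In this toy input, every non-positive gate value produces an exact zero under ReGLU, while SwiGLU leaves small non-zero values for negative gates. Exact zeros are what allow the corresponding rows of the following projection to be skipped.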

Training Details

Data Sources: RefinedWeb, SlimPajama
Data Volume: 100 billion tokens
Hardware Used: 64 × A100 (80 GB)

LLM Name	Reglu 100B
Repository 🤗	https://huggingface.co/SparseLLM/reglu-100B
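Model Size	100b (as listed; the 2.6 GB bfloat16 checkpoint implies roughly 1.3 billion parameters, and the "100B" in the name matches the 100 billion training tokens)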
Required VRAM	2.6 GB
Updated	2025-06-01
Maintainer	SparseLLM
Model Type	llama
Model Files	2.6 GB 2.6 GB
Supported Languages	en
Model Architecture	LlamaForCausalLM
License	llama2
Context Length	2048
Model Max Length	2048
Transformers Version	4.36.2
Tokenizer Class	LlamaTokenizer
Vocabulary Size	32000
Torch Data Type	bfloat16
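
The figures in the table can be cross-checked and used with a few lines of Python. This is a minimal sketch, assuming the listed 2.6 GB is a decimal-gigabyte checkpoint size stored entirely in bfloat16 (2 bytes per parameter) and that the repository loads through the standard Hugging Face `transformers` auto classes, which should resolve to the listed `LlamaForCausalLM` and `LlamaTokenizer`. The helper names `estimate_param_count` and `load_reglu` are hypothetical.

```python
MODEL_ID = "SparseLLM/reglu-100B"  # repository ID from the table
CONTEXT_LENGTH = 2048              # context length from the table
BYTES_PER_BF16 = 2                 # bfloat16 stores each parameter in 2 bytes


def estimate_param_count(checkpoint_gb: float,
                         bytes_per_param: int = BYTES_PER_BF16) -> float:
    """Approximate parameter count implied by a checkpoint size in decimal GB."""
    return checkpoint_gb * 1e9 / bytes_per_param


def load_reglu(model_id: str = MODEL_ID):
    """Load tokenizer and model in bfloat16 (downloads weights on first call)."""
    # Imported lazily so the arithmetic above works without torch installed.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(
        model_id, torch_dtype=torch.bfloat16
    )
    return tokenizer, model


if __name__ == "__main__":
    params = estimate_param_count(2.6)
    print(f"Implied parameter count: about {params / 1e9:.1f} billion")
```

The estimate covers weights only. Actual memory use during inference is higher once activations and the key-value cache for up to 2048 tokens are included.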

Best Alternatives to Reglu 100B

Best Alternatives	Context / RAM	Downloads	Likes
SmollerLM2 100M Instruct Sft	8K / 0.3 GB	26	0
SmollerLM2 100M	8K / 0.2 GB	5	0
Stockmark 2 100B Instruct Beta	4K / 192.9 GB	52	10
Stockmark 100B	4K / 191.9 GB	1038	33
Saily 100b	4K / 235.5 GB	10	7
Plankton 100M	4K / 0.4 GB	10	0
...lisLM 100M Layer Hidden Pruned	2K / 0.2 GB	820	0
Llama 161M 100B	1K / 0.3 GB	15	23
...ephyr Smol Llama 100M DPO Full	1K / 0.2 GB	12	3
...ephyr Smol Llama 100M DPO Full	1K / GB	13	1

Original data from HuggingFace, OpenCompass and various public git repos.
