Model Type:

Use Cases
Areas: On-device computing, Instruction following
Applications: Summarization, Text rewriting, Function calling
Primary Use Cases: English content generation, Instruction following
Limitations: Models primarily understand and generate English; content may not be factually accurate or logically consistent; biases inherent to the training data may be present.
Considerations: Models are assistive tools and should not be used as definitive information sources.

Additional Notes: The memory footprint of the 135M model is 723.56 MB when loaded.
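
The quoted footprint can be checked at load time with the Transformers `get_memory_footprint()` helper. A minimal sketch; the model ID is a placeholder, not the actual checkpoint name, and the reported number depends on the dtype used at load time:

```python
# Minimal sketch: measure a loaded model's memory footprint.
# NOTE: the model ID below is a placeholder (assumption), not the real checkpoint.
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained("your-org/your-135m-model")

# get_memory_footprint() returns the bytes consumed by parameters and buffers;
# the result varies with the dtype the weights were loaded in.
print(f"Memory footprint: {model.get_memory_footprint() / 1024**2:.2f} MB")
```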
|
Supported Languages: English

Training Details
Data Sources: FineWeb-Edu, DCLM, The Stack, UltraFeedback
Data Volume:
Methodology: Supervised Fine-Tuning (SFT), Direct Preference Optimization (DPO); a sketch of the DPO stage follows this section
Context Length:
Hardware Used:
Model Architecture:
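
The preference-tuning stage can be sketched with the TRL library. This is a minimal illustration, not the actual training setup: the SFT checkpoint path and hyperparameters are placeholders, the dataset shown is a public binarized UltraFeedback variant, and exact trainer arguments vary across TRL versions.

```python
# Minimal DPO sketch with TRL. The checkpoint path and hyperparameters are
# placeholders (assumptions), not the values used to train this model.
from datasets import load_dataset
from trl import DPOConfig, DPOTrainer

# DPO consumes preference pairs: each row holds a prompt plus a preferred
# ("chosen") and a dispreferred ("rejected") response.
train_dataset = load_dataset("trl-lib/ultrafeedback_binarized", split="train")

trainer = DPOTrainer(
    model="path/to/sft-checkpoint",  # placeholder: DPO starts from the SFT model
    args=DPOConfig(output_dir="dpo-output", beta=0.1),  # beta weights the KL penalty
    train_dataset=train_dataset,
)
trainer.train()
```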
|
Input/Output
Input Format: Token sequences encoded with a tokenizer
Accepted Modalities: Text
Output Format: Generated token sequences
Performance Tips: Use multiple GPUs and reduced-precision weights (e.g., bfloat16) for best performance; see the sketches below
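
The input/output contract above amounts to a text -> tokens -> text round trip. A minimal sketch, again with a placeholder model ID:

```python
# Minimal sketch of the tokenize -> generate -> decode round trip.
# NOTE: the model ID is a placeholder (assumption), not the real checkpoint.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "your-org/your-135m-model"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

# Input: the tokenizer encodes text into a sequence of token IDs.
inputs = tokenizer("Rewrite this sentence more formally: gotta go now.", return_tensors="pt")

# The model generates a continuation as token IDs...
output_ids = model.generate(**inputs, max_new_tokens=50)

# ...and the tokenizer decodes them back into text (the output format above).
print(tokenizer.decode(output_ids[0], skip_special_tokens=True))
```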
|
|
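The performance tips map onto two loading options in Transformers. A minimal sketch under the same placeholder-model assumption; `device_map="auto"` additionally requires the accelerate package:

```python
# Minimal sketch of the performance tips: bfloat16 weights plus automatic
# multi-GPU placement. NOTE: the model ID is a placeholder (assumption).
import torch
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained(
    "your-org/your-135m-model",
    torch_dtype=torch.bfloat16,  # half the memory of float32, with similar dynamic range
    device_map="auto",           # places layers across available GPUs (needs accelerate)
)
```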