SmolLM2 360M Instruct by HuggingFaceTB


  Autotrain compatible   Conversational   En   Endpoints compatible   Ext 8k   Instruct   Llama   Onnx   Region:us   Safetensors   Tensorboard   Transformers.js

SmolLM2 360M Instruct Benchmarks

Scores are shown as percentages relative to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o"), or GPT-4 ("gpt4").
SmolLM2 360M Instruct (HuggingFaceTB/SmolLM2-360M-Instruct)

SmolLM2 360M Instruct Parameters and Internals

Model Type 
text generation
Use Cases 
Areas:
Research, Commercial applications
Limitations:
Primarily understands and generates English, Generated content may not always be factually accurate, May not be logically consistent or free from biases
Considerations:
Should be used as assistive tools; verify important information.
Supported Languages 
en (High)
Training Details 
Data Sources:
FineWeb-Edu, DCLM, The Stack, Custom curated datasets
Data Volume:
4T tokens
Methodology:
Supervised fine-tuning (SFT), followed by Direct Preference Optimization (DPO) using UltraFeedback
Hardware Used:
64 H100 GPUs
Model Architecture:
Transformer decoder
Input Output 
Input Format:
Chat messages formatted with the tokenizer's chat template
Accepted Modalities:
text
Output Format:
Generated text responses
Performance Tips:
Utilize GPUs for optimal performance.
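
The input format above (chat messages passed through the tokenizer's chat template) can be sketched as follows. This is a minimal example, not the maintainers' reference code: it assumes the `transformers` and `torch` packages are installed and that the repository ID listed on this page can be downloaded. `build_messages` and `generate_reply` are hypothetical helper names introduced for illustration.

```python
MODEL_ID = "HuggingFaceTB/SmolLM2-360M-Instruct"  # repository ID from this listing


def build_messages(user_prompt, system_prompt=None):
    """Build the role/content message list that a chat template expects."""
    messages = []
    if system_prompt:
        messages.append({"role": "system", "content": system_prompt})
    messages.append({"role": "user", "content": user_prompt})
    return messages


def generate_reply(user_prompt, max_new_tokens=64):
    """Hypothetical helper: format a prompt with the chat template and generate text."""
    # Imported lazily so build_messages works without these packages installed.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(MODEL_ID, torch_dtype=torch.bfloat16)

    # Use a GPU when one is available, otherwise fall back to CPU.
    device = "cuda" if torch.cuda.is_available() else "cpu"
    model.to(device)

    # Render the messages to a prompt string, ending with the assistant turn marker.
    prompt = tokenizer.apply_chat_template(
        build_messages(user_prompt), tokenize=False, add_generation_prompt=True
    )
    inputs = tokenizer(prompt, return_tensors="pt", add_special_tokens=False).to(device)

    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    # Decode only the newly generated tokens, not the prompt.
    new_tokens = output_ids[0][inputs["input_ids"].shape[-1]:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True)
```

Calling `generate_reply("What is gravity?")` would download the weights on first use. As the considerations above note, important information in the output should be verified.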
LLM Name: SmolLM2 360M Instruct
Repository: https://huggingface.co/HuggingFaceTB/SmolLM2-360M-Instruct
Model Size: 360M
Required VRAM: 0.7 GB
Updated: 2024-12-08
Maintainer: HuggingFaceTB
Model Type: llama
Instruction-Based: Yes
Model Files: 0.7 GB, 0.0 GB
Supported Languages: en
Model Architecture: LlamaForCausalLM
License: apache-2.0
Context Length: 8192 (8k)
Model Max Length: 8192
Transformers Version: 4.42.3
Tokenizer Class: GPT2Tokenizer
Padding Token: <|im_end|>
Vocabulary Size: 49152
Torch Data Type: bfloat16
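
The 0.7 GB VRAM figure is consistent with a weights-only estimate: about 360M parameters at 2 bytes each in bfloat16. The sketch below works through that arithmetic. It assumes a nominal count of exactly 360 million parameters (the exact count is not given on this page) and ignores activations, KV cache, and framework overhead, so real usage will be higher. `BYTES_PER_PARAM` and `weight_memory_gb` are illustrative names.

```python
# Approximate storage cost per parameter for common precisions.
BYTES_PER_PARAM = {
    "float32": 4.0,
    "bfloat16": 2.0,
    "int8": 1.0,
    "4bit": 0.5,
}

NUM_PARAMS = 360e6  # assumption: nominal size taken from the "360M" label


def weight_memory_gb(num_params, dtype):
    """Weights-only memory estimate in gigabytes (1 GB = 1e9 bytes)."""
    return num_params * BYTES_PER_PARAM[dtype] / 1e9


if __name__ == "__main__":
    for dtype in BYTES_PER_PARAM:
        print(f"{dtype}: {weight_memory_gb(NUM_PARAMS, dtype):.2f} GB")
```

For bfloat16 this gives about 0.72 GB, which rounds to the listed 0.7 GB. Quantized variants usually need somewhat more than the raw 4-bit or 8-bit estimate because some tensors may stay in higher precision.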

Quantized Models of SmolLM2 360M Instruct

Model | Likes | Downloads | VRAM
SmolLM2 360M Instruct Bnb 4bit | 0 | 216 | 0 GB

Best Alternatives to SmolLM2 360M Instruct

Best Alternatives | Context / RAM | Downloads and Likes (unseparated)
SmolLM2 360M Instruct FT | 8K / 1.4 GB | 841
SmolLM 360M Instruct | 2K / 0.7 GB | 1142477
SmolLM 360M Instruct | 2K / 0.7 GB | 4150
SmolLM 360M | 2K / 0.7 GB | 3980
SmolLM 360M Instruct | 2K / 0.7 GB | 4422
SmolLM2 360M Instruct Bnb 4bit | 8K / 0.3 GB | 2160
SmolLM 360M Instruct 8bit | 2K / 0.4 GB | 292


Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241124