LLM Name | SmolLM2 360M Instruct FT |
Repository 🤗 | https://huggingface.co/belyakoff/SmolLM2-360M-Instruct-FT |
Base Model(s) | |
Model Size | 360M |
Required VRAM | 1.4 GB |
Updated | 2025-01-14 |
Maintainer | belyakoff |
Model Type | llama |
Instruction-Based | Yes |
Model Files | |
Supported Languages | ru |
Model Architecture | LlamaForCausalLM |
License | apache-2.0 |
Context Length | 8192 |
Model Max Length | 8192 |
Transformers Version | 4.43.3 |
Tokenizer Class | GPT2Tokenizer |
Padding Token | <|endoftext|> |
Vocabulary Size | 49152 |
Torch Data Type | float32 |
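
The Required VRAM figure above is consistent with a simple weights-only estimate: parameter count times bytes per parameter. The sketch below is illustrative only. It assumes the nominal 360M parameter count from the Model Size row (the exact count may differ), and the helper name `estimate_weight_memory_gb` and the bytes-per-parameter table are hypothetical, not part of any library. It ignores activations, the KV cache, and framework overhead, so real usage will be higher.

```python
# Rough weights-only memory estimate: parameters x bytes per parameter.
# Assumption: nominal parameter count taken from the "Model Size" row.
NOMINAL_PARAMS = 360e6

# Bytes needed to store one parameter in common weight formats.
BYTES_PER_PARAM = {
    "float32": 4.0,
    "float16": 2.0,
    "bfloat16": 2.0,
    "int8": 1.0,
    "int4": 0.5,
}


def estimate_weight_memory_gb(num_params: float, dtype: str) -> float:
    """Return the approximate size of the weights in GB (1 GB = 1e9 bytes)."""
    return num_params * BYTES_PER_PARAM[dtype] / 1e9


if __name__ == "__main__":
    for dtype in ("float32", "float16", "int8", "int4"):
        size_gb = estimate_weight_memory_gb(NOMINAL_PARAMS, dtype)
        print(f"{dtype:>8}: {size_gb:.2f} GB")
```

For float32, the data type listed for this model, the estimate is about 1.44 GB, which matches the 1.4 GB shown under Required VRAM. Half-precision variants come out near 0.7 GB. The naive 4-bit estimate is lower than the 0.3 GB the listing reports for the 4-bit variant, so treat the quantized figures as approximate.
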
Best Alternatives | Context / RAM | Downloads | Likes |
---|---|---|---|
SmolLM2 360M Instruct | 8K / 0.7 GB | 1113664 | 68 |
SmolLM2 CoT 360M | 8K / 1.4 GB | 263 | 12 |
SmolLM2 360M Instruct | 8K / 0.7 GB | 1419 | 0 |
SmolLM2 360M Instruct Ita | 8K / 0.7 GB | 11 | 0 |
SmolLM 360M Instruct | 2K / 0.7 GB | 14705 | 77 |
SmolLM 360M | 2K / 0.7 GB | 876 | 0 |
SmolLM 360M Instruct | 2K / 0.7 GB | 638 | 0 |
SmolLM 360M Instruct | 2K / 0.7 GB | 232 | 2 |
SmolLM2 360M Instruct Bnb 4bit | 8K / 0.3 GB | 866 | 0 |
SmolLM 360M Instruct 8bit | 2K / 0.4 GB | 22 | 2 |