LLM Name | SmolLM2 135M |
Repository 🤗 | https://huggingface.co/unsloth/SmolLM2-135M |
Base Model(s) | |
Model Size | 135M |
Required VRAM | 0.3 GB |
Updated | 2024-11-07 |
Maintainer | unsloth |
Model Type | llama |
Model Files | |
Supported Languages | en |
Model Architecture | LlamaForCausalLM |
License | apache-2.0 |
Context Length | 8192 |
Model Max Length | 8192 |
Transformers Version | 4.46.1 |
Tokenizer Class | GPT2Tokenizer |
Padding Token | <|PAD_TOKEN|> |
Vocabulary Size | 49153 |
Torch Data Type | bfloat16 |
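
The Required VRAM figure above is consistent with a weights-only, back-of-envelope estimate: parameter count times bytes per parameter. The sketch below assumes the nominal 135M parameter count from the Model Size field (the exact count may differ), 2 bytes per bfloat16 value, and 1 GB = 10^9 bytes. The function name and the dtype table are hypothetical helpers for illustration, and this is not necessarily how the listed figure was computed.

```python
# Weights-only memory estimate from the card's Model Size and Torch Data Type.
# Assumption: activations, KV cache, and framework overhead are NOT included.
BYTES_PER_PARAM = {"float32": 4, "bfloat16": 2, "float16": 2, "int8": 1}


def estimate_weight_memory_gb(num_params: int, dtype: str = "bfloat16") -> float:
    """Return the memory needed to hold the weights, in GB (1 GB = 1e9 bytes)."""
    return num_params * BYTES_PER_PARAM[dtype] / 1e9


params = 135_000_000  # nominal count taken from "Model Size | 135M"
weights_gb = estimate_weight_memory_gb(params, "bfloat16")
print(f"{weights_gb:.2f} GB")  # 0.27 GB, which rounds to the listed 0.3 GB
```

Actual usage at inference time will be higher than this estimate, and it grows with batch size and with how much of the 8192-token context is filled.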
Best Alternatives | Context / RAM | Downloads | Likes |
---|---|---|---|
SmolLM2 135M Instruct | 8K / 0.3 GB | 4861 | 48 |
SmolLM2 135M | 8K / 0.3 GB | 3852 | 23 |
SmolLM2 Prompt Enhance | 8K / 0.5 GB | 32 | 4 |
SmolLM2 135M Instruct | 8K / 0.3 GB | 42 | 0 |
SmolLM 135M | 2K / 0.5 GB | 43741 | 170 |
SmolLM 135M Instruct | 2K / 0.3 GB | 26856 | 95 |
AMD Llama 135M | 2K / 0.5 GB | 11754 | 110 |
... Instruct Layer Pruned 90M Raw | 2K / 0.2 GB | 255 | 1 |
AMD Llama 135M Code | 2K / 0.5 GB | 1282 | 12 |
...olLM 135M Layer Pruned 90M Raw | 2K / 0.2 GB | 12 | 0 |