LLM Name | Gpt2 Kgw K1 Delta2.0 LearnabilityScratch |
Repository ๐ค | https://huggingface.co/Grogros/gpt2-kgw-k1-delta2.0-LearnabilityScratch |
Model Size | 124.4m |
Required VRAM | 0.5 GB |
Updated | 2025-02-22 |
Maintainer | Grogros |
Model Type | gpt2 |
Model Files | |
Model Architecture | GPT2LMHeadModel |
Model Max Length | 1024 |
Transformers Version | 4.46.3 |
Tokenizer Class | GPT2Tokenizer |
Vocabulary Size | 50257 |
Torch Data Type | float32 |
Activation Function | gelu_new |
Best Alternatives |
Context / RAM |
Downloads |
Likes |
---|---|---|---|
TisaleoGPT2Bot1 | 0K / 0.5 GB | 83 | 0 |
Gpt2 Therapist Finetuned | 0K / 0.5 GB | 1010 | 0 |
TisaleoRuta | 0K / 0.5 GB | 22 | 0 |
AI Guru | 0K / 0.5 GB | 184 | 0 |
Gpt2 Small III | 0K / 0.5 GB | 208 | 2 |
Pop Lyrics Generator V1 | 0K / 0.5 GB | 250 | 7 |
NeuraMed | 0K / 0.5 GB | 190 | 0 |
NeoMed | 0K / 0.5 GB | 79 | 0 |
...2 Kgw K1 Delta2.0 LogitDistill | 0K / 0.5 GB | 51 | 0 |
D2nwg Causal Gpt2 V1 | 0K / 0.2 GB | 154 | 0 |
๐ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐