LLM Name | Gpt2 Scratch |
Repository ๐ค | https://huggingface.co/Harshatheeswar/gpt2-scratch |
Base Model(s) | |
Model Size | 124.4m |
Required VRAM | 0.5 GB |
Updated | 2024-12-21 |
Maintainer | Harshatheeswar |
Model Type | gpt2 |
Model Files | |
Model Architecture | GPT2LMHeadModel |
Model Max Length | 1024 |
Transformers Version | 4.44.2 |
Tokenizer Class | GPT2Tokenizer |
Padding Token | <|endoftext|> |
Vocabulary Size | 50257 |
Torch Data Type | float32 |
Activation Function | gelu_new |
Errors | replace |
Best Alternatives |
Context / RAM |
Downloads |
Likes |
---|---|---|---|
PlayPart AI Personal Trainer | 0K / 0.5 GB | 301 | 0 |
OCRonos Vintage | 0K / 0.2 GB | 452 | 76 |
Testmod | 0K / 0.5 GB | 370 | 0 |
Originos Icn Savant | 0K / 0.5 GB | 367 | 1 |
DialoGPT Small Garycoleman | 0K / 0.5 GB | 365 | 1 |
D2nwg Causal Gpt2 V1 | 0K / 0.2 GB | 24 | 0 |
Quble Test Model V1 Pretrain | 0K / 0.5 GB | 381 | 2 |
D2nwg Causal Gpt2 | 0K / 0.2 GB | 21 | 0 |
DialoGPT Medium Loki | 0K / 0.5 GB | 520 | 0 |
Ftgpt | 0K / 0.2 GB | 22 | 0 |
๐ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐