LLM Name | Gpt2 Scratch |
Repository ๐ค | https://huggingface.co/Harshatheeswar/gpt2-scratch |
Base Model(s) | |
Model Size | 124.4m |
Required VRAM | 0.5 GB |
Updated | 2024-11-09 |
Maintainer | Harshatheeswar |
Model Type | gpt2 |
Model Files | |
Model Architecture | GPT2LMHeadModel |
Model Max Length | 1024 |
Transformers Version | 4.44.2 |
Tokenizer Class | GPT2Tokenizer |
Padding Token | <|endoftext|> |
Vocabulary Size | 50257 |
Torch Data Type | float32 |
Activation Function | gelu_new |
Errors | replace |
Best Alternatives |
Context / RAM |
Downloads |
Likes |
---|---|---|---|
OCRonos Vintage | 0K / 0.2 GB | 1313 | 66 |
Originos Icn Savant | 0K / 0.5 GB | 326 | 1 |
D2nwg Causal Gpt2 | 0K / 0.2 GB | 36 | 0 |
Quble Test Model V1 Pretrain | 0K / 0.5 GB | 503 | 2 |
D2nwg Causal Gpt2 V1 | 0K / 0.2 GB | 33 | 0 |
Causal Gpt2 | 0K / 0.2 GB | 35 | 0 |
Gpt2 NoLN | 0K / 0.5 GB | 509 | 2 |
Awareness Test | 0K / 0.5 GB | 17 | 1 |
Math Gpt2 Sft | 0K / 0.5 GB | 708 | 2 |
Chat Gpt2 DPO | 0K / 0.5 GB | 713 | 0 |
๐ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐