Training Details | |
LLM Name | Gpt2 Large Lora Sft |
Repository 🤗 | https://huggingface.co/Mikivis/gpt2-large-lora-sft |
Base Model(s) | |
Model Size | 774M |
Required VRAM | 1.6 GB |
Updated | 2025-01-20 |
Maintainer | Mikivis |
Model Type | gpt2 |
Model Files | |
Model Architecture | GPT2LMHeadModel |
License | mit |
Model Max Length | 1024 |
Transformers Version | 4.32.1 |
Tokenizer Class | GPT2Tokenizer |
Vocabulary Size | 50257 |
LoRA Model | Yes |
Torch Data Type | float16 |
Activation Function | gelu_new |
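
The listed VRAM figure can be roughly sanity-checked from the model size and the Torch data type above. The following is a minimal sketch, assuming that the requirement is dominated by the weights alone (no activations, KV cache, or optimizer state) and that gigabytes are decimal (1e9 bytes). The helper name `estimate_weight_memory_gb` is hypothetical, and the listing site's own calculation method is not stated.

```python
def estimate_weight_memory_gb(num_params: float, bytes_per_param: int) -> float:
    """Return the memory needed for the weights alone, in decimal gigabytes."""
    return num_params * bytes_per_param / 1e9


# Values taken from the table above: 774M parameters, stored as float16.
NUM_PARAMS = 774e6

# Standard storage sizes for the two common floating-point dtypes.
BYTES_PER_PARAM = {"float16": 2, "float32": 4}

fp16_gb = estimate_weight_memory_gb(NUM_PARAMS, BYTES_PER_PARAM["float16"])
fp32_gb = estimate_weight_memory_gb(NUM_PARAMS, BYTES_PER_PARAM["float32"])

print(f"float16: {fp16_gb:.2f} GB")  # weights only, half precision
print(f"float32: {fp32_gb:.2f} GB")  # weights only, single precision
```

Under these assumptions the float16 estimate comes to about 1.55 GB, close to the 1.6 GB listed for this model, while the float32 estimate of about 3.1 GB matches the figure shown for several same-sized alternatives. Actual usage at inference time will be higher once activations and framework overhead are included.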
Best Alternatives | Context / RAM | Downloads | Likes |
---|---|---|---|
...T2 774M CINDER SHOW MULTI CHAT | 0K / 0 GB | 205 | 2 |
Alpaca Refine Gpt2 E1 Se0 | 0K / 3.1 GB | 677 | 0 |
Alpaca Tuned Gpt2 | 0K / 3.1 GB | 692 | 0 |
Alpaca Refine Tuned Gpt2 Large | 0K / 3.1 GB | 686 | 0 |
Alpaca Spin Gpt2 E1 Se0 | 0K / 3.1 GB | 681 | 0 |
Alpaca Refine Gpt2 E0 Se1 | 0K / 3.1 GB | 685 | 0 |
Alpaca Spin Tuned Gpt2 Large | 0K / 3.1 GB | 683 | 0 |
Alpaca Spin Gpt2 E0 Se1 | 0K / 3.1 GB | 679 | 0 |
Turkish Gpt2 Large | 0K / 3.1 GB | 5074 | 37 |
Gpt2 Large Lora Sft1 | 0K / 1.6 GB | 722 | 0 |