Additional Notes |
| |||
Training Details |
|
LLM Name | Gpt2 Large Lora Sft2 |
Repository ๐ค | https://huggingface.co/Mikivis/gpt2-large-lora-sft2 |
Model Size | 774m |
Required VRAM | 1.6 GB |
Updated | 2025-01-17 |
Maintainer | Mikivis |
Model Type | gpt2 |
Model Files | |
Model Architecture | GPT2LMHeadModel |
License | apache-2.0 |
Model Max Length | 1024 |
Transformers Version | 4.32.1 |
Tokenizer Class | GPT2Tokenizer |
Vocabulary Size | 50257 |
LoRA Model | Yes |
Torch Data Type | float16 |
Activation Function | gelu_new |
Best Alternatives |
Context / RAM |
Downloads |
Likes |
---|---|---|---|
...T2 774M CINDER SHOW MULTI CHAT | 0K / 0 GB | 238 | 2 |
Alpaca Refine Gpt2 E1 Se0 | 0K / 3.1 GB | 695 | 0 |
Alpaca Tuned Gpt2 | 0K / 3.1 GB | 699 | 0 |
Alpaca Spin Gpt2 E1 Se0 | 0K / 3.1 GB | 700 | 0 |
Alpaca Refine Tuned Gpt2 Large | 0K / 3.1 GB | 688 | 0 |
Alpaca Spin Tuned Gpt2 Large | 0K / 3.1 GB | 695 | 0 |
Alpaca Refine Gpt2 E0 Se1 | 0K / 3.1 GB | 686 | 0 |
Alpaca Spin Gpt2 E0 Se1 | 0K / 3.1 GB | 684 | 0 |
Turkish Gpt2 Large | 0K / 3.1 GB | 5039 | 37 |
Gpt2 Large Lora Sft1 | 0K / 1.6 GB | 731 | 0 |
๐ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐