LLM Name | GPT Neo 125M |
Repository 🤗 | https://huggingface.co/anezatra/gpt-neo-125M |
Model Size | 125M |
Required VRAM | 0 GB |
Updated | 2024-07-04 |
Maintainer | anezatra |
Model Type | gpt_neo |
Model Architecture | GPTNeoForCausalLM |
Context Length | 2048 |
Model Max Length | 2048 |
Transformers Version | 4.41.1 |
Tokenizer Class | GPT2Tokenizer |
Padding Token | <|endoftext|> |
Vocabulary Size | 50257 |
Torch Data Type | float32 |
Activation Function | gelu_new |
Errors | replace |
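
The fields above map onto the Hugging Face `transformers` loading API. The following is a minimal sketch, assuming the `transformers` and `torch` packages are installed and the repository is reachable. The helper name `generate_text`, the default token budget, and the greedy decoding choice are illustrative assumptions, not part of the listing.

```python
REPO_ID = "anezatra/gpt-neo-125M"  # from the Repository field
MAX_CONTEXT = 2048                 # from the Context Length field


def generate_text(prompt: str, max_new_tokens: int = 40) -> str:
    """Load the checkpoint and greedily continue `prompt` (hypothetical helper)."""
    # Imported lazily so this module can be imported without the heavy dependencies.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(REPO_ID)
    # float32 matches the Torch Data Type field of the listing.
    model = AutoModelForCausalLM.from_pretrained(REPO_ID, torch_dtype=torch.float32)
    model.eval()

    # Truncate the prompt so prompt plus continuation fits in the context window.
    inputs = tokenizer(
        prompt,
        return_tensors="pt",
        truncation=True,
        max_length=MAX_CONTEXT - max_new_tokens,
    )
    with torch.no_grad():
        output_ids = model.generate(
            **inputs,
            max_new_tokens=max_new_tokens,
            do_sample=False,
            # The listing gives <|endoftext|> as the padding token, which is
            # also the GPT-2 tokenizer's end-of-text token.
            pad_token_id=tokenizer.eos_token_id,
        )
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `generate_text("Once upon a time")` downloads the weights on first use and returns the prompt followed by its continuation.
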
Best Alternatives | Context / RAM | Downloads | Likes |
---|---|---|---|
Aitextgen | 2K / 0.5 GB | 161 | 0 |
GPT Neo 125M Sft | 2K / 0 GB | 106 | 0 |
GPT Neo Plantuml | 2K / 0.5 GB | 136 | 0 |
GPT Neo 125M Lama | 2K / 0.5 GB | 115 | 0 |
GPT Neo Small | 2K / 0 GB | 164 | 0 |
GPT Neo Plantuml Sol1 | 2K / 0.5 GB | 108 | 0 |
GPT Neo 125M Code Alpaca | 2K / 0 GB | 113 | 0 |
Model | 2K / 0.2 GB | 108 | 0 |
Neox 125m Storytelling | 2K / 0.5 GB | 8 | 0 |
Epfl Cs 522 Istari Mcqa | 2K / 0.5 GB | 9 | 0 |
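
The 0.5 GB figures in the Context / RAM column are consistent with a rough weights-only estimate for a model of this size stored in float32. The sketch below shows that arithmetic. It assumes a nominal 125,000,000 parameters (taken from the "125M" in the model name, so the exact count may differ) and ignores activations and framework overhead. How the listing actually derives its RAM values is not stated.

```python
# Bytes used to store one parameter in each common dtype.
BYTES_PER_DTYPE = {"float32": 4, "float16": 2, "bfloat16": 2, "int8": 1}


def weights_size_gb(num_params: int, dtype: str = "float32") -> float:
    """Weights-only footprint in decimal gigabytes (1 GB = 1e9 bytes)."""
    return num_params * BYTES_PER_DTYPE[dtype] / 1e9


NOMINAL_PARAMS = 125_000_000  # assumption: nominal count from the model name

print(weights_size_gb(NOMINAL_PARAMS, "float32"))  # 0.5
print(weights_size_gb(NOMINAL_PARAMS, "float16"))  # 0.25
```
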