Model Type |
| |||||||||||||||
Use Cases |
| |||||||||||||||
Additional Notes |
| |||||||||||||||
Supported Languages |
| |||||||||||||||
Training Details |
| |||||||||||||||
Input Output |
|
LLM Name | Pythia 160M C2s |
Repository ๐ค | https://huggingface.co/vandijklab/pythia-160m-c2s |
Model Size | 160m |
Required VRAM | 0.6 GB |
Updated | 2025-06-01 |
Maintainer | vandijklab |
Model Type | gpt_neox |
Model Files | |
Supported Languages | en |
Model Architecture | GPTNeoXForCausalLM |
License | cc-by-nc-nd-4.0 |
Context Length | 9200 |
Model Max Length | 9200 |
Transformers Version | 4.37.1 |
Tokenizer Class | GPTNeoXTokenizer |
Vocabulary Size | 50304 |
Torch Data Type | float32 |
Best Alternatives |
Context / RAM |
Downloads |
Likes |
---|---|---|---|
Pythia 160M Xsum Roya | 2K / 0.6 GB | 18 | 0 |
Pythia 160M | 2K / 0.4 GB | 143715 | 32 |
Pythia 160m Sft | 2K / 0 GB | 16 | 0 |
Sheared Pythia 160M | 2K / 0.7 GB | 12 | 4 |
Pythia 160M Dolphin Extended | 2K / 0.3 GB | 31 | 0 |
Pythia 160M Storytelling | 2K / 0.3 GB | 23 | 0 |
Pythia160m Sft Tldr | 2K / 0.6 GB | 19 | 0 |
Pythia 160M Deduped | 2K / 0.4 GB | 42451 | 3 |
Pythia 160m Ft CookingRecipes | 2K / 0.6 GB | 11 | 0 |
Ppo | 2K / 0.3 GB | 12 | 0 |
๐ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐