Pythia 1.4B by EleutherAI


Tags: arXiv:2101.00027 · arXiv:2201.07311 · arXiv:2304.01373 · autotrain-compatible · dataset:eleutherai/the_pile · en · endpoints-compatible · gpt_neox · pythia · pytorch · region:us · safetensors
Model Card on HF 🤗: https://huggingface.co/EleutherAI/pythia-1.4b

Pythia 1.4B Benchmarks

Pythia 1.4B (EleutherAI/pythia-1.4b)

Pythia 1.4B Parameters and Internals

Model Type 
Transformer-based Language Model
Use Cases 
Areas:
Research
Applications:
Interpretability research
Primary Use Cases:
Scientific experiments on large language models' behavior and limitations
Limitations:
Not intended for deployment or human-facing interactions
Considerations:
Evaluate the risks associated with your use case. The model may generate harmful or offensive text.
Additional Notes 
The model is intended for research and interpretability purposes and includes checkpoints for experimentation.
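The Pythia suite publishes these intermediate checkpoints as git revisions on the Hugging Face repository (e.g. "step3000"). A minimal sketch of loading one such checkpoint, assuming that revision naming; confirm the exact list of available revisions on the repo:

```python
# Minimal sketch: load an intermediate Pythia training checkpoint.
# Revision names like "step3000" follow the Pythia model card; verify
# that the revision exists on the HF repo before relying on it.
from transformers import AutoModelForCausalLM, AutoTokenizer

checkpoint = "EleutherAI/pythia-1.4b"
revision = "step3000"  # assumed intermediate-checkpoint revision name

model = AutoModelForCausalLM.from_pretrained(checkpoint, revision=revision)
tokenizer = AutoTokenizer.from_pretrained(checkpoint, revision=revision)
```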
Supported Languages 
English (Primary)
Training Details 
Data Sources:
EleutherAI/the_pile
Data Volume:
825GiB
Methodology:
Trained on the Pile both with and without deduplication (see the loading sketch after this section)
Model Architecture:
GPT-NeoX
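Both training runs are published as separate repositories; a minimal sketch loading the two variants (the deduped repo id follows EleutherAI's naming and also appears in the alternatives table below):

```python
# Minimal sketch: the standard-Pile run and the deduplicated-Pile run
# live under separate repo ids.
from transformers import AutoModelForCausalLM

standard = AutoModelForCausalLM.from_pretrained("EleutherAI/pythia-1.4b")
deduped = AutoModelForCausalLM.from_pretrained("EleutherAI/pythia-1.4b-deduped")
```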
Responsible AI Considerations 
Fairness:
See Section 6 of the Pile paper for a discussion of documented biases.
Mitigation Strategies:
Conduct your own risk and bias assessment when using this model.
Input / Output 
Input Format:
Text inputs
Accepted Modalities:
Text
Output Format:
Token predictions
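
A minimal sketch of the text-in, token-predictions-out interface described above, using the transformers library (the prompt string is illustrative):

```python
# Minimal sketch: feed text in, read next-token predictions out.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("EleutherAI/pythia-1.4b")
model = AutoModelForCausalLM.from_pretrained("EleutherAI/pythia-1.4b")

inputs = tokenizer("The Pile is a", return_tensors="pt")  # illustrative prompt
with torch.no_grad():
    logits = model(**inputs).logits  # shape: (1, seq_len, vocab_size)

next_token_id = logits[0, -1].argmax().item()  # greedy next-token prediction
print(tokenizer.decode(next_token_id))
```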
LLM Name: Pythia 1.4B
Repository 🤗: https://huggingface.co/EleutherAI/pythia-1.4b
Model Size: 1.4b
Required VRAM: 2.9 GB
Updated: 2025-02-22
Maintainer: EleutherAI
Model Type: gpt_neox
Model Files: 2.9 GB, 2.9 GB
Supported Languages: en
Model Architecture: GPTNeoXForCausalLM
License: apache-2.0
Context Length: 2048
Model Max Length: 2048
Transformers Version: 4.24.0
Tokenizer Class: GPTNeoXTokenizer
Vocabulary Size: 50304
Torch Data Type: float16
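
A minimal sketch of loading the model with the settings listed above (float16 weights, about 2.9 GB) and checking the listed config values; actual memory use depends on your environment:

```python
# Minimal sketch: load in float16 as listed in the card and verify
# the advertised context length and vocabulary size from the config.
import torch
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained(
    "EleutherAI/pythia-1.4b",
    torch_dtype=torch.float16,
)
print(model.config.max_position_embeddings)  # expected: 2048
print(model.config.vocab_size)               # expected: 50304
```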

Best Alternatives to Pythia 1.4B

Best Alternatives               Context / RAM   Downloads   Likes
Pythia 1.4B Deduped 8K Base     8K / 7.3 GB     4           1
Pythia 1.4B Deduped 4K Base     4K / 6.1 GB     162         2
Pythia 1.4B Sft Full            2K / 2.8 GB     86          1
RCC Ins Reconstruction          2K / 6.1 GB     184         1
Pythia Delphi Suboptimal        2K / 5.7 GB     6           0
Pythia 1.4B Deduped Sharegpt    2K / 2.8 GB     2004        2
Pythia 1.4B Deduped Sharegpt    2K / 2.8 GB     2060        0
Pythia 1.4b Sft Policy          2K / 2.9 GB     225         1
Pythia 1.4B Deduped             2K / 2.9 GB     137371      9
ShortKing 1.4B V0.1             2K / 2.8 GB     2170        2



Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227