Pythia 160M Pg19 by DarwinAnim8or

 ยป  All LLMs  ยป  DarwinAnim8or  ยป  Pythia 160M Pg19   URL Share it on

  Autotrain compatible   Dataset:deepmind/pg19   En   Endpoints compatible   Gpt neox   Pytorch   Region:us

Pythia 160M Pg19 Benchmarks

nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").
Pythia 160M Pg19 (DarwinAnim8or/pythia-160m-pg19)

Pythia 160M Pg19 Parameters and Internals

Model Type 
text generation
Additional Notes 
This is an experiment to see if Pythia pretrained from scratch on pg19 could work. It follows the same settings as the regular Pythia-160M.
Supported Languages 
en (proficient)
Training Details 
Data Sources:
deepmind/pg19
Data Volume:
150,000,000 tokens
Methodology:
Pretrained from scratch
LLM NamePythia 160M Pg19
Repository ๐Ÿค—https://huggingface.co/DarwinAnim8or/pythia-160m-pg19 
Model Size160m
Required VRAM1.9 GB
Updated2025-02-22
MaintainerDarwinAnim8or
Model Typegpt_neox
Model Files  1.9 GB
Supported Languagesen
Model ArchitectureAutoModelForCausalLM
Context Length2048
Model Max Length2048
Transformers Version4.24.0
Tokenizer ClassGPTNeoXTokenizer
Vocabulary Size50304
Torch Data Typefloat16

Best Alternatives to Pythia 160M Pg19

Best Alternatives
Context / RAM
Downloads
Likes
... 160M Text Simplification Ptbr0K / 0 GB1142

Rank the Pythia 160M Pg19 Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 43470 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227