Pythia 1B Deduped Tldr Sft by trl-lib

 ยป  All LLMs  ยป  trl-lib  ยป  Pythia 1B Deduped Tldr Sft   URL Share it on

  Gpt neox   Pytorch   Region:us   Safetensors

Pythia 1B Deduped Tldr Sft Benchmarks

nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").
Pythia 1B Deduped Tldr Sft (trl-lib/pythia-1b-deduped-tldr-sft)

Pythia 1B Deduped Tldr Sft Parameters and Internals

LLM NamePythia 1B Deduped Tldr Sft
Repository ๐Ÿค—https://huggingface.co/trl-lib/pythia-1b-deduped-tldr-sft 
Model Size1b
Required VRAM2 GB
Updated2025-04-19
Maintainertrl-lib
Model Typegpt_neox
Model Files  2.0 GB   4.0 GB
Model ArchitectureGPTNeoXForCausalLM
Context Length2048
Model Max Length2048
Transformers Version4.42.3
Tokenizer ClassGPTNeoXTokenizer
Vocabulary Size50304
Torch Data Typebfloat16

Best Alternatives to Pythia 1B Deduped Tldr Sft

Best Alternatives
Context / RAM
Downloads
Likes
Pythia 2.8B Deduped Rp 710M 4K4K / 11.7 GB51
Pythia 1.4B Deduped Rp 420M 4K4K / 6.1 GB51
Pythia 1.4B Deduped Rp 280M 4K4K / 6.1 GB51
...eduped Tldr Preference Sft Trl2K / 2 GB140
Pythia 1B Kto Iter02K / 2 GB60
Pythia 1B Self Kto Iter02K / 2 GB60
...rAI Pythia 1B Deduped Sft Tldr2K / 4 GB23360
Rloo Trial22K / 2 GB80
Rloo Tldr2K / 2 GB50
TinyLM 1B Test22K / 2.7 GB60
Note: green Score (e.g. "73.2") means that the model is better than trl-lib/pythia-1b-deduped-tldr-sft.

Rank the Pythia 1B Deduped Tldr Sft Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 46445 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227