Pythia 160M Helpful Sft By lomahony: Benchmarks, Features and Detailed Analysis. Insights on Pythia 160M Helpful Sft.

Arxiv:2101.00027 Autotrain compatible Dataset:anthropic/hh-rlhf En Endpoints compatible Gpt neox Pythia Pytorch Region:us Safetensors

Model Card on HF 🤗: https://huggingface.co/lomahony/pythia-160m-helpful-sft

Pythia 160M Helpful Sft Benchmarks

LLME Score: 0.13027

^nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").

What is the LLM Explorer Rank (Score)

Pythia 160M Helpful Sft (lomahony/pythia-160m-helpful-sft)

Pythia 160M Helpful Sft Parameters and Internals

Model Type

causal-lm

Additional Notes

Model was finetuned with the helpful subset of the Anthropic-hh-rlhf dataset. Checkpoints and fully reproducible code available on GitHub.

Training Details

Data Sources:

Anthropic/hh-rlhf

Methodology:

Supervised finetuning using TRLx library with the helpful subset of Anthropic-hh-rlhf dataset for 1 epoch.

LLM Name	Pythia 160M Helpful Sft
Repository 🤗	https://huggingface.co/lomahony/pythia-160m-helpful-sft
Model Size	160m
Required VRAM	0.3 GB
Updated	2025-06-01
Maintainer	lomahony
Model Type	gpt_neox
Model Files	0.3 GB 0.3 GB
Supported Languages	en
Model Architecture	GPTNeoXForCausalLM
License	apache-2.0
Context Length	2048
Model Max Length	2048
Transformers Version	4.34.1
Tokenizer Class	GPTNeoXTokenizer
Padding Token	<\|padding\|>
Vocabulary Size	50304
Torch Data Type	bfloat16

Best Alternatives to Pythia 160M Helpful Sft

Best Alternatives	Context / RAM	Downloads	Likes
Pythia 160M C2s	8K / 0.6 GB	49	6
Pythia 160M Xsum Roya	2K / 0.6 GB	18	0
Pythia 160M	2K / 0.4 GB	143715	32
Pythia 160m Sft	2K / 0 GB	16	0
Sheared Pythia 160M	2K / 0.7 GB	12	4
Pythia 160M Dolphin Extended	2K / 0.3 GB	31	0
Pythia 160M Storytelling	2K / 0.3 GB	23	0
Pythia 160M Deduped	2K / 0.4 GB	42451	3
Pythia160m Sft Tldr	2K / 0.6 GB	19	0
Pythia 160m Ft CookingRecipes	2K / 0.6 GB	11	0

Note: green Score (e.g. "73.2") means that the model is better than lomahony/pythia-160m-helpful-sft.

Rank the Pythia 160M Helpful Sft Capabilities

🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟

Instruction Following and Task Automation
Factuality and Completeness of Knowledge
Censorship and Alignment
Data Analysis and Insight Generation
Text Generation
Text Summarization and Feature Extraction
Code Generation
Multi-Language Support and Translation

What open-source LLMs or SLMs are you in search of? 47770 in total.

Email us: info@extractum.io. Our Privacy Policy | Terms and Conditions | Suggest an improvement.

Our Social Media →

Original data from HuggingFace, OpenCompass and various public git repos.

Release v20241227

Support LLM Explorer

Pythia 160M Helpful Sft by lomahony

» All LLMs » lomahony » Pythia 160M Helpful Sft URL Share it on

Pythia 160M Helpful Sft Benchmarks

Pythia 160M Helpful Sft Parameters and Internals

Best Alternatives to Pythia 160M Helpful Sft

Rank the Pythia 160M Helpful Sft Capabilities

What open-source LLMs or SLMs are you in search of? 47770 in total.