Pythia 1.4B Helpful Sft By lomahony: Benchmarks, Features and Detailed Analysis. Insights on Pythia 1.4B Helpful Sft.

Arxiv:2101.00027 Autotrain compatible Dataset:anthropic/hh-rlhf En Endpoints compatible Gpt neox Pythia Pytorch Region:us

Model Card on HF 🤗: https://huggingface.co/lomahony/pythia-1.4b-helpful-sft

Pythia 1.4B Helpful Sft Benchmarks

LLME Score: 0.14224

^nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").

What is the LLM Explorer Rank (Score)

Pythia 1.4B Helpful Sft (lomahony/pythia-1.4b-helpful-sft)

Pythia 1.4B Helpful Sft Parameters and Internals

Model Type

causal-lm

Additional Notes

Fully reproducible finetuning code is available on GitHub.

Training Details

Data Sources:

Anthropic/hh-rlhf

Methodology:

Supervised fine-tuning using TRLx library

Training Time:

1 epoch

LLM Name	Pythia 1.4B Helpful Sft
Repository 🤗	https://huggingface.co/lomahony/pythia-1.4b-helpful-sft
Model Size	1.4b
Required VRAM	2.8 GB
Updated	2025-02-22
Maintainer	lomahony
Model Type	gpt_neox
Model Files	2.8 GB
Supported Languages	en
Model Architecture	GPTNeoXForCausalLM
License	apache-2.0
Context Length	2048
Model Max Length	2048
Transformers Version	4.34.1
Tokenizer Class	GPTNeoXTokenizer
Padding Token	<\|padding\|>
Vocabulary Size	50304
Torch Data Type	bfloat16

Best Alternatives to Pythia 1.4B Helpful Sft

Best Alternatives	Context / RAM	Downloads	Likes
Pythia 1.4B Deduped 8K Base	8K / 7.3 GB	4	1
Pythia 1.4B Deduped 4K Base	4K / 6.1 GB	162	2
Pythia 1.4B Sft Full	2K / 2.8 GB	86	1
RCC Ins Reconstruction	2K / 6.1 GB	184	1
Pythia Delphi Suboptimal	2K / 5.7 GB	6	0
Pythia 1.4B	2K / 2.9 GB	37537	23
Pythia 1.4B Deduped Sharegpt	2K / 2.8 GB	2004	2
Pythia 1.4B Deduped Sharegpt	2K / 2.8 GB	2060	0
Pythia 1.4b Sft Policy	2K / 2.9 GB	225	1
Pythia 1.4B Deduped	2K / 2.9 GB	13737	19

Note: green Score (e.g. "73.2") means that the model is better than lomahony/pythia-1.4b-helpful-sft.

Rank the Pythia 1.4B Helpful Sft Capabilities

🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟

Instruction Following and Task Automation
Factuality and Completeness of Knowledge
Censorship and Alignment
Data Analysis and Insight Generation
Text Generation
Text Summarization and Feature Extraction
Code Generation
Multi-Language Support and Translation

What open-source LLMs or SLMs are you in search of? 43470 in total.

Email us: info@extractum.io. Our Privacy Policy | Terms and Conditions | Suggest an improvement.

Our Social Media →

Original data from HuggingFace, OpenCompass and various public git repos.

Release v20241227

Support LLM Explorer

Pythia 1.4B Helpful Sft by lomahony

» All LLMs » lomahony » Pythia 1.4B Helpful Sft URL Share it on

Pythia 1.4B Helpful Sft Benchmarks

Pythia 1.4B Helpful Sft Parameters and Internals

Best Alternatives to Pythia 1.4B Helpful Sft

Rank the Pythia 1.4B Helpful Sft Capabilities

What open-source LLMs or SLMs are you in search of? 43470 in total.