KULLM RLHF By Trofish: Benchmarks, Features and Detailed Analysis. Insights on KULLM RLHF.

Arxiv:2303.16634 Autotrain compatible Endpoints compatible Gpt neox Pytorch Region:us

Model Card on HF 🤗: https://huggingface.co/Trofish/KULLM-RLHF

KULLM RLHF Benchmarks

^nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").

What is the LLM Explorer Rank (Score)

KULLM RLHF Parameters and Internals

Model Type

Conversational AI, Chatbot

Use Cases

Areas:

Research, Chatbot Development

Applications:

Korean language conversational AI

Primary Use Cases:

Friendly and harmless everyday conversations in Korean

Additional Notes

The final model, implemented with RLHF and DeepSpeedChat, aims to produce high-quality conversational responses that are user-friendly and ethical.

Supported Languages

Korean (high)

Training Details

Data Sources:

Self-Instruct using GPT-4, RLHF with human feedback, DeepSpeed optimization

Methodology:

Reinforcement Learning from Human Feedback, Self-Instruct Data Augmentation, DeepSpeed for large-scale distributed deep learning

Hardware Used:

Google Colab A100 40GB GPU

Model Architecture:

Used KULLM as baseline model trained with RLHF and SFT (Supervised Fine-tuning) techniques

LLM Name	KULLM RLHF
Repository 🤗	https://huggingface.co/Trofish/KULLM-RLHF
Required VRAM	25.8 GB
Updated	2025-02-22
Maintainer	Trofish
Model Type	gpt_neox
Model Files	25.8 GB
Model Architecture	GPTNeoXForCausalLM
Context Length	2048
Model Max Length	2048
Transformers Version	4.31.0
Tokenizer Class	PreTrainedTokenizerFast
Padding Token	<\|endoftext\|>
Vocabulary Size	30008
Torch Data Type	float16

Best Alternatives to KULLM RLHF

Best Alternatives	Context / RAM	Downloads	Likes
Catlm	8K / 7.8 GB	45	4
...Prover 14final Checkpoint 5830	4K / 14.9 GB	5	0
Neox Musenet Untrained	4K / 7.3 GB	6	0
Stabillm Instruct De	4K / 31.8 GB	5	0
Open Calm Large	2K / 1.8 GB	3495	10
MonoCoder OMP	2K / 3.6 GB	202	0
ProofGPT V0.1	2K / 2.9 GB	1974	3
GPT NeoX Pretrain News	2K / 0.3 GB	306	0
Step3 Mk7	2K / 25.8 GB	18	0
GPT NeoX Pretrain 1GB	2K / 0.3 GB	162	0

Note: green Score (e.g. "73.2") means that the model is better than Trofish/KULLM-RLHF.

Rank the KULLM RLHF Capabilities

🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟

Instruction Following and Task Automation
Factuality and Completeness of Knowledge
Censorship and Alignment
Data Analysis and Insight Generation
Text Generation
Text Summarization and Feature Extraction
Code Generation
Multi-Language Support and Translation

What open-source LLMs or SLMs are you in search of? 43470 in total.

Email us: info@extractum.io. Our Privacy Policy | Terms and Conditions | Suggest an improvement.

Our Social Media →

Original data from HuggingFace, OpenCompass and various public git repos.

Release v20241227

Support LLM Explorer

KULLM RLHF by Trofish

» All LLMs » Trofish » KULLM RLHF URL Share it on

KULLM RLHF Benchmarks

KULLM RLHF Parameters and Internals

Best Alternatives to KULLM RLHF

Rank the KULLM RLHF Capabilities

What open-source LLMs or SLMs are you in search of? 43470 in total.