Korean GPT Neox 125M by cateto

 ยป  All LLMs  ยป  cateto  ยป  Korean GPT Neox 125M   URL Share it on

  Autotrain compatible   Endpoints compatible   Gpt neox   Ko   Pytorch   Region:us   Safetensors

Korean GPT Neox 125M Benchmarks

nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").
Korean GPT Neox 125M (cateto/korean-gpt-neox-125M)

Korean GPT Neox 125M Parameters and Internals

Model Type 
text generation
Supported Languages 
Korean (fluent)
Input Output 
Accepted Modalities:
text
Output Format:
text
LLM NameKorean GPT Neox 125M
Repository ๐Ÿค—https://huggingface.co/cateto/korean-gpt-neox-125M 
Model Size125m
Required VRAM0.4 GB
Updated2025-02-22
Maintainercateto
Model Typegpt_neox
Model Files  0.4 GB   0.4 GB
Supported Languagesko
Model ArchitectureGPTNeoXForCausalLM
Licensecc-by-3.0
Context Length2048
Model Max Length2048
Transformers Version4.28.1
Vocabulary Size52096
Torch Data Typefloat16

Best Alternatives to Korean GPT Neox 125M

Best Alternatives
Context / RAM
Downloads
Likes
Pythia 125M Storywriter2K / 0.6 GB1220
... 125M Response Full Static Sft2K / 0.7 GB1521
Pythia 125M Static Sft2K / 0.7 GB71
Openchatgpt Neox 125M2K / 0.7 GB1544
Taco2K / 0.7 GB91
Note: green Score (e.g. "73.2") means that the model is better than cateto/korean-gpt-neox-125M.

Rank the Korean GPT Neox 125M Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 43470 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227