Kogpt J 350M by heegyu

 ยป  All LLMs  ยป  heegyu  ยป  Kogpt J 350M   URL Share it on

  Autotrain compatible Dataset:heegyu/korean-petition...   Dataset:heegyu/kowikitext Dataset:heegyu/namuwiki-extrac...   Endpoints compatible   Gptj   Jax   Ko   Pytorch   Region:us
Model Card on HF ๐Ÿค—: https://huggingface.co/heegyu/kogpt-j-350m 

Kogpt J 350M Benchmarks

nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").
Kogpt J 350M (heegyu/kogpt-j-350m)

Kogpt J 350M Parameters and Internals

Model Type 
text-generation
Additional Notes 
This model's training data may include various discriminatory/offensive data, and no removal process has been conducted. Consequently, the model may generate sentences that contain discriminatory/offensive statements regarding certain individuals, races, genders, or disabilities.
Supported Languages 
ko (Fluent)
Training Details 
Data Sources:
AIHub SNS ๋Œ€ํ™”, AIHub ๊ตฌ์–ด์ฒด, AIHub ๋„์„œ, AIHub ๋Œ€๊ทœ๋ชจ ์›น๋ฐ์ดํ„ฐ ๊ธฐ๋ฐ˜ ํ•œ๊ตญ์–ด ๋ง๋ญ‰์น˜, ํ•œ๊ตญ์–ด ์œ„ํ‚ค, ๋‚˜๋ฌด์œ„ํ‚ค, ๊ตญ๋ฆฝ๊ตญ์–ด์› ๋ฉ”์‹ ์ € ๋Œ€ํ™”, ๊ตญ๋ฆฝ๊ตญ์–ด์› ์ผ์ƒ๋Œ€ํ™” ๋ง๋ญ‰์น˜, ๊ตญ๋ฆฝ๊ตญ์–ด์› ๋ฌธ์–ด ๋ง๋ญ‰์น˜, ๊ตญ๋ฆฝ๊ตญ์–ด์› ๊ตฌ์–ด ๋ง๋ญ‰์น˜, ๊ตญ๋ฆฝ๊ตญ์–ด์› ์‹ ๋ฌธ ๋ง๋ญ‰์น˜, ์ฒญ์™€๋Œ€ ๊ตญ๋ฏผ์ฒญ์›
Data Volume:
Approx. 7B tokens
Context Length:
1024
Training Time:
2023/1/25 ~ 2023/1/29
Hardware Used:
TPU V2-8
Model Architecture:
20 Layers, 1024 hidden dim, 4096 intermediate, 16 heads, 51200 vocab size
LLM NameKogpt J 350M
Repository ๐Ÿค—https://huggingface.co/heegyu/kogpt-j-350m 
Model Size350m
Required VRAM1.4 GB
Updated2025-06-02
Maintainerheegyu
Model Typegptj
Model Files  1.4 GB
Supported Languagesko
Model ArchitectureGPTJForCausalLM
Licensemit
Transformers Version4.25.1
Tokenizer ClassGPT2Tokenizer
Beginning of Sentence Token<|endoftext|>
End of Sentence Token<|endoftext|>
Unk Token<|endoftext|>
Vocabulary Size51200
Torch Data Typefloat32
Activation Functiongelu_new
Errorsreplace

Best Alternatives to Kogpt J 350M

Best Alternatives
Context / RAM
Downloads
Likes
Codegen 350M List Manip 5 Len0K / 1.5 GB171
Codegen 350M Multi Gptj0K / 0.8 GB274
Codegen 350M Mono Gptj0K / 0.8 GB122

Rank the Kogpt J 350M Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 47771 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227