Japanese Large LM 3.6B by line-corporation


Tags: Autotrain compatible, Dataset:cc100, Dataset:mc4, Dataset:oscar, Dataset:wikipedia, Endpoints compatible, Gpt neox, Ja, Pytorch, Region:us, Safetensors


Japanese Large LM 3.6B Parameters and Internals

Model Type 
text generation, Japanese language model
Use Cases 
Areas:
Research, Commercial applications
Applications:
Text generation, Japanese language applications
Primary Use Cases:
Japanese text generation, Japanese language applications
Additional Notes 
Uses a SentencePiece tokenizer with a unigram language model and byte-fallback; no pre-tokenization is applied.
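A minimal loading sketch, assuming the standard transformers API; the model id and T5Tokenizer class come from the card's fields below, while use_fast=False (to get the SentencePiece-backed slow tokenizer) and the example sentence are illustrative assumptions:

from transformers import AutoTokenizer

# The card lists T5Tokenizer (SentencePiece-backed) as the tokenizer class;
# use_fast=False requests that slow SentencePiece implementation.
tokenizer = AutoTokenizer.from_pretrained(
    "line-corporation/japanese-large-lm-3.6b",
    use_fast=False,
)

# Unigram + byte-fallback segmentation of a raw Japanese sentence
# ("Hello, world!"), with no pre-tokenization step.
print(tokenizer.tokenize("こんにちは、世界!"))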
Supported Languages 
ja (high proficiency)
Training Details 
Data Sources:
Wikipedia, mC4, CC-100, OSCAR, and web texts crawled by an in-house system
Data Volume:
650 GB
Methodology:
Standard language model training
Model Architecture:
GPT-NeoX with rotary position embeddings (RoPE) for position encoding
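For intuition, a minimal sketch of rotary position embeddings. This is not the model's actual code: the real implementation lives in transformers' GPT-NeoX modules and rotates only a configurable fraction of each attention head's dimensions (rotary_pct), which this sketch omits.

import torch

def apply_rope(x: torch.Tensor, base: float = 10000.0) -> torch.Tensor:
    # x: (seq_len, num_heads, head_dim). RoPE rotates channel pairs by a
    # position-dependent angle instead of adding a position embedding, so
    # relative offsets are encoded directly in attention dot products.
    seq_len, _, head_dim = x.shape
    half = head_dim // 2
    inv_freq = base ** (-torch.arange(half, dtype=torch.float32) / half)
    angles = torch.arange(seq_len, dtype=torch.float32)[:, None] * inv_freq
    cos = angles.cos()[:, None, :]  # broadcast over the heads axis
    sin = angles.sin()[:, None, :]
    x1, x2 = x[..., :half], x[..., half:]
    return torch.cat([x1 * cos - x2 * sin, x1 * sin + x2 * cos], dim=-1)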
Input Output 
Input Format:
Raw Japanese sentences
Accepted Modalities:
text
Output Format:
Generated text in Japanese
Performance Tips:
Use a GPU for faster inference, and set a random seed for reproducibility.
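Putting those tips together, a hedged end-to-end sketch: float16 weights to match the ~7.2 GB VRAM figure below, GPU if available, and a fixed seed. The prompt and sampling settings are illustrative assumptions, not the maintainer's recommendation.

import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, set_seed

model_id = "line-corporation/japanese-large-lm-3.6b"
tokenizer = AutoTokenizer.from_pretrained(model_id, use_fast=False)
model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype=torch.float16)
model = model.to("cuda" if torch.cuda.is_available() else "cpu")

set_seed(42)  # fix the seed so sampled output is reproducible

# Feed a raw Japanese sentence ("Good morning, today's weather is")
# and generate a continuation.
inputs = tokenizer("おはようございます、今日の天気は", return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=32, do_sample=True, top_p=0.95)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))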
LLM Name: Japanese Large LM 3.6B
Repository: https://huggingface.co/line-corporation/japanese-large-lm-3.6b
Model Size: 3.6b
Required VRAM: 7.2 GB
Updated: 2025-02-22
Maintainer: line-corporation
Model Type: gpt_neox
Model Files: 7.2 GB, 7.2 GB
Supported Languages: ja
Model Architecture: GPTNeoXForCausalLM
License: apache-2.0
Context Length: 2048
Model Max Length: 2048
Transformers Version: 4.29.2
Tokenizer Class: T5Tokenizer
Padding Token: <pad>
Vocabulary Size: 51200
Torch Data Type: float16

Best Alternatives to Japanese Large LM 3.6B

Best Alternatives | Context / RAM | Downloads | Likes
Japanese GPT Neox 3.6B | 2K / 7.4 GB | 3805 | 98
...y Jimba Japanese Large Lm 3.6B | 2K / 7.1 GB | 64 | 0
...rrowSmartPlus 3.6B Instruction | 2K / 14.3 GB | 5 | 1
...rrowSmartPlus 3.6B Instant Sft | 2K / 14.3 GB | 7 | 1
...rtPlus 3.6B Instant Sft JHSVer | 2K / 14.3 GB | 9 | 1
...T Neox 3.6B Instruction Sft V2 | 2K / 7.4 GB | 54638 | 26
... Large Lm 3.6B Instruction Sft | 2K / 7.2 GB | 890 | 25
... GPT Neox 3.6B Instruction Ppo | 2K / 7.4 GB | 2587 | 70
... GPT Neox 3.6B Instruction Sft | 2K / 7.4 GB | 900 | 101
...tion Sft 8bit 1g Actorder True | 2K / 2.8 GB | 84 | 3



Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227