Bilingual GPT Neox 4B 8K By rinna: Benchmarks, Features and Detailed Analysis. Insights on Bilingual GPT Neox 4B 8K.

The model's standard configuration requires transformers version 4.31.0 or higher to operate correctly. Special attention to hyper-parameters is needed for optimal performance.

Supported Languages

English (proficient), Japanese (proficient)

Training Details

Data Sources:

Japanese CC-100, Japanese C4, The Pile, Redpajama, Wikipedia

Data Volume:

1.5 billion tokens

Methodology:

fine-tuning using RoPE positional interpolation

Context Length:

8192

Model Architecture:

A 36-layer, 2816-hidden-size transformer-based language model

Input Output

Performance Tips:

Since the model is sensitive to decoding hyper-parameters (e.g., temperature, top_p, top_k, repetition_penalty), it is suggested to explore the best setting for your task.

LLM Name	Bilingual GPT Neox 4B 8K
Repository 🤗	https://huggingface.co/rinna/bilingual-gpt-neox-4b-8k
Base Model(s)	Bilingual GPT Neox 4B rinna/bilingual-gpt-neox-4b
Model Size	4b
Required VRAM	7.7 GB
Updated	2025-02-22
Maintainer	rinna
Model Type	gpt_neox
Model Files	7.7 GB 7.7 GB
Supported Languages	ja en
Context Length	8k
Model Architecture	GPTNeoXForCausalLM
License	mit
Context Length	2048
Model Max Length	2048
Tokenizer Class	T5Tokenizer
Padding Token	[PAD]
Vocabulary Size	65536
Torch Data Type	float16

Best Alternatives to Bilingual GPT Neox 4B 8K

Best Alternatives	Context / RAM	Downloads	Likes
Bilingual GPT Neox 4B	2K / 7.7 GB	5108	29
...al GPT Neox 4B Instruction Ppo	2K / 7.7 GB	834	15
Sft Tldr Pythia 1 4b	2K / 5.7 GB	5	0
Tora 4B	2K / 7.6 GB	76	2
...x 4B Instruction Sft En Ja 84K	2K / 7.6 GB	89	1
...al GPT Neox 4B Instruction Sft	2K / 7.6 GB	365	18
StellarX 4B V0.2	2K / 16 GB	2154	2
StellarX 4B V0	2K / 8.1 GB	2118	1
StellarX 4B V0.2 GPTQ	2K / 1.8 GB	10	1

Note: green Score (e.g. "73.2") means that the model is better than rinna/bilingual-gpt-neox-4b-8k.

Rank the Bilingual GPT Neox 4B 8K Capabilities

🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟

Instruction Following and Task Automation
Factuality and Completeness of Knowledge
Censorship and Alignment
Data Analysis and Insight Generation
Text Generation
Text Summarization and Feature Extraction
Code Generation
Multi-Language Support and Translation

What open-source LLMs or SLMs are you in search of? 43470 in total.

Email us: info@extractum.io. Our Privacy Policy | Terms and Conditions | Suggest an improvement.

Our Social Media →

Original data from HuggingFace, OpenCompass and various public git repos.

Release v20241227

Support LLM Explorer

Bilingual GPT Neox 4B 8K by rinna

» All LLMs » rinna » Bilingual GPT Neox 4B 8K URL Share it on

Bilingual GPT Neox 4B 8K Benchmarks

Bilingual GPT Neox 4B 8K Parameters and Internals

Best Alternatives to Bilingual GPT Neox 4B 8K

Rank the Bilingual GPT Neox 4B 8K Capabilities

What open-source LLMs or SLMs are you in search of? 43470 in total.