Sarashina1 65B by sbintuitions

 ยป  All LLMs  ยป  sbintuitions  ยป  Sarashina1 65B   URL Share it on

  Autotrain compatible   Endpoints compatible   Gpt neox   Ja   Pytorch   Region:us   Sharded

Sarashina1 65B Benchmarks

nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").
Sarashina1 65B (sbintuitions/sarashina1-65b)

Sarashina1 65B Parameters and Internals

Model Type 
Language Model
Additional Notes 
Sarashina1 has not been tuned to follow an instruction yet and might generate some meaningless sequences, some inaccurate instances or biased/objectionable outputs.
Supported Languages 
Japanese (native)
Training Details 
Data Sources:
Common Crawl corpus
Data Volume:
550B tokens
Model Architecture:
GPTNeoX
LLM NameSarashina1 65B
Repository ๐Ÿค—https://huggingface.co/sbintuitions/sarashina1-65b 
Model Size65b
Required VRAM131 GB
Updated2025-02-22
Maintainersbintuitions
Model Typegpt_neox
Model Files  10.0 GB: 1-of-14   9.7 GB: 2-of-14   9.7 GB: 3-of-14   9.7 GB: 4-of-14   9.7 GB: 5-of-14   9.7 GB: 6-of-14   9.7 GB: 7-of-14   9.7 GB: 8-of-14   9.7 GB: 9-of-14   9.7 GB: 10-of-14   9.7 GB: 11-of-14   9.7 GB: 12-of-14   9.7 GB: 13-of-14   4.6 GB: 14-of-14
Supported Languagesja
Model ArchitectureGPTNeoXForCausalLM
Licensemit
Context Length2048
Model Max Length2048
Transformers Version4.30.2
Tokenizer ClassPreTrainedTokenizerFast
Padding Token<pad>
Vocabulary Size51200
Torch Data Typefloat16

Rank the Sarashina1 65B Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 43470 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227