Sarashina1 13B by sbintuitions

 ยป  All LLMs  ยป  sbintuitions  ยป  Sarashina1 13B   URL Share it on

  Autotrain compatible   Endpoints compatible   Gpt neox   Ja   Pytorch   Region:us   Sharded

Sarashina1 13B Benchmarks

nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").
Sarashina1 13B (sbintuitions/sarashina1-13b)

Sarashina1 13B Parameters and Internals

Model Type 
Language Model
Additional Notes 
Sarashina1 has not been tuned to follow an instruction yet and might generate some meaningless sequences, some inaccurate instances or biased/objectionable outputs.
Supported Languages 
Japanese (native)
Training Details 
Data Sources:
Common Crawl corpus
Data Volume:
550B tokens
Model Architecture:
GPTNeoX
LLM NameSarashina1 13B
Repository ๐Ÿค—https://huggingface.co/sbintuitions/sarashina1-13b 
Model Size13b
Required VRAM26.3 GB
Updated2025-02-22
Maintainersbintuitions
Model Typegpt_neox
Model Files  10.0 GB: 1-of-3   9.9 GB: 2-of-3   6.4 GB: 3-of-3
Supported Languagesja
Model ArchitectureGPTNeoXForCausalLM
Licensemit
Context Length2048
Model Max Length2048
Transformers Version4.30.2
Tokenizer ClassPreTrainedTokenizerFast
Padding Token<pad>
Vocabulary Size51200
Torch Data Typefloat16

Best Alternatives to Sarashina1 13B

Best Alternatives
Context / RAM
Downloads
Likes
CodeFuse 13B4K / 54.6 GB3248
Polyglot Ko Kullm V2 Fix2K / 51.7 GB20100
Pythia 13B Deduped Green Devil2K / 23.9 GB218110
KORani V1 13B2K / 51.8 GB357
CodeFuse 13B GPTQ4K / 8.6 GB374

Rank the Sarashina1 13B Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 43470 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227