Open Calm Small by cyberagent

 ยป  All LLMs  ยป  cyberagent  ยป  Open Calm Small   URL Share it on

  Autotrain compatible   Dataset:cc100   Dataset:mc4   Dataset:wikipedia   Gpt neox   Ja   Japanese   Pytorch   Region:us

Open Calm Small Benchmarks

nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").
Open Calm Small (cyberagent/open-calm-small)

Open Calm Small Parameters and Internals

Model Type 
Transformer-based Language Model, Causal Language Model
Additional Notes 
OpenCALM is a suite of decoder-only language models pre-trained on Japanese datasets.
Supported Languages 
Japanese (Primary)
Training Details 
Data Sources:
Wikipedia (ja), Common Crawl (ja)
Methodology:
Fine-tuning
Model Architecture:
Decoder-only
Input Output 
Input Format:
Tokenized text input
Accepted Modalities:
text
Output Format:
Generated text output
LLM NameOpen Calm Small
Repository ๐Ÿค—https://huggingface.co/cyberagent/open-calm-small 
Required VRAM0.4 GB
Updated2025-02-22
Maintainercyberagent
Model Typegpt_neox
Model Files  0.4 GB
Supported Languagesja
Model ArchitectureGPTNeoXForCausalLM
Licensecc-by-sa-4.0
Context Length2048
Model Max Length2048
Transformers Version4.27.0.dev0
Tokenizer ClassGPTNeoXTokenizer
Padding Token<|padding|>
Vocabulary Size52096
Torch Data Typefloat16

Best Alternatives to Open Calm Small

Best Alternatives
Context / RAM
Downloads
Likes
Catlm8K / 7.8 GB454
...Prover 14final Checkpoint 58304K / 14.9 GB50
Neox Musenet Untrained4K / 7.3 GB60
Stabillm Instruct De4K / 31.8 GB50
Open Calm Large2K / 1.8 GB349510
MonoCoder OMP2K / 3.6 GB2020
KULLM RLHF2K / 25.8 GB20263
ProofGPT V0.12K / 2.9 GB19743
GPT NeoX Pretrain News2K / 0.3 GB3060
Step3 Mk72K / 25.8 GB180
Note: green Score (e.g. "73.2") means that the model is better than cyberagent/open-calm-small.

Rank the Open Calm Small Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 43470 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227