Bahasa 4B by Bahasalab


Tags: Autotrain compatible, Conversational, Endpoints compatible, Id, Qwen2, Region:us, Safetensors, Sharded, Tensorflow
Model Card on HF 🤗: https://huggingface.co/Bahasalab/Bahasa-4b


Bahasa 4B Parameters and Internals

Model Type: NLP
Use Cases:
  Areas: NLP tasks, Indonesian language understanding
  Applications: question answering, sentiment analysis, document summarization
Supported Languages: Indonesian (proficient)
Training Details:
  Data Sources: Indonesian dataset
  Data Volume: 10 billion text
  Methodology: continued training
LLM Name: Bahasa 4B
Repository 🤗: https://huggingface.co/Bahasalab/Bahasa-4b
Model Size: 4b
Required VRAM: 7.9 GB
Updated: 2025-02-22
Maintainer: Bahasalab
Model Type: qwen2
Model Files: 5.0 GB (1-of-2), 2.9 GB (2-of-2), 0.0 GB
Supported Languages: id
Model Architecture: Qwen2ForCausalLM
License: other
Context Length: 32768
Model Max Length: 32768
Transformers Version: 4.39.1
Tokenizer Class: Qwen2Tokenizer
Padding Token: <|endoftext|>
Vocabulary Size: 151936
Torch Data Type: bfloat16
Errors: replace
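
The figures in this listing can be cross-checked with a little arithmetic, and they map fairly directly onto a Hugging Face `transformers` load call. The sketch below is illustrative only: it assumes 1 GB = 1e9 bytes, assumes the weights dominate memory use, and the helper names (`total_weight_gb`, `implied_param_count`, `load_model`) are hypothetical, not part of the model card. Under those assumptions the two shards sum to the listed 7.9 GB, and at 2 bytes per bfloat16 weight that implies roughly 3.95 billion parameters, consistent with the "4b" size label.

```python
# Values copied from the listing above.
MODEL_ID = "Bahasalab/Bahasa-4b"
CONTEXT_LENGTH = 32768
SHARD_SIZES_GB = [5.0, 2.9]   # the two weight shards (1-of-2, 2-of-2)
BYTES_PER_PARAM = 2           # bfloat16 stores each weight in 2 bytes


def total_weight_gb(shard_sizes_gb):
    """Sum the shard sizes; compare the result with the listed VRAM figure."""
    return sum(shard_sizes_gb)


def implied_param_count(weight_gb, bytes_per_param=BYTES_PER_PARAM):
    """Rough parameter count, assuming 1 GB = 1e9 bytes (an assumption)."""
    return weight_gb * 1e9 / bytes_per_param


def load_model(model_id=MODEL_ID):
    """Load tokenizer and model in bfloat16 (this downloads the weights).

    Imports are local so the arithmetic helpers above work even when
    torch and transformers are not installed.
    """
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(
        model_id, torch_dtype=torch.bfloat16
    )
    return tokenizer, model


if __name__ == "__main__":
    weights_gb = total_weight_gb(SHARD_SIZES_GB)
    print(f"weights: {weights_gb:.1f} GB")
    print(f"implied parameters: {implied_param_count(weights_gb) / 1e9:.2f} billion")
```

Calling `load_model()` fetches the full weight files, so it needs network access and enough memory for the weights plus activations. The returned pair can then be used in the usual way, for example by tokenizing an Indonesian prompt with `return_tensors="pt"` and passing the result to `model.generate`.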

Best Alternatives to Bahasa 4B

Best Alternatives               Context / RAM   Downloads, Likes (digits fused)
Qwarkstar 4B Instruct Preview   32K / 9 GB      1622
Qwarkstar 4B                    32K / 9 GB      1630
Qwen1.5 4B Chat                 32K / 7.9 GB    4490338
Qwen1.5 4B                      32K / 7.9 GB    802035
Qwarkstar 4B Instruct           32K / 9 GB      301
Sailor 4B                       32K / 7.9 GB    2086
Nusantara 4B Indo Chat          32K / 7.9 GB    2021
Sailor 4B Chat                  32K / 7.9 GB    1412
Reyna CoT 4B V0.1               32K / 7.9 GB    1706
Qwen 4B Flock 1719855002        32K / 7.9 GB    620


Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227