MiniCPM MoE 8x2B by openbmb

 ยป  All LLMs  ยป  openbmb  ยป  MiniCPM MoE 8x2B   URL Share it on

  Autotrain compatible   Conversational   Custom code   Moe   Pytorch   Region:us

MiniCPM MoE 8x2B Benchmarks

nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").
MiniCPM MoE 8x2B (openbmb/MiniCPM-MoE-8x2B)

MiniCPM MoE 8x2B Parameters and Internals

Model Type 
decoder-only, transformer-based, generative, Mixture-of-Experts (MoE)
Additional Notes 
- Instruction tuned but without other RLHF methods.\n- Model weights are in bfloat16 precision.\n- For more inference throughput, use vLLM (>=0.4.1) which is compatible with this model.
LLM NameMiniCPM MoE 8x2B
Repository ๐Ÿค—https://huggingface.co/openbmb/MiniCPM-MoE-8x2B 
Model Size2b
Required VRAM27.7 GB
Updated2024-12-21
Maintaineropenbmb
Model Files  27.7 GB
Model ArchitectureMiniCPMForCausalLM
Context Length4096
Model Max Length4096
Transformers Version4.36.0
Tokenizer ClassLlamaTokenizer
Vocabulary Size122753
Torch Data Typebfloat16

Best Alternatives to MiniCPM MoE 8x2B

Best Alternatives
Context / RAM
Downloads
Likes
MiniCPM 2B 128K64K / 6 GB66941
MiniCPM 2B Sft Fp324K / 10.9 GB4216295
MiniCPM 2B Sft Bf164K / 5.5 GB8362118
...iCPM 2B RAFT Lora Hotpotqa Dev4K / 5.5 GB90
MiniCPM Duplex4K / 5.5 GB132
MiniCPM 2B DPO Bf164K / 5.5 GB69347
...iniCPM 2B DPO Fp32 Safetensors4K / 10.9 GB111
...iniCPM 2B DPO Bf16 Safetensors4K / 5.5 GB71
...iniCPM 2B Sft Fp32 Safetensors4K / 10.9 GB71
...iniCPM 2B Sft Fp32 Safetensors4K / 10.9 GB61
Note: green Score (e.g. "73.2") means that the model is better than openbmb/MiniCPM-MoE-8x2B.

Rank the MiniCPM MoE 8x2B Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 40013 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241217