EastAsia 4x7B MoE Experiment by Heng666

 ยป  All LLMs  ยป  Heng666  ยป  EastAsia 4x7B MoE Experiment   URL Share it on

  Augmxnt/shisa-7b-v1   Autotrain compatible   Beomi/open-solar-ko-10.7b   Endpoints compatible   Instruct   Ja   Ko   Lazymergekit Mediatek-research/breeze-7b-in...   Merge   Mergekit   Mixtral   Model-index   Moe   Region:us   Safetensors   Sharded   Tensorflow   Tw   Zh

EastAsia 4x7B MoE Experiment Benchmarks

EastAsia 4x7B MoE Experiment (Heng666/EastAsia-4x7B-Moe-experiment)

EastAsia 4x7B MoE Experiment Parameters and Internals

Model Type 
Mixture of Experts, text generation
Additional Notes 
EastAsia-4x7B-Moe-experiment is a Mixture of Experts model created using LazyMergekit and combines several base models.
Supported Languages 
zh (unknown proficiency), ja (unknown proficiency), ko (unknown proficiency), tw (unknown proficiency)
LLM NameEastAsia 4x7B MoE Experiment
Repository ๐Ÿค—https://huggingface.co/Heng666/EastAsia-4x7B-Moe-experiment 
Model Size18.5b
Required VRAM37.1 GB
Updated2025-03-14
MaintainerHeng666
Model Typemixtral
Instruction-BasedYes
Model Files  9.9 GB: 1-of-4   10.0 GB: 2-of-4   10.0 GB: 3-of-4   7.2 GB: 4-of-4
Supported Languageszh ja ko tw
Model ArchitectureMixtralForCausalLM
Licenseapache-2.0
Context Length32768
Model Max Length32768
Transformers Version4.36.2
Tokenizer ClassLlamaTokenizer
Padding Token<s>
Vocabulary Size32000
Torch Data Typebfloat16

Best Alternatives to EastAsia 4x7B MoE Experiment

Best Alternatives
Context / RAM
Downloads
Likes
Rava 3x7B V0.132K / 37.1 GB171

Rank the EastAsia 4x7B MoE Experiment Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 45019 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227