NaruMOE 3x7B V2 by Alsebay

Tags: Autotrain compatible · Endpoints compatible · Merge · Mixtral · MoE · Region: US · Roleplay · Safetensors · Sharded · Tensorflow · Base models (merge): Alsebay/NarumashiRTS-V2, Nitral-AI/KukulStanta-7B, SanjiWatsuki/Kunoichi-DPO-v2-7B
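
The Mixtral / MoE tags mean this checkpoint is a sparse mixture-of-experts model: a learned router sends each token through a small top-k subset of expert MLPs (here, three experts merged from the base models above). The toy sketch below illustrates top-k routing in standalone PyTorch; it is not the actual Mixtral modeling code, and k=2 is assumed (the stock Mixtral default) since this repo's routing config is not shown here.

```python
# Toy top-k MoE routing sketch (illustrative only, not the transformers
# Mixtral implementation). Each token is processed by k of n expert MLPs,
# weighted by a softmax over the router's top-k logits.
import torch
import torch.nn.functional as F

n_experts, k, hidden = 3, 2, 16          # 3 experts, as in this 3x7B merge
router = torch.nn.Linear(hidden, n_experts)
experts = torch.nn.ModuleList(
    torch.nn.Sequential(torch.nn.Linear(hidden, 4 * hidden),
                        torch.nn.SiLU(),
                        torch.nn.Linear(4 * hidden, hidden))
    for _ in range(n_experts)
)

x = torch.randn(5, hidden)                    # 5 token hidden states
logits = router(x)                            # (5, n_experts) routing scores
weights, idx = torch.topk(logits, k, dim=-1)  # pick top-k experts per token
weights = F.softmax(weights, dim=-1)          # normalize over chosen experts

out = torch.zeros_like(x)
for token in range(x.size(0)):
    for slot in range(k):
        e = idx[token, slot].item()
        out[token] += weights[token, slot] * experts[e](x[token])
print(out.shape)  # torch.Size([5, 16])
```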

NaruMOE 3x7B V2 Parameters and Internals

LLM Name: NaruMOE 3x7B V2
Repository: 🤗 https://huggingface.co/Alsebay/NaruMOE-3x7B-v2
Base Model(s): Alsebay/NarumashiRTS-V2, SanjiWatsuki/Kunoichi-DPO-v2-7B, Nitral-AI/KukulStanta-7B
Model Size: 18.5B
Required VRAM: 37.1 GB
Updated: 2024-09-07
Maintainer: Alsebay
Model Type: mixtral
Model Files: 19 safetensors shards (shard 1: 1.9 GB; shards 2-18: 2.0 GB each; shard 19: 1.2 GB; 37.1 GB total)
Model Architecture: MixtralForCausalLM
License: cc-by-nc-4.0
Context Length: 32768
Model Max Length: 32768
Transformers Version: 4.39.3
Tokenizer Class: LlamaTokenizer
Padding Token: <s>
Vocabulary Size: 32000
Torch Data Type: bfloat16
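
The 18.5B size and 37.1 GB VRAM figures are consistent with a three-expert Mixtral-style MoE built from 7B models: attention, embedding, and norm weights are shared across experts, while each expert contributes its own MLP stack. A back-of-the-envelope check, assuming standard Mistral-7B dimensions (an assumption; the exact values live in this repo's config.json):

```python
# Rough parameter-count check for a 3x7B Mixtral-style MoE.
# Dimensions are the usual Mistral-7B ones; treat this as an
# approximation, not the exact NaruMOE-3x7B-v2 config.
hidden = 4096
intermediate = 14336
layers = 32

mistral_total = 7.24e9                     # ~7.24B params in Mistral-7B
mlp_per_layer = 3 * hidden * intermediate  # gate, up, down projections
mlp_total = layers * mlp_per_layer         # ~5.64B MLP params per expert
shared = mistral_total - mlp_total         # attention, embeddings, norms

moe_total = shared + 3 * mlp_total         # three expert MLP sets
print(f"~{moe_total / 1e9:.1f}B params")            # ~18.5B, as listed
print(f"~{moe_total * 2 / 1e9:.1f} GB in bfloat16")  # ~37.0 GB, near the 37.1 GB listed
```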
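
A minimal sketch of inspecting and loading the model with Hugging Face transformers. It assumes transformers >= 4.39.3 (per the listing) plus accelerate, and enough memory for the ~37 GB bfloat16 checkpoint; device_map="auto" will offload to CPU if the GPU is too small.

```python
# Minimal inspection/loading/generation sketch for Alsebay/NaruMOE-3x7B-v2.
import torch
from transformers import AutoConfig, AutoModelForCausalLM, AutoTokenizer

model_id = "Alsebay/NaruMOE-3x7B-v2"

# Inspect the MoE config without downloading the weights.
cfg = AutoConfig.from_pretrained(model_id)
print(cfg.model_type, cfg.num_local_experts, cfg.max_position_embeddings)

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # stored dtype per the listing
    device_map="auto",           # shard/offload across available devices
)

prompt = "Write a short roleplay scene between two rival mages."
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
out = model.generate(**inputs, max_new_tokens=200, do_sample=True, temperature=0.8)
print(tokenizer.decode(out[0], skip_special_tokens=True))
```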

Best Alternatives to NaruMOE 3x7B V2

Best Alternatives               Context / RAM    Downloads   Likes
Lumina 3.5                      32K / 37.1 GB    2559        0
Topxtral 4x7B V0.1              32K / 37.1 GB    4894        4
EastAsia 4x7B MoE Experiment    32K / 37.1 GB    678         1
Hyperion 3.0 Mixtral 3x7B       32K / 37.1 GB    58          4
Blitz AI MoE V0.7               32K / 37.1 GB    58          1
Blitz AI MoE V0.4               32K / 37.1 GB    59          1
HeroBophades 3x7B               32K / 37.1 GB    4           1
Wizard Kun Lake 3x7B MoE        32K / 37.1 GB    1           1
MoE 3x7b QA Code Inst           32K / 37 GB      79          4
Pearl 3x7B                      32K / 37.1 GB    64          1

Original data from HuggingFace, OpenCompass and various public git repos.
Release v2024072803