MixtureofMerges MoE 4x7b V5 by jsfs11


Tags: autotrain compatible, endpoints compatible, frankenmoe, lazymergekit, merge, mergekit, mixtral, model-index, moe, region:us, safetensors, sharded, tensorflow

MixtureofMerges MoE 4x7b V5 Parameters and Internals

LLM Name: MixtureofMerges MoE 4x7b V5
Repository: 🤗 https://huggingface.co/jsfs11/MixtureofMerges-MoE-4x7b-v5
Base Model(s): paulml/OmniBeagleSquaredMBX-v3-7B-v2, mlabonne/AlphaMonarch-7B, Kukedlc/Neural4gsm8k, eren23/dpo-binarized-NeutrixOmnibe-7B
Model Size: 24.2b
Required VRAM: 48.3 GB
Updated: 2024-09-16
Maintainer: jsfs11
Model Type: mixtral
Model Files: 9.9 GB (1-of-5), 10.0 GB (2-of-5), 10.0 GB (3-of-5), 10.0 GB (4-of-5), 8.4 GB (5-of-5)
Model Architecture: MixtralForCausalLM
License: apache-2.0
Context Length: 32768
Model Max Length: 32768
Transformers Version: 4.38.1
Tokenizer Class: LlamaTokenizer
Padding Token: <s>
Vocabulary Size: 32000
Torch Data Type: bfloat16
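Given the MixtralForCausalLM architecture, bfloat16 weights, and the 48.3 GB VRAM requirement listed above, the checkpoint loads through the standard Transformers API. The snippet below is a minimal sketch based on those table entries, not an official example from the maintainer; the prompt and generation settings are illustrative assumptions.

```python
# Minimal loading sketch, assuming transformers >= 4.38.1 (per the spec table)
# and roughly 48.3 GB of GPU memory for the bfloat16 weights.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

repo_id = "jsfs11/MixtureofMerges-MoE-4x7b-v5"  # repository from the table above

tokenizer = AutoTokenizer.from_pretrained(repo_id)  # resolves to LlamaTokenizer
model = AutoModelForCausalLM.from_pretrained(       # resolves to MixtralForCausalLM
    repo_id,
    torch_dtype=torch.bfloat16,  # matches "Torch Data Type: bfloat16"
    device_map="auto",           # spreads the five sharded safetensors files across GPUs
)

# Illustrative generation call; the prompt and decoding settings are not from the model card.
prompt = "Explain mixture-of-experts routing in one paragraph."
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=128)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```

If 48 GB of VRAM is not available, the same `from_pretrained` call accepts a `quantization_config` (e.g. a 4-bit `BitsAndBytesConfig`) to shrink the memory footprint at some quality cost.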

Best Alternatives to MixtureofMerges MoE 4x7b V5

Best Alternatives                Context / RAM    Downloads    Likes
Dzakwan MoE 4x7b Beta            32K / 48.4 GB        4218         0
Beyonder 4x7B V3                 32K / 48.3 GB        4325        57
Mera Mix 4x7B                    32K / 48.3 GB        4830        18
Calme 4x7B MoE V0.2              32K / 48.3 GB        4884         2
Calme 4x7B MoE V0.1              32K / 48.3 GB        4273         2
CognitiveFusion2 4x7B BF16       32K / 48.3 GB        4881         3
MixtureofMerges MoE 4x7b V4      32K / 48.3 GB        4288         4
NeuralStar FusionWriter 4x7b     32K / 48.3 GB          21         5
LCARS AI 1x4 003 SuperAI         32K / 48.5 GB          55         2
Mixtral 4x7b Slerp               32K / 96.8 GB           1         1
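The download and like counts above are a point-in-time snapshot of Hugging Face Hub metadata (see the attribution at the bottom of this page). The sketch below shows one way to pull current numbers with the huggingface_hub client; only the jsfs11 repository id is given explicitly on this page, so the second id is an assumption based on the display name in the table.

```python
# Fetch live download/like counts from the Hugging Face Hub.
# Counts drift over time, so they will not match the snapshot in the table above.
from huggingface_hub import HfApi

api = HfApi()
repos = [
    "jsfs11/MixtureofMerges-MoE-4x7b-v5",  # listed explicitly above
    "mlabonne/Beyonder-4x7B-v3",           # assumed repo id for "Beyonder 4x7B V3"
]

for repo_id in repos:
    info = api.model_info(repo_id)
    print(f"{repo_id}: {info.downloads} downloads, {info.likes} likes")
```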


Original data from HuggingFace, OpenCompass and various public git repos.
Release v2024072803