CodeLlama 3x7B Dialect Experts Random Gate Base by MrezaPRZ

 ยป  All LLMs  ยป  MrezaPRZ  ยป  CodeLlama 3x7B Dialect Experts Random Gate Base   URL Share it on

  Autotrain compatible   Codegen   Endpoints compatible   Mixtral   Moe   Region:us   Safetensors   Sharded   Tensorflow

CodeLlama 3x7B Dialect Experts Random Gate Base Benchmarks

nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").
CodeLlama 3x7B Dialect Experts Random Gate Base (MrezaPRZ/CodeLlama-3x7B-dialect-experts-random-gate-base)

CodeLlama 3x7B Dialect Experts Random Gate Base Parameters and Internals

LLM NameCodeLlama 3x7B Dialect Experts Random Gate Base
Repository ๐Ÿค—https://huggingface.co/MrezaPRZ/CodeLlama-3x7B-dialect-experts-random-gate-base 
Model Size15.4b
Required VRAM30.7 GB
Updated2025-02-22
MaintainerMrezaPRZ
Model Typemixtral
Model Files  9.9 GB: 1-of-4   10.0 GB: 2-of-4   9.9 GB: 3-of-4   0.9 GB: 4-of-4
Generates CodeYes
Model ArchitectureMixtralForCausalLM
Context Length16384
Model Max Length16384
Transformers Version4.41.2
Tokenizer ClassCodeLlamaTokenizer
Padding Token<s>
Vocabulary Size32016
Torch Data Typebfloat16

Best Alternatives to CodeLlama 3x7B Dialect Experts Random Gate Base

Best Alternatives
Context / RAM
Downloads
Likes
...eLlama 3x7B Dialect Experts It16K / 30.9 GB80
CodeLlama 3x7B Base16K / 30.7 GB90
...lama 3x7B Dialect Experts Base16K / 30.7 GB50

Rank the CodeLlama 3x7B Dialect Experts Random Gate Base Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 43470 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227