Cloudyu Mixtral 34Bx2 MoE 60B EXL2 2.0bpw H8 By dev137: Benchmarks, Features and Detailed Analysis. Insights on Cloudyu Mixtral 34Bx2 MoE 60B EXL2 2.0bpw H8.

Autotrain compatible Endpoints compatible Exl2 Mixtral Moe Quantized Region:us Safetensors Sharded Tensorflow

Model Card on HF 🤗: https://huggingface.co/dev137/cloudyu_Mixtral_34Bx2_MoE_60B-exl2-2.0bpw-h8

Cloudyu Mixtral 34Bx2 MoE 60B EXL2 2.0bpw H8 Benchmarks

^nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").

What is the LLM Explorer Rank (Score)

Cloudyu Mixtral 34Bx2 MoE 60B EXL2 2.0bpw H8 (dev137/cloudyu_Mixtral_34Bx2_MoE_60B-exl2-2.0bpw-h8)

Cloudyu Mixtral 34Bx2 MoE 60B EXL2 2.0bpw H8 Parameters and Internals

Model Type

MoE, causal language model

Use Cases

Areas:

research, non-commercial applications

Applications:

text generation, multilingual applications

Primary Use Cases:

creative writing, storytelling, education

Limitations:

commercial applications

Supported Languages

English (high), Chinese (high)

Input Output

Input Format:

text prompt

Accepted Modalities:

text

Output Format:

generated text

Performance Tips:

For better performance, use a GPU with CUDA support.

LLM Name	Cloudyu Mixtral 34Bx2 MoE 60B EXL2 2.0bpw H8
Repository 🤗	https://huggingface.co/dev137/cloudyu_Mixtral_34Bx2_MoE_60B-exl2-2.0bpw-h8
Model Size	60b
Required VRAM	18.1 GB
Updated	2025-02-05
Maintainer	dev137
Model Type	mixtral
Model Files	8.6 GB: 1-of-3 8.6 GB: 2-of-3 0.9 GB: 3-of-3
Quantization Type	exl2
Model Architecture	MixtralForCausalLM
License	cc-by-nc-4.0
Context Length	200000
Model Max Length	200000
Transformers Version	4.36.2
Tokenizer Class	LlamaTokenizer
Padding Token	<s>
Vocabulary Size	64000
Torch Data Type	bfloat16

Best Alternatives to Cloudyu Mixtral 34Bx2 MoE 60B EXL2 2.0bpw H8

Best Alternatives	Context / RAM	Downloads	Likes
... 34Bx2 MoE 60B 2.65bpw H6 EXL2	195K / 21.2 GB	6	1
...i 34Bx2 MoE 60B 5.0bpw H6 EXL2	195K / 38.8 GB	5	1
...i 34Bx2 MoE 60B 6.0bpw H6 EXL2	195K / 46.2 GB	4	1
...l 34Bx2 MoE 60B 8.0bpw H8 EXL2	195K / 61.3 GB	4	2
...l 34Bx2 MoE 60B 2.4bpw H6 EXL2	195K / 19.3 GB	5	3
... 34Bx2 MoE 60B 4.65bpw H6 EXL2	195K / 36.2 GB	4	1
... 34Bx2 MoE 60B 2.65bpw H6 EXL2	195K / 21.2 GB	4	2
...l 34Bx2 MoE 60B 3.0bpw H6 EXL2	195K / 23.8 GB	8	2
...l 34Bx2 MoE 60B 4.0bpw H6 EXL2	195K / 31.3 GB	5	2
...l 34Bx2 MoE 60B 6.0bpw H6 EXL2	195K / 46.2 GB	6	1

Note: green Score (e.g. "73.2") means that the model is better than dev137/cloudyu_Mixtral_34Bx2_MoE_60B-exl2-2.0bpw-h8.

Rank the Cloudyu Mixtral 34Bx2 MoE 60B EXL2 2.0bpw H8 Capabilities

🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟

Instruction Following and Task Automation
Factuality and Completeness of Knowledge
Censorship and Alignment
Data Analysis and Insight Generation
Text Generation
Text Summarization and Feature Extraction
Code Generation
Multi-Language Support and Translation

What open-source LLMs or SLMs are you in search of? 42625 in total.

Email us: info@extractum.io. Our Privacy Policy | Terms and Conditions | Suggest an improvement.

Our Social Media →

Original data from HuggingFace, OpenCompass and various public git repos.

Release v20241227

Support LLM Explorer

Cloudyu Mixtral 34Bx2 MoE 60B EXL2 2.0bpw H8 by dev137

» All LLMs » dev137 » Cloudyu Mixtral 34Bx2 MoE 60B EXL2 2.0bpw H8 URL Share it on

Cloudyu Mixtral 34Bx2 MoE 60B EXL2 2.0bpw H8 Benchmarks

Cloudyu Mixtral 34Bx2 MoE 60B EXL2 2.0bpw H8 Parameters and Internals

Best Alternatives to Cloudyu Mixtral 34Bx2 MoE 60B EXL2 2.0bpw H8

Rank the Cloudyu Mixtral 34Bx2 MoE 60B EXL2 2.0bpw H8 Capabilities

What open-source LLMs or SLMs are you in search of? 42625 in total.