| LLM Name | TomGrc FusionNet 34Bx2 MoE V0.1 DPO F16 5.0bpw H6 EXL2 |
|---|---|
| Repository 🤗 | https://huggingface.co/LoneStriker/TomGrc_FusionNet_34Bx2_MoE_v0.1_DPO_f16-5.0bpw-h6-exl2 |
| Required VRAM | 38.8 GB |
| Updated | 2025-02-05 |
| Maintainer | LoneStriker |
| Model Type | mixtral |
| Model Files | |
| Quantization Type | fp16\|exl2 |
| Model Architecture | MixtralForCausalLM |
| License | other |
| Context Length | 200000 |
| Model Max Length | 200000 |
| Transformers Version | 4.37.2 |
| Tokenizer Class | LlamaTokenizer |
| Padding Token | `<s>` |
| Vocabulary Size | 64000 |
| Torch Data Type | bfloat16 |
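Because these files are an EXL2 quantization, they are typically loaded with the exllamav2 runtime rather than plain `transformers`. Below is a minimal loading sketch assuming the repository has already been downloaded locally; the directory path and sampler values are placeholders, and the exllamav2 API has shifted between releases, so consult the library's own examples for your version.

```python
# Minimal sketch: run an EXL2 quant with exllamav2 (API may vary by version).
from exllamav2 import ExLlamaV2, ExLlamaV2Config, ExLlamaV2Cache, ExLlamaV2Tokenizer
from exllamav2.generator import ExLlamaV2BaseGenerator, ExLlamaV2Sampler

config = ExLlamaV2Config()
# Placeholder path: wherever you cloned/downloaded the repo above.
config.model_dir = "./TomGrc_FusionNet_34Bx2_MoE_v0.1_DPO_f16-5.0bpw-h6-exl2"
config.prepare()

model = ExLlamaV2(config)
cache = ExLlamaV2Cache(model, lazy=True)  # cache is allocated during load
model.load_autosplit(cache)               # split layers across available GPUs

tokenizer = ExLlamaV2Tokenizer(config)
generator = ExLlamaV2BaseGenerator(model, cache, tokenizer)

settings = ExLlamaV2Sampler.Settings()    # illustrative sampler settings
settings.temperature = 0.7
settings.top_p = 0.9

print(generator.generate_simple("Write a haiku about quantization:",
                                settings, num_tokens=64))
```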
| Best Alternatives | Context / RAM | Downloads | Likes |
|---|---|---|---|
| ...oE V0.1 DPO F16 4.0bpw H6 EXL2 | 195K / 31.3 GB | 6 | 0 |
| ...2 Mixtral 8x22b 6.0bpw H8 EXL2 | 64K / 105.8 GB | 6 | 1 |
| WizardLM 2 8x22 EXL2 4.0bpw | 64K / 70.9 GB | 7 | 1 |
| ...rdLM 2 8x22B Beige EXL2 5.0bpw | 64K / 88.4 GB | 13 | 0 |
| ...M 2 8x22B Beige 4.0bpw H6 EXL2 | 64K / 70.8 GB | 10 | 0 |
| ...M 2 8x22B Beige 3.0bpw H6 EXL2 | 64K / 53.2 GB | 6 | 0 |
| ...M 2 8x22B Beige 5.0bpw H6 EXL2 | 64K / 88.5 GB | 6 | 0 |
| ...M 2 8x22B Beige 2.4bpw H6 EXL2 | 64K / 42.7 GB | 5 | 0 |
| ...B Instruct V0.1 8.0bpw H8 EXL2 | 64K / 120.2 GB | 5 | 1 |
| ...2 Mixtral 8x22b 8.0bpw H8 EXL2 | 64K / 125.1 GB | 7 | 2 |
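The RAM column above scales almost linearly with bits per weight: an EXL2 quant's weight footprint is roughly parameter_count × bpw / 8 bytes. As a hedged sanity check (assuming roughly 61B total parameters for this 2x34B MoE, a figure not stated in the table):

```python
# Back-of-the-envelope check on the size figures above.
# The ~61B total-parameter count for a 2x34B MoE is an assumption.
def exl2_weight_gb(params_billion: float, bpw: float) -> float:
    """Approximate weight size in GB for an EXL2 quant at a given bpw."""
    return params_billion * bpw / 8

for bpw in (4.0, 5.0):
    print(f"{bpw} bpw -> ~{exl2_weight_gb(61, bpw):.1f} GB")
# 4.0 bpw -> ~30.5 GB (table lists 31.3 GB)
# 5.0 bpw -> ~38.1 GB (spec lists 38.8 GB; the gap is embeddings and
# other tensors kept at higher precision, plus format overhead)
```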