| Property | Value |
|---|---|
| LLM Name | Calme 4x7B MoE V0.1 |
| Repository 🤗 | https://huggingface.co/MaziyarPanahi/Calme-4x7B-MoE-v0.1 |
| Model Name | Calme-4x7B-MoE-v0.1 |
| Model Creator | MaziyarPanahi |
| Model Size | 24.2b |
| Required VRAM | 48.3 GB |
| Updated | 2025-03-13 |
| Maintainer | MaziyarPanahi |
| Model Type | mixtral |
| Model Files | |
| Model Architecture | MixtralForCausalLM |
| License | apache-2.0 |
| Context Length | 32768 |
| Model Max Length | 32768 |
| Transformers Version | 4.37.2 |
| Tokenizer Class | LlamaTokenizer |
| Padding Token | `<s>` |
| Vocabulary Size | 32000 |
| Torch Data Type | bfloat16 |
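Given the details above (MixtralForCausalLM architecture, 32K context, bfloat16 weights, Transformers 4.37.2), the checkpoint can be loaded with the Hugging Face transformers library. Below is a minimal sketch, assuming enough GPU memory for the listed 48.3 GB of weights and the accelerate package installed for `device_map`:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "MaziyarPanahi/Calme-4x7B-MoE-v0.1"

# The LlamaTokenizer listed on the card is resolved automatically
# from the repo's tokenizer config.
tokenizer = AutoTokenizer.from_pretrained(model_id)

# bfloat16 matches the checkpoint's stated torch dtype;
# device_map="auto" spreads the ~48 GB of weights across available GPUs.
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,
    device_map="auto",
)

prompt = "Explain what a mixture-of-experts model is."
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=128)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```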
| Model | Likes | Downloads | VRAM |
|---|---|---|---|
| Calme 4x7B MoE V0.1 GGUF | 0 | 251 | 8 GB |
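For hardware without ~48 GB of VRAM, the GGUF quantization listed above runs in roughly 8 GB via llama.cpp. A minimal sketch using the llama-cpp-python bindings follows; the local filename is a hypothetical placeholder, since the card does not name the quantized files:

```python
from llama_cpp import Llama

# Hypothetical path to a downloaded GGUF file of this model;
# the exact filename depends on the chosen quantization level.
llm = Llama(
    model_path="./Calme-4x7B-MoE-v0.1.Q4_K_M.gguf",  # assumed filename
    n_ctx=32768,      # matches the card's context length
    n_gpu_layers=-1,  # offload all layers to GPU if one is available
)

out = llm("Explain what a mixture-of-experts model is.", max_tokens=128)
print(out["choices"][0]["text"])
```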
| Best Alternatives | Context / RAM | Downloads | Likes |
|---|---|---|---|
| Dzakwan MoE 4x7b Beta | 32K / 48.4 GB | 2022 | 0 |
| Proto Athena 4x7B | 32K / 48.4 GB | 15 | 0 |
| Proto Athena V0.2 4x7B | 32K / 48.4 GB | 12 | 0 |
| Calme 4x7B MoE V0.2 | 32K / 48.3 GB | 3653 | 2 |
| Beyonder 4x7B V3 | 32K / 48.3 GB | 2081 | 58 |
| Mera Mix 4x7B | 32K / 48.3 GB | 1771 | 18 |
| CognitiveFusion2 4x7B BF16 | 32K / 48.3 GB | 1779 | 3 |
| ...e 4x7B MoE ECE PRYMMAL Martial | 32K / 48.6 GB | 30 | 1 |
| MixtureofMerges MoE 4x7b V5 | 32K / 48.3 GB | 141 | 1 |
| LCARS AI 1x4 003 SuperAI | 32K / 48.5 GB | 69 | 2 |