Llama4Some SOVL 4x8B L3 V1 by saishf

 ยป  All LLMs  ยป  saishf  ยป  Llama4Some SOVL 4x8B L3 V1   URL Share it on

  Merged Model   Arxiv:2401.04088   Autotrain compatible Base model:saishf/ortho-sovl-8... Base model:saishf/sovlish-maid...   Conversational   Endpoints compatible   License:cc-by-nc-4.0   Mixtral   Moe   Region:us   Safetensors   Sharded   Tensorflow

Rank the Llama4Some SOVL 4x8B L3 V1 Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  
Llama4Some SOVL 4x8B L3 V1 (saishf/Llama4Some-SOVL-4x8B-L3-V1)

Best Alternatives to Llama4Some SOVL 4x8B L3 V1

Best Alternatives
HF Rank
Context/RAM
Downloads
Likes
L3 ChaoticSoliloquy V1.5 4x8B8K / 49.9 GB318
L3 SnowStorm 4x8B8K / 49.9 GB02
Llama 3 4x8B8K / 49.9 GB5410
... ChaoticSoliloquy V2 4x8B Test8K / 49.9 GB110
...ama 3 Aplite Instruct 4x8B MoE8K / 50 GB233938
ChaoticSoliloquy 4x8B8K / 50 GB16024
L3 Badger Mushroom 4x8b8K / 50 GB11433
Skyro 4X8B8K / 50 GB19241
Ionic2 4x88K / 50 GB1720
Llama 3 8B Instruct MoE 28K / 50 GB820

Llama4Some SOVL 4x8B L3 V1 Parameters and Internals

LLM NameLlama4Some SOVL 4x8B L3 V1
RepositoryOpen on ๐Ÿค— 
Base Model(s)  Ortho SOVL 8B L3   SOVLish Maid L3 8B   Merge Mayhem L3 V2.1   Merge Mayhem L3 V2   saishf/Ortho-SOVL-8B-L3   saishf/SOVLish-Maid-L3-8B   saishf/Merge-Mayhem-L3-V2.1   saishf/Merge-Mayhem-L3-V2
Merged ModelYes
Model Size24.9b
Required VRAM50 GB
Updated2024-05-22
Maintainersaishf
Model Typemixtral
Model Files  9.9 GB: 1-of-6   10.0 GB: 2-of-6   10.0 GB: 3-of-6   9.9 GB: 4-of-6   9.1 GB: 5-of-6   1.1 GB: 6-of-6
Model ArchitectureMixtralForCausalLM
Licensecc-by-nc-4.0
Context Length8192
Model Max Length8192
Transformers Version4.40.1
Tokenizer ClassPreTrainedTokenizerFast
Padding Token<|begin_of_text|>
Vocabulary Size128256
Initializer Range0.02
Torch Data Typebfloat16

What open-source LLMs or SLMs are you in search of? 35549 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v2024042801