Mixtral AI CyberTron Ultra by LeroyDyer

 ยป  All LLMs  ยป  LeroyDyer  ยป  Mixtral AI CyberTron Ultra   URL Share it on

  Autotrain compatible Base model:leroydyer/mixtral a...   Code   Conversational   Cyber-series Dataset:cognitivecomputations/... Dataset:databricks/databricks-... Dataset:gretelai/synthetic tex... Dataset:huggingfacetb/cosmoped... Dataset:ise-uiuc/magicoder-evo...   Dataset:meta-math/metamathqa Dataset:mwitiderrick/swahilipl...   Dataset:open-orca/openorca   Dataset:open-orca/slimorca Dataset:rogendo/english-swahil...   Dataset:swahili   Dataset:teknium/openhermes-2.5   Dataset:uonlp/culturax   Dataset:yahma/alpaca-cleaned   Doctor   En   Endpoints compatible   Farmer   Instruct   License:apache-2.0   Medical   Mega-series   Milestone   Mistral   Region:us   Role-play   Safetensors   Self-rag   Sharded   Spydazwebai   Tensorflow   Thinkingbot   Trl   Unsloth

Mixtral AI CyberTron Ultra Benchmarks

Mixtral AI CyberTron Ultra Parameters and Internals

Model Type 
text-generation-inference, fine-tuned
Use Cases 
Areas:
medical, farmer, doctor, Mega-Series, Cyber-Series, Role-Play, Self-Rag, ThinkingBot
Additional Notes 
The model is highly tuned for text generation, role-play, and can maintain personas. It is designed for strategic merging and tuning to maintain different capabilities separately.
Supported Languages 
en (Proficient)
Training Details 
Data Sources:
gretelai/synthetic_text_to_sql, HuggingFaceTB/cosmopedia, teknium/OpenHermes-2.5, Open-Orca/SlimOrca, Open-Orca/OpenOrca, cognitivecomputations/dolphin-coder, databricks/databricks-dolly-15k, yahma/alpaca-cleaned, uonlp/CulturaX, mwitiderrick/SwahiliPlatypus, Rogendo/English-Swahili-Sentence-Pairs, ise-uiuc/Magicoder-Evol-Instruct-110K, meta-math/MetaMathQA
Methodology:
fine-tuning, unsloth
Context Length:
32000
Input Output 
Accepted Modalities:
text
LLM NameMixtral AI CyberTron Ultra
Repository ๐Ÿค—https://huggingface.co/LeroyDyer/Mixtral_AI_CyberTron_Ultra 
Base Model(s)  Mixtral AI CyberTron Ultra   LeroyDyer/Mixtral_AI_CyberTron_Ultra
Model Size7.2b
Required VRAM14.4 GB
Updated2024-07-04
MaintainerLeroyDyer
Model Typemistral
Instruction-BasedYes
Model Files  1.9 GB: 1-of-8   1.9 GB: 2-of-8   2.0 GB: 3-of-8   1.9 GB: 4-of-8   2.0 GB: 5-of-8   1.9 GB: 6-of-8   2.0 GB: 7-of-8   0.8 GB: 8-of-8
Supported Languagesen
Model ArchitectureMistralForCausalLM
Licenseapache-2.0
Context Length32768
Model Max Length32768
Transformers Version4.38.2
Tokenizer ClassLlamaTokenizer
Padding Token<unk>
Vocabulary Size32000
Torch Data Typefloat16
Mixtral AI CyberTron Ultra (LeroyDyer/Mixtral_AI_CyberTron_Ultra)

Best Alternatives to Mixtral AI CyberTron Ultra

Best Alternatives
Context / RAM
Downloads
Likes
...zWeb AI LCARS Humanization 003512K / 14.4 GB100
...AI ChatQA Reasoning101 Project512K / 14.4 GB301
Spydaz Web AI ChatQA 007512K / 14.4 GB111
...A ReAct Project UltraFineTuned512K / 14.4 GB181
Spydaz Web AI 08512K / 14.5 GB761
Spydaz Web AI ChatQA 006512K / 14.4 GB101
Spydaz Web AI ChatQA 004512K / 14.4 GB21
Spydaz Web AI ChatQA 001 UFT512K / 14.4 GB241
Spydaz Web AI ChatQA 001 SFT512K / 14.4 GB231
Spydaz Web AI 010512K / 14.5 GB140

Rank the Mixtral AI CyberTron Ultra Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 38149 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241110