Mistral 3B Instruct V0.2 Init by Aryanne

  Autotrain compatible   Conversational   Gguf   Instruct   Mistral   Q3   Quantized   Region:us   Safetensors   Sharded   Tensorflow

Mistral 3B Instruct V0.2 Init Benchmarks

nn.n%: how the model compares to the reference models Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o"), or GPT-4 ("gpt4").
Mistral 3B Instruct V0.2 Init (Aryanne/Mistral-3B-Instruct-v0.2-init)

Mistral 3B Instruct V0.2 Init Parameters and Internals

Additional Notes
This version reduces feed_forward_length from 14336 to 3072.
LLM Name: Mistral 3B Instruct V0.2 Init
Repository: https://huggingface.co/Aryanne/Mistral-3B-Instruct-v0.2-init
Base Model(s): sanchit-gandhi/Mistral-3B-Instruct-v0.2
Model Size: 3b
Required VRAM: 5.7 GB
Updated: 2024-12-22
Maintainer: Aryanne
Model Type: mistral
Instruction-Based: Yes
Model Files: 2.0 GB (1-of-3), 1.9 GB (2-of-3), 1.8 GB (3-of-3), 1.4 GB
GGUF Quantization: Yes
Quantization Type: gguf|q3
Model Architecture: MistralForCausalLM
License: apache-2.0
Context Length: 32768
Model Max Length: 32768
Transformers Version: 4.35.2
Tokenizer Class: LlamaTokenizer
Vocabulary Size: 32000
Torch Data Type: bfloat16
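
The effect of the smaller feed-forward width on model size can be checked with a little arithmetic. The sketch below counts parameters for a Mistral-style decoder. Only the vocabulary size (32000), the two feed-forward widths (14336 and 3072), and the bfloat16 data type come from this page; the hidden size, layer count, and head counts are assumed to match the published Mistral 7B configuration, and the function name is hypothetical.

```python
# Rough parameter count for a Mistral-style decoder-only model.
# ASSUMPTION: hidden_size, num_layers, num_heads and num_kv_heads are the
# published Mistral 7B values; this page does not state them.
def mistral_param_count(intermediate_size, hidden_size=4096, num_layers=32,
                        num_heads=32, num_kv_heads=8, vocab_size=32000):
    head_dim = hidden_size // num_heads
    kv_dim = num_kv_heads * head_dim
    # q_proj and o_proj are hidden x hidden; k_proj and v_proj are hidden x kv_dim.
    attention = 2 * hidden_size * hidden_size + 2 * hidden_size * kv_dim
    # gate_proj, up_proj and down_proj each hold hidden x intermediate weights.
    mlp = 3 * hidden_size * intermediate_size
    # Two RMSNorm weight vectors per layer.
    norms = 2 * hidden_size
    per_layer = attention + mlp + norms
    # Input embedding plus an untied output head, plus the final norm.
    embeddings = 2 * vocab_size * hidden_size
    return num_layers * per_layer + embeddings + hidden_size


BYTES_PER_PARAM = 2  # bfloat16

for width in (14336, 3072):
    params = mistral_param_count(width)
    gigabytes = params * BYTES_PER_PARAM / 1e9
    print(f"feed_forward_length={width}: {params / 1e9:.2f}B params, "
          f"about {gigabytes:.1f} GB in bfloat16")
```

Under these assumptions the reduced width gives roughly 2.8 billion parameters, or about 5.6 GB of bfloat16 weights, which is in line with the 3b model size and the 5.7 GB required VRAM listed above.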

Best Alternatives to Mistral 3B Instruct V0.2 Init

Best Alternatives   Context / RAM   Downloads, Likes
Phi 3 Phituguese 3B FP16   4K / 7.6 GB   210
Ministral 3B Instruct   128K / 6.7 GB   699435
Mistral 3B Instruct V0.2   32K / 11.5 GB   774
...inerva 3B Llama3 Instruct V0.1   16K / 5.8 GB   31700
Minerva 3B Instruct V1.0   16K / 5.8 GB   637

Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241217