Mistral 300M by ayousanz


Autotrain compatible, Endpoints compatible, Mistral, Region:us, Safetensors


Mistral 300M Parameters and Internals

Model Type: mistral
Additional Notes: The author acknowledges the resources provided by the ‘Local AI Hackathon’ organized by ‘ローカルLLMに向き合う会’.
Supported Languages: Japanese (primary)
Training Details
  Data Sources: wiki.txt
  Context Length: 1024
  Hardware Used: A5000 × 7
  Model Architecture: MistralForCausalLM
Input Output
  Accepted Modalities: text
  Output Format: text
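
Since the model takes text in and returns text out through a causal language model head, a loading sketch may help. The following is a minimal example, assuming the Hugging Face `transformers` library (plus `torch` and `sentencepiece`) is installed and that the repository `ayousanz/mistral-300M` loads through the generic `Auto*` classes. The helper name `generate`, the default of 32 new tokens, and the choice to skip special tokens on input are illustrative assumptions, not settings documented by the model author.

```python
REPO_ID = "ayousanz/mistral-300M"  # repository id as listed on this page


def generate(prompt: str, max_new_tokens: int = 32) -> str:
    """Return the prompt plus a generated continuation.

    Hypothetical helper: downloads the tokenizer and weights on first use.
    """
    # Imported lazily so this module can be imported without transformers.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(REPO_ID)
    model = AutoModelForCausalLM.from_pretrained(REPO_ID)

    # Assumption: do not let the tokenizer append special tokens (such as an
    # end-of-sequence marker) to the prompt, so the model continues the text.
    inputs = tokenizer(prompt, return_tensors="pt", add_special_tokens=False)
    output_ids = model.generate(
        input_ids=inputs["input_ids"],
        attention_mask=inputs["attention_mask"],
        max_new_tokens=max_new_tokens,
    )
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `generate("日本の首都は")` would fetch the files from the Hugging Face Hub and return a Japanese continuation. Output quality for a model of this size trained on a single wiki dump is likely to be limited.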
LLM Name: Mistral 300M
Repository: https://huggingface.co/ayousanz/mistral-300M
Model Size: 300M
Required VRAM: 0 GB
Updated: 2025-02-22
Maintainer: ayousanz
Model Type: mistral
Model Files: 1.4 GB, 0.7 GB, 0.0 GB, 0.0 GB
Model Architecture: MistralForCausalLM
License: apache-2.0
Context Length: 4096
Model Max Length: 4096
Transformers Version: 4.35.2
Tokenizer Class: T5Tokenizer
Padding Token: [PAD]
Vocabulary Size: 50257
Torch Data Type: float32
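
The listed size, data type, and file sizes can be cross-checked with simple arithmetic. The sketch below assumes weights dominate the file size, that 1 GB means 10^9 bytes, and that the 1.4 GB file holds float32 weights. None of these is stated on this page, and the function names are hypothetical.

```python
# Bytes needed to store one parameter in common tensor data types.
BYTES_PER_PARAM = {"float32": 4, "float16": 2, "bfloat16": 2, "int8": 1}


def weight_size_gb(num_params: int, dtype: str = "float32") -> float:
    """Approximate size of the weights in GB (10^9 bytes)."""
    return num_params * BYTES_PER_PARAM[dtype] / 1e9


def params_from_file_size(size_gb: float, dtype: str = "float32") -> float:
    """Approximate parameter count implied by a weight file of `size_gb`."""
    return size_gb * 1e9 / BYTES_PER_PARAM[dtype]


# A nominal 300M-parameter model in float32 needs about 1.2 GB for weights.
nominal_gb = weight_size_gb(300_000_000, "float32")

# Under the float32 assumption, a 1.4 GB file implies roughly 350M parameters.
implied_params = params_from_file_size(1.4, "float32")
```

Under these assumptions the 1.4 GB file corresponds to roughly 350M parameters, so the "300M" label is best read as a rounded, nominal size. The "Required VRAM: 0 GB" entry does not reflect the memory needed to hold the weights.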

Best Alternatives to Mistral 300M

Best Alternatives                    Context / RAM   Downloads/Likes
Lite Oute 1 300M Instruct            4K / 1.2 GB     48510
Lite Oute 1 300M                     4K / 1.2 GB     3547
...anese Mistral 300M Instruction    4K / 1.4 GB     1223
Japanese Mistral 300M Base           4K / 2.8 GB     1283

Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227