TinyMistral 248M by Locutusque: Benchmarks, Features and Detailed Analysis

Tags: Autotrain compatible, Dataset: jeankaddour/minipile, Dataset: skylion007/openwebtext, En, Endpoints compatible, Mistral, PyTorch, Region: us, Safetensors

Model Card on HF 🤗: https://huggingface.co/Locutusque/TinyMistral-248M

TinyMistral 248M Benchmarks

LLME Score: 0.17113

TinyMistral 248M Parameters and Internals

Model Type

text-generation

Use Cases

Primary Use Cases:

fine-tuning on a downstream task

Additional Notes

This model aims to prove that trillion-scale datasets are not necessary for language model pretraining.

Supported Languages

en (fluent)

Training Details

Data Sources:

Skylion007/openwebtext, JeanKaddour/minipile

Data Volume:

7,488,000 examples

Context Length:

32768

Hardware Used:

single GPU (Titan V)

Input Output

Accepted Modalities:

text

Output Format:

text-generation
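
The text-in, text-out interface described above could be exercised roughly as follows. This is a minimal sketch, assuming the `transformers` and `torch` packages are installed and that the repository loads through the generic `AutoTokenizer` / `AutoModelForCausalLM` classes; the helper name `generate_text` and its default of 50 new tokens are illustrative choices, not something the listing specifies.

```python
MODEL_ID = "Locutusque/TinyMistral-248M"  # repository name taken from the listing


def generate_text(prompt: str, max_new_tokens: int = 50) -> str:
    """Return the prompt plus a generated continuation.

    Hypothetical helper for illustration. The first call downloads the
    model files from the Hugging Face Hub.
    """
    # Imported lazily so this module can be imported without the packages.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(MODEL_ID)
    model.eval()

    inputs = tokenizer(prompt, return_tensors="pt")
    with torch.no_grad():
        # Pass only the tensors generate() needs.
        output_ids = model.generate(
            input_ids=inputs["input_ids"],
            attention_mask=inputs["attention_mask"],
            max_new_tokens=max_new_tokens,
        )
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `generate_text("Once upon a time")` would return the prompt followed by a continuation. Since the listing names fine-tuning on a downstream task as the primary use case, output from the base checkpoint is best treated as a smoke test rather than a quality measure.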

LLM Name	TinyMistral 248M
Repository 🤗	https://huggingface.co/Locutusque/TinyMistral-248M
Model Size	248M
Required VRAM	1 GB
Updated	2025-02-05
Maintainer	Locutusque
Model Type	mistral
Model Files	1.0 GB, 1.0 GB
Supported Languages	en
Model Architecture	MistralForCausalLM
License	apache-2.0
Context Length	32768
Model Max Length	32768
Transformers Version	4.35.0
Tokenizer Class	LlamaTokenizer
Padding Token	[PAD]
Vocabulary Size	32005
Torch Data Type	float16
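
As a rough cross-check of the size figures in the table, weight memory can be estimated as parameter count times bytes per parameter. The sketch below assumes "248m" means exactly 248 million parameters and counts weights only (no activations, KV cache, or framework overhead); the function name and the dtype table are illustrative.

```python
# Bytes used to store one parameter in each data type.
BYTES_PER_PARAM = {"float32": 4, "float16": 2, "int8": 1}

# Assumption: the listing's "248m" is read as exactly 248 million parameters.
N_PARAMS = 248_000_000


def estimate_weight_gb(n_params: int, dtype: str) -> float:
    """Estimate weight storage in decimal gigabytes (1 GB = 1e9 bytes)."""
    return n_params * BYTES_PER_PARAM[dtype] / 1e9


for dtype in BYTES_PER_PARAM:
    print(f"{dtype}: {estimate_weight_gb(N_PARAMS, dtype):.3f} GB")
```

Under these assumptions the float32 estimate comes out close to the listed 1.0 GB file size, while float16 is about half that. The listing does not say which precision the stored files use, so this comparison is only suggestive.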

Best Alternatives to TinyMistral 248M

Best Alternatives	Context / RAM	Downloads	Likes
TinyMistral 248M V2.5	32K / 1 GB	290	27
TinyMistral 248M V2	32K / 1 GB	1293	17
...adrin TinyMistral248M Instruct	32K / 0.5 GB	1367	6
TinyMistral 248M V2.5 Instruct	32K / 1 GB	7	11
...al V2.5 MiniPile Guidelines E1	32K / 0.6 GB	5	2
TinyMistral V2 Test1	32K / 1 GB	18	1
TinyMistral 248M 8bits	32K / 0.3 GB	22	1
Tinymistv1	32K / 0.5 GB	17	0
TinyMistral Haiku	32K / 1 GB	205	0
TinyMistral 248M Instruct	32K / 1 GB	23	11

Original data from HuggingFace, OpenCompass and various public git repos.

Release v20241227