Theory Of Mind Mistral by jeiku


Tags: 4-bit, Adapter, Base model:adapter:mistralai/m..., Base model:mistralai/mistral-7..., Bitsandbytes, Finetuned, Generated from trainer, LoRA, Mistral, PEFT, Region:us

Theory Of Mind Mistral Benchmarks

nn.n% shows how the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").

Theory Of Mind Mistral Parameters and Internals

Model Type: MistralForCausalLM
Training Details
Data Sources: theory_of_mind_airoboros_fixed.json
Context Length: 2048
LLM Name: Theory Of Mind Mistral
Repository: https://huggingface.co/jeiku/Theory_of_Mind_Mistral
Base Model(s): mistralai/Mistral-7B-v0.1
Model Size: 7b
Required VRAM: 1.3 GB
Updated: 2025-03-14
Maintainer: jeiku
Model Files: 1.3 GB
Model Architecture: Adapter
License: apache-2.0
Is Biased: none
Tokenizer Class: LlamaTokenizer
Padding Token: </s>
PEFT Type: LORA
LoRA Model: Yes
PEFT Target Modules: v_proj|o_proj|q_proj|down_proj|k_proj|up_proj|gate_proj
LoRA Alpha: 256
LoRA Dropout: 0.05
R Param: 128
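
The LoRA settings above can be sanity-checked against the listed 1.3 GB file size with a little arithmetic. The sketch below is illustrative only: the layer shapes are assumed from the published Mistral-7B-v0.1 configuration (hidden size 4096, MLP size 14336, 32 layers, 8 key/value heads of dimension 128), and 4 bytes per weight (fp32 storage) is also an assumption, since this listing states neither. The helper names are hypothetical.

```python
# Values taken from the listing above.
R = 128
ALPHA = 256
TARGET_MODULES = "v_proj|o_proj|q_proj|down_proj|k_proj|up_proj|gate_proj".split("|")

# ASSUMPTION: base-model shapes from the Mistral-7B-v0.1 config (not in this listing).
HIDDEN = 4096
INTERMEDIATE = 14336
NUM_LAYERS = 32
KV_DIM = 8 * 128  # 8 key/value heads, head dimension 128

# (in_features, out_features) of each linear layer that receives an adapter.
SHAPES = {
    "q_proj": (HIDDEN, HIDDEN),
    "k_proj": (HIDDEN, KV_DIM),
    "v_proj": (HIDDEN, KV_DIM),
    "o_proj": (HIDDEN, HIDDEN),
    "gate_proj": (HIDDEN, INTERMEDIATE),
    "up_proj": (HIDDEN, INTERMEDIATE),
    "down_proj": (INTERMEDIATE, HIDDEN),
}


def lora_params(in_features, out_features, r):
    """Weights in one LoRA pair: A is (r x in_features), B is (out_features x r)."""
    return r * (in_features + out_features)


def adapter_param_count(target_modules=TARGET_MODULES, r=R, num_layers=NUM_LAYERS):
    """Total adapter weights when every listed module in every layer is adapted."""
    per_layer = sum(lora_params(*SHAPES[name], r) for name in target_modules)
    return per_layer * num_layers


total = adapter_param_count()
scaling = ALPHA / R            # LoRA scales the low-rank update by alpha / r
size_gb = total * 4 / 1e9      # ASSUMPTION: 4 bytes per weight (fp32)

print(f"adapter parameters: {total:,}")
print(f"LoRA scaling factor: {scaling}")
print(f"approximate size: {size_gb:.2f} GB")
```

Under these assumptions the adapter holds 335,544,320 weights, about 1.34 GB, which is in line with the 1.3 GB shown for Model Files. The base model weights are not included in that figure.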

Best Alternatives to Theory Of Mind Mistral

Best Alternatives
Context / RAM
Downloads
Likes
Qwen Megumin  0K / 0.1 GB  130
...s 25 Mistral 7B Irca DPO Pairs  0K / 0.1 GB  90
Qwen1.5 7B Chat Sa V0.1  0K / 0 GB  140
Zephyr 7B Ipo 0K 15K I1  0K / 0.7 GB  80
Deepthink Reasoning Adapter  0K / 0.2 GB  5611
Deepseek Llm 7B Chat Sa V0.1  0K / 0 GB  50
... Days Of Sodom LoRA Mistral 7B  0K / 0.2 GB  70
Mistral 7B Instruct Sa V0.1  0K / 0 GB  60
Qwen2.5 7b NotesCorrector  0K / 0.6 GB  120
CodeAstra 7B  0K / 0 GB  75810


Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227