| Field | Value |
|---|---|
| LLM Name | Theory Of Mind Mistral |
| Repository 🤗 | https://huggingface.co/jeiku/Theory_of_Mind_Mistral |
| Base Model(s) | |
| Model Size | 7b |
| Required VRAM | 1.3 GB |
| Updated | 2025-03-14 |
| Maintainer | jeiku |
| Model Files | |
| Model Architecture | Adapter |
| License | apache-2.0 |
| Is Biased | none |
| Tokenizer Class | LlamaTokenizer |
| Padding Token | `</s>` |
| PEFT Type | LORA |
| LoRA Model | Yes |
| PEFT Target Modules | v_proj, o_proj, q_proj, down_proj, k_proj, up_proj, gate_proj |
| LoRA Alpha | 256 |
| LoRA Dropout | 0.05 |
| R Param | 128 |
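The adapter hyperparameters above (r=128, lora_alpha=256, dropout 0.05, and the listed target modules) map directly onto a PEFT `LoraConfig`. Below is a minimal sketch of loading the published adapter onto a Mistral 7B base with `peft` and `transformers`; the base-model ID `mistralai/Mistral-7B-v0.1` is an assumption, since the card's Base Model(s) field is empty, and the short prompt is only illustrative.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import LoraConfig, PeftModel

# LoraConfig mirroring the hyperparameters listed on the card
# (useful as a reference if you want to retrain a similar adapter).
lora_config = LoraConfig(
    r=128,                                              # R Param
    lora_alpha=256,                                     # LoRA Alpha
    lora_dropout=0.05,                                  # LoRA Dropout
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj",
                    "gate_proj", "up_proj", "down_proj"],  # PEFT Target Modules
    task_type="CAUSAL_LM",
)

# Load a base checkpoint and attach the adapter. The base model ID below is an
# assumption -- the card does not name the base model explicitly.
base = AutoModelForCausalLM.from_pretrained(
    "mistralai/Mistral-7B-v0.1",
    torch_dtype=torch.float16,
    device_map="auto",
)
# The card lists LlamaTokenizer with </s> as the padding token, so the tokenizer
# is assumed to be shipped in the adapter repository.
tokenizer = AutoTokenizer.from_pretrained("jeiku/Theory_of_Mind_Mistral")
model = PeftModel.from_pretrained(base, "jeiku/Theory_of_Mind_Mistral")

prompt = "Anna hides her keys in a drawer and leaves. Where will she look for them later?"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
output = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```

If you prefer a single merged checkpoint for inference, `model.merge_and_unload()` folds the LoRA weights into the base model after loading.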
| Best Alternatives | Context / RAM | Downloads | Likes |
|---|---|---|---|
| Qwen Megumin | 0K / 0.1 GB | 13 | 0 |
| ...s 25 Mistral 7B Irca DPO Pairs | 0K / 0.1 GB | 9 | 0 |
| Qwen1.5 7B Chat Sa V0.1 | 0K / 0 GB | 14 | 0 |
| Zephyr 7B Ipo 0K 15K I1 | 0K / 0.7 GB | 8 | 0 |
| Deepthink Reasoning Adapter | 0K / 0.2 GB | 56 | 11 |
| Deepseek Llm 7B Chat Sa V0.1 | 0K / 0 GB | 5 | 0 |
| ... Days Of Sodom LoRA Mistral 7B | 0K / 0.2 GB | 7 | 0 |
| Mistral 7B Instruct Sa V0.1 | 0K / 0 GB | 6 | 0 |
| Qwen2.5 7b NotesCorrector | 0K / 0.6 GB | 12 | 0 |
| CodeAstra 7B | 0K / 0 GB | 758 | 10 |