ChatWaifu V2.0 22B by spow12

 »  All LLMs  »  spow12  »  ChatWaifu V2.0 22B   URL Share it on

  Autotrain compatible Base model:finetune:mistralai/... Base model:mistralai/mistral-s...   Conversational Dataset:aixsatoshi swallow mx ... Dataset:antiven0m physical rea... Dataset:aratako/synthetic-japa... Dataset:aratako/synthetic-japa... Dataset:aratako/synthetic-jp-e... Dataset:aratako rosebleu 1on1 ... Dataset:argilla capybara prefe... Dataset:flammenai character ro... Dataset:gryphe/sonnet3.5-slimo...   Dataset:jondurbi py dpo Dataset:jondurbin gutenberg dp...   Dataset:jondurbin truthy dpo Dataset:kalomaze/opus instruct...   Dataset:kyujinpy orca math dpo Dataset:nbeerbower gutenberg2 ... Dataset:roleplay4fun/aesir-v1.... Dataset:skunkworksai/reasoning...   En   Endpoints compatible   Instruct   Ja   Merge   Mergekit   Mistral   Model-index   Nsfw   Region:us   Roleplay   Safetensors   Sharded   Tensorflow   Visual novel

ChatWaifu V2.0 22B Benchmarks

nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").
ChatWaifu V2.0 22B (spow12/ChatWaifu_v2.0_22B)

ChatWaifu V2.0 22B Parameters and Internals

Model Type 
CausalLM
Additional Notes 
The model may generate NSFW content.
Supported Languages 
japanese (NLP)
Training Details 
Data Sources:
Riddle Joker (Private), Café Stella and the Reaper's Butterflies (Private), Senren*Banka (Private), roleplay4fun/aesir-v1.1, kalomaze/Opus_Instruct_3k, Gryphe/Sonnet3.5-SlimOrcaDedupCleaned, Aratako/Synthetic-JP-EN-Coding-Dataset-567k, Aratako/Synthetic-Japanese-Roleplay-gpt-4o-mini-39.6k-formatted, Aratako/Synthetic-Japanese-Roleplay-NSFW-Claude-3.5s-15.3k-formatted, Aratako_Rosebleu_1on1_Dialogues_RP, SkunkworksAI/reasoning-0.01, jondurbin_gutenberg_dpo, nbeerbower_gutenberg2_dpo, jondurbi_py_dpo, jondurbin_truthy_dpo, flammenai_character_roleplay_DPO, kyujinpy_orca_math_dpo, argilla_Capybara_Preferences, antiven0m_physical_reasoning_dpo, aixsatoshi_Swallow_MX_chatbot_DPO
Release Notes 
Version:
12B and 22B Ver 2.0
Date:
2024.10.11
Notes:
Update 12B and 22B Ver 2.0
Version:
22B Ver 2.0_preview
Date:
2024.09.23
Notes:
Update 22B, Ver 2.0_preview
LLM NameChatWaifu V2.0 22B
Repository 🤗https://huggingface.co/spow12/ChatWaifu_v2.0_22B 
Base Model(s)  Mistral Small Instruct 2409   mistralai/Mistral-Small-Instruct-2409
Model Size22b
Required VRAM44.7 GB
Updated2025-02-22
Maintainerspow12
Model Typemistral
Instruction-BasedYes
Model Files  4.9 GB: 1-of-9   5.0 GB: 2-of-9   5.0 GB: 3-of-9   4.9 GB: 4-of-9   5.0 GB: 5-of-9   5.0 GB: 6-of-9   4.9 GB: 7-of-9   5.0 GB: 8-of-9   5.0 GB: 9-of-9
Supported Languagesen ja
Model ArchitectureMistralForCausalLM
Licensecc-by-nc-4.0
Context Length32768
Model Max Length32768
Transformers Version4.45.1
Tokenizer ClassLlamaTokenizer
Padding Token<unk>
Vocabulary Size32768
Torch Data Typebfloat16

Best Alternatives to ChatWaifu V2.0 22B

Best Alternatives
Context / RAM
Downloads
Likes
MS Schisandra 22B V0.2128K / 44.7 GB409
...ntheon RP Pure 1.6.2 22B Small128K / 44.7 GB12827
MS Meadowlark 22B128K / 44.7 GB12513
... V4x1.6.2RP Cydonia VXXX 22B 8128K / 44.7 GB3055
... V4x1.6.2RP Cydonia VXXX 22B 6128K / 44.7 GB2862
Beeper King 22B128K / 44.7 GB317
MS Moingooistral 2409 22B128K / 44.7 GB340
MS Dampf 2409 22B128K / 44.7 GB280
MS A Coolyte 2409 22B128K / 44.7 GB280
MS Fujin 2409 22B128K / 44.7 GB270
Note: green Score (e.g. "73.2") means that the model is better than spow12/ChatWaifu_v2.0_22B.

Rank the ChatWaifu V2.0 22B Capabilities

🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 43470 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227