Kitsunebi V1 Gemma2 8K 9B by grimjim

 ยป  All LLMs  ยป  grimjim  ยป  Kitsunebi V1 Gemma2 8K 9B   URL Share it on

  Merged Model   Arxiv:2405.14734   Arxiv:2406.14491   Autotrain compatible Base model:axcxept/ezo-common-... Base model:princeton-nlp/gemma...   Conversational   Endpoints compatible   Ext 8k   Gemma2   Region:us   Safetensors   Sharded   Tensorflow

Kitsunebi V1 Gemma2 8K 9B Benchmarks

nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").
Kitsunebi V1 Gemma2 8K 9B (grimjim/Kitsunebi-v1-Gemma2-8k-9B)

Kitsunebi V1 Gemma2 8K 9B Parameters and Internals

Model Type 
text generation
Additional Notes 
The model was merged using the SLERP method and is effective for coherence rather than textual richness.
Training Details 
Methodology:
context-based synthesized instruction pre-training data for supervised multitask pre-training
Model Architecture:
based on gemma-2 and fine-tuned by Axcxept co., ltd
LLM NameKitsunebi V1 Gemma2 8K 9B
Repository ๐Ÿค—https://huggingface.co/grimjim/Kitsunebi-v1-Gemma2-8k-9B 
Base Model(s)  Gemma 2 9B It SimPO   EZO Common 9B Gemma 2 It   princeton-nlp/gemma-2-9b-it-SimPO   HODACHI/EZO-Common-9B-gemma-2-it
Merged ModelYes
Model Size9b
Required VRAM18.5 GB
Updated2025-02-05
Maintainergrimjim
Model Typegemma2
Model Files  3.1 GB: 1-of-5   5.0 GB: 2-of-5   4.9 GB: 3-of-5   5.0 GB: 4-of-5   0.5 GB: 5-of-5
Context Length8k
Model ArchitectureGemma2ForCausalLM
Licensegemma
Context Length8192
Model Max Length8192
Transformers Version4.43.4
Tokenizer ClassGemmaTokenizer
Padding Token<pad>
Vocabulary Size256000
Torch Data Typebfloat16

Best Alternatives to Kitsunebi V1 Gemma2 8K 9B

Best Alternatives
Context / RAM
Downloads
Likes
G2 GSHT 32K32K / 20.4 GB110
Gemma 2 9B It SimPO8K / 18.6 GB29255142
Gemma 2 9B It8K / 18.6 GB388793642
Gemma 2 9B8K / 37.1 GB78866635
...2 9B Cpt Sahabatai V1 Instruct8K / 18.6 GB427634
MT4 Gen5 Gemma 2 9B8K / 20.4 GB1552
SILMA 9B Instruct V1.08K / 18.6 GB1881363
Recoilme Gemma 2 9B V0.38K / 20.4 GB51703
...erge 02012025163610 Gemma 2 9B8K / 20.4 GB461
...erge 02012025163610 Gemma 2 9B8K / 20.4 GB541
Note: green Score (e.g. "73.2") means that the model is better than grimjim/Kitsunebi-v1-Gemma2-8k-9B.

Rank the Kitsunebi V1 Gemma2 8K 9B Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 42565 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227