Magot V1 Gemma2 8K 9B by grimjim

 ยป  All LLMs  ยป  grimjim  ยป  Magot V1 Gemma2 8K 9B   URL Share it on

  Merged Model   Autotrain compatible Base model:anthracite-org/magn... Base model:grimjim/kitsunebi-v...   Conversational   Endpoints compatible   Ext 8k   Gemma2   Region:us   Safetensors   Sharded   Tensorflow

Magot V1 Gemma2 8K 9B Benchmarks

nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").
Magot V1 Gemma2 8K 9B (grimjim/Magot-v1-Gemma2-8k-9B)

Magot V1 Gemma2 8K 9B Parameters and Internals

Model Type 
text-generation
Additional Notes 
This model is an experiment in using merger as a method of making an Instruct-heavy model less constrained in its text generation. The low weight (0.2) infusion of the Magnum model provided needed variety to text generation. Inherent model safety is still strong due to Instruct base, but narratives are less bounded by positivity. Metadata linkbacks to this model are appreciated when used.
LLM NameMagot V1 Gemma2 8K 9B
Repository ๐Ÿค—https://huggingface.co/grimjim/Magot-v1-Gemma2-8k-9B 
Base Model(s)  Kitsunebi V1 Gemma2 8K 9B   Magnum V3 9B Customgemma2   grimjim/Kitsunebi-v1-Gemma2-8k-9B   anthracite-org/magnum-v3-9b-customgemma2
Merged ModelYes
Model Size9b
Required VRAM18.6 GB
Updated2025-06-02
Maintainergrimjim
Model Typegemma2
Model Files  1.9 GB: 1-of-10   2.0 GB: 2-of-10   2.0 GB: 3-of-10   2.0 GB: 4-of-10   2.0 GB: 5-of-10   2.0 GB: 6-of-10   2.0 GB: 7-of-10   2.0 GB: 8-of-10   2.0 GB: 9-of-10   0.7 GB: 10-of-10
Context Length8k
Model ArchitectureGemma2ForCausalLM
Licensegemma
Context Length8192
Model Max Length8192
Transformers Version4.44.2
Tokenizer ClassGemmaTokenizer
Padding Token<pad>
Vocabulary Size256000
Torch Data Typebfloat16

Best Alternatives to Magot V1 Gemma2 8K 9B

Best Alternatives
Context / RAM
Downloads
Likes
G2 GSHT 32K32K / 20.4 GB121
SystemGemma2 9B It32K / 18.6 GB221
Gemma 2 9B It SimPO8K / 18.6 GB16291164
Gemma 2 9B It8K / 18.6 GB301771709
Gemma 2 9B8K / 37.1 GB51100655
Darkest Muse V18K / 20.4 GB304176
...2 9B Cpt Sahabatai V1 Instruct8K / 18.6 GB306342
Magnolia V3 Gemma2 8K 9B8K / 18.6 GB5892
SILMA 9B Instruct V1.08K / 18.6 GB1985474
Gemma 2 Ataraxy V4d 9B8K / 20.4 GB67916
Note: green Score (e.g. "73.2") means that the model is better than grimjim/Magot-v1-Gemma2-8k-9B.

Rank the Magot V1 Gemma2 8K 9B Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 47771 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227