Magot V2 Gemma2 8K 9B by grimjim

 ยป  All LLMs  ยป  grimjim  ยป  Magot V2 Gemma2 8K 9B   URL Share it on

  Merged Model   Autotrain compatible Base model:anthracite-org/magn... Base model:grimjim/kitsunebi-v...   Conversational   Endpoints compatible   Ext 8k   Gemma2   Region:us   Safetensors   Sharded   Tensorflow

Magot V2 Gemma2 8K 9B Benchmarks

nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").
Magot V2 Gemma2 8K 9B (grimjim/Magot-v2-Gemma2-8k-9B)

Magot V2 Gemma2 8K 9B Parameters and Internals

Model Type 
text-generation
Additional Notes 
There appears to be some damage to the model from merging. Stability has been improved by retaining embed_tokens and lm_head weights from the base model. Coherence is moderately high, though not perfect. The model tends towards melodramatic motifs, indicating certain favored tropes are baked into the model.
Training Details 
Context Length:
8000
LLM NameMagot V2 Gemma2 8K 9B
Repository ๐Ÿค—https://huggingface.co/grimjim/Magot-v2-Gemma2-8k-9B 
Base Model(s)  Magnum V3 9B Customgemma2   Kitsunebi V1 Gemma2 8K 9B   anthracite-org/magnum-v3-9b-customgemma2   grimjim/Kitsunebi-v1-Gemma2-8k-9B
Merged ModelYes
Model Size9b
Required VRAM18.6 GB
Updated2024-12-22
Maintainergrimjim
Model Typegemma2
Model Files  3.9 GB: 1-of-5   4.0 GB: 2-of-5   4.0 GB: 3-of-5   4.0 GB: 4-of-5   2.7 GB: 5-of-5
Context Length8k
Model ArchitectureGemma2ForCausalLM
Licensegemma
Context Length8192
Model Max Length8192
Transformers Version4.44.2
Tokenizer ClassGemmaTokenizer
Padding Token<pad>
Vocabulary Size256000
Torch Data Typebfloat16

Best Alternatives to Magot V2 Gemma2 8K 9B

Best Alternatives
Context / RAM
Downloads
Likes
Gemma 2 9B It SimPO8K / 18.6 GB16451136
Gemma 2 9B It8K / 18.6 GB340182594
Gemma 2 9B8K / 37.1 GB114489613
...2 9B Cpt Sahabatai V1 Instruct8K / 18.6 GB347627
SILMA 9B Instruct V1.08K / 18.6 GB1104755
MT3 Gen4 Gemma 2 9B8K / 20.4 GB191
MT4 Gen2 Gemma 2 9B8K / 20.4 GB1803
Darkest Muse V18K / 20.4 GB28625
Magnum V4 9B8K / 18.6 GB266314
MT4 Gen4 Gemma 2 9B8K / 20.4 GB240
Note: green Score (e.g. "73.2") means that the model is better than grimjim/Magot-v2-Gemma2-8k-9B.

Rank the Magot V2 Gemma2 8K 9B Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 40066 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241217