SauerkrautLM Gemma 7B by VAGOsolutions

Tags: Alpha, Autotrain compatible, De, Dpo, En, Endpoints compatible, Finetuned, Gemma, Laser-qlora, Laserrmt, Region:us, Safetensors, Sft, Sharded, Tensorflow, Work in progress

SauerkrautLM Gemma 7B Parameters and Internals

Model Type 
finetuned
Additional Notes 
Early-stage fine-tuned model; it may show unexpected behavior.
Supported Languages 
de (trained), en (trained)
Training Details 
Data Sources:
SFT, DPO, Laser data(x), Laser again on data(y)
Methodology:
Spherical Linear Interpolation and a lasered version of the model, partially freezing with laser-like analysis
Model Architecture:
novel training technique: laser-QLoRA
Input Output 
Input Format:
vicuna prompt template
Performance Tips:
Use stopping strings "~~", " "
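A minimal sketch of how the input format and stopping strings above might be applied. The system line is the commonly used Vicuna v1.1 wording and is an assumption here, since the listing only says "vicuna prompt template"; the helper names `build_prompt` and `truncate_at_stop` are hypothetical. Only the "~~" stopping string is legible in the listing, so the second one is left out rather than guessed.

```python
# Assumed Vicuna v1.1-style system line; check the model repository for the exact wording.
SYSTEM = (
    "A chat between a curious user and an artificial intelligence assistant. "
    "The assistant gives helpful, detailed, and polite answers to the user's questions."
)

# Only "~~" is legible in the listing; the second stopping string is not recoverable here.
STOP_STRINGS = ["~~"]


def build_prompt(user_message: str, system: str = SYSTEM) -> str:
    """Format a single-turn prompt in the assumed Vicuna style."""
    return f"{system} USER: {user_message} ASSISTANT:"


def truncate_at_stop(text: str, stops=STOP_STRINGS) -> str:
    """Cut generated text at the earliest occurrence of any stopping string."""
    cut = len(text)
    for stop in stops:
        idx = text.find(stop)
        if idx != -1:
            cut = min(cut, idx)
    return text[:cut].rstrip()


if __name__ == "__main__":
    print(build_prompt("Was ist Sauerkraut?"))
    print(truncate_at_stop("Sauerkraut ist fermentierter Kohl.~~USER: ..."))
```

Truncating after generation is the simplest option; inference backends that accept stop sequences directly can end generation at the same strings instead.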
Release Notes 
Version:
01.03.2024
Date:
2024-03-01
Notes:
Reuploaded the model in bfloat16 dtype.
Version:
02.03.2024
Date:
2024-03-02
Notes:
Strongest Gemma finetune model so far with additional scoring.
LLM Name: SauerkrautLM Gemma 7B
Repository: https://huggingface.co/VAGOsolutions/SauerkrautLM-Gemma-7b
Model Size: 7b
Required VRAM: 17.1 GB
Updated: 2024-11-21
Maintainer: VAGOsolutions
Model Type: gemma
Model Files: 5.0 GB (1-of-4), 5.0 GB (2-of-4), 5.0 GB (3-of-4), 2.1 GB (4-of-4)
Supported Languages: de, en
Model Architecture: GemmaForCausalLM
License: other
Context Length: 8192
Model Max Length: 8192
Transformers Version: 4.39.0.dev0
Tokenizer Class: GemmaTokenizer
Padding Token: <pad>
Vocabulary Size: 256000
Torch Data Type: bfloat16
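The Required VRAM figure appears to be the sum of the four model file shards. The short calculation below checks that and derives a rough parameter count from the bfloat16 data type. It assumes 1 GB = 10^9 bytes and that the shards hold only bfloat16 weights; actual inference also needs memory for activations and the KV cache, which this sketch ignores.

```python
# Shard sizes as listed under Model Files (GB).
shard_sizes_gb = [5.0, 5.0, 5.0, 2.1]

# bfloat16 stores each parameter in 2 bytes.
BYTES_PER_PARAM_BF16 = 2

# Total weight size in GB (assuming 1 GB = 1e9 bytes).
total_gb = sum(shard_sizes_gb)

# GB divided by bytes per parameter gives billions of parameters.
approx_params_billion = total_gb / BYTES_PER_PARAM_BF16

print(f"Total weights: {total_gb:.1f} GB")
print(f"Approximate parameters: {approx_params_billion:.2f} billion")
```

Under these assumptions the estimate comes out above the nominal "7b" label, which suggests the size label is a model-family name rather than an exact count.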
Best Alternatives to SauerkrautLM Gemma 7B

Best Alternatives | Context / RAM | Downloads/Likes (unseparated)
Kaggle Math Model Gemma V1 | 12K / 17.1 GB | 60
Gemma 1.1 7B It | 8K / 17.1 GB | 16633265
SeaLLM 7B V2.5 | 8K / 17.1 GB | 1376749
Codegemma 7B It | 8K / 17.1 GB | 28375201
Zephyr 7B Gemma V0.1 | 8K / 17.1 GB | 970121
Codegemma 7B | 8K / 17.1 GB | 5959166
DiscoPOP Zephyr 7B Gemma | 8K / 17.1 GB | 516936
Gemma 7B Aps It | 8K / 17.1 GB | 18524
Gemma Mling 7B | 8K / 17.8 GB | 181713
Gemma 7B Openhermes V0.80 | 8K / 17 GB | 57051
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241110