Gemma 2 2B Stheno Filtered by SaisExperiments

 ยป  All LLMs  ยป  SaisExperiments  ยป  Gemma 2 2B Stheno Filtered   URL Share it on

Base model:finetune:google/gem... Base model:google/gemma-2-2b-i... Dataset:anthracite-org/stheno-...   Gemma2   Region:us   Safetensors

Gemma 2 2B Stheno Filtered Benchmarks

nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").
Gemma 2 2B Stheno Filtered (SaisExperiments/Gemma-2-2B-Stheno-Filtered)

Gemma 2 2B Stheno Filtered Parameters and Internals

Training Details 
Data Sources:
anthracite-org/stheno-filtered-v1.1, google/gemma-2-2b-it
Data Volume:
76.6M tokens
Context Length:
1024
Training Time:
14 hours
LLM NameGemma 2 2B Stheno Filtered
Repository ๐Ÿค—https://huggingface.co/SaisExperiments/Gemma-2-2B-Stheno-Filtered 
Base Model(s)  Gemma 2 2B It   google/gemma-2-2b-it
Model Size2b
Required VRAM5.2 GB
Updated2025-01-24
MaintainerSaisExperiments
Model Typegemma2
Model Files  5.2 GB
Model ArchitectureGemma2ForCausalLM
Licensegemma
Context Length8192
Model Max Length8192
Transformers Version4.44.2
Tokenizer ClassGemmaTokenizer
Padding Token<pad>
Vocabulary Size256000
Torch Data Typebfloat16

Best Alternatives to Gemma 2 2B Stheno Filtered

Best Alternatives
Context / RAM
Downloads
Likes
SJT 2B128K / 5.2 GB700
Gemma 2 2B It8K / 5.2 GB395324894
Gemma 2 2B8K / 10.5 GB129832483
Gemma 2 2B Jpn It8K / 5.2 GB17552155
GWQ2b8K / 5.2 GB24810
Gemma 2 2B TR Knowledge Graph8K / 5.2 GB28611
Gemma 2 Baku 2B It8K / 10.5 GB7140521
Gemma2Slerp1 2.6B8K / 5.3 GB2770
...emma 2 2B It Chinese Kyara DPO8K / 15.7 GB58778
2 PRYMMAL ECE 2B SLERP V18K / 15.8 GB5920
Note: green Score (e.g. "73.2") means that the model is better than SaisExperiments/Gemma-2-2B-Stheno-Filtered.

Rank the Gemma 2 2B Stheno Filtered Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 41817 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227