Continued Trained Gemma2 2B by ChallengerSpaceShuttle


Base model:finetune:google/gem...   Base model:google/gemma-2-2b Dataset:challengerspaceshuttle...   Gemma2   Pytorch   Region:us   Zu


Continued Trained Gemma2 2B Parameters and Internals

Model Type 
text generation
Use Cases 
Primary Use Cases:
Generate coherent Zulu text
Additional Notes 
First iteration of an effort to build isiZulu models whose performance is comparable to that of models trained at high cost.
Supported Languages 
Zulu (fluent)
Training Details 
Data Sources:
ChallengerSpaceShuttle/zulu-pretraining-dataset
Context Length:
8192
Hardware Used:
devices: auto, num_nodes: 1
Model Architecture:
Gemma2ForCausalLM
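
The fields above (text generation, Zulu output, Gemma2ForCausalLM) suggest the model can be used like any other causal language model. The following is a minimal sketch, assuming the repository loads through the standard Hugging Face `transformers` auto classes and that `torch` and `transformers` are installed. The helper name `generate_zulu`, the default of 64 new tokens, and the sample prompt are illustrative choices and do not come from the listing.

```python
# Sketch only: assumes the repo works with the standard transformers auto classes.
MODEL_ID = "ChallengerSpaceShuttle/continued-trained-gemma2-2b"


def generate_zulu(prompt: str, max_new_tokens: int = 64) -> str:
    """Continue a Zulu prompt. Downloads the weights on first call."""
    # Imported lazily so that defining this helper does not require torch.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    # float32 matches the data type given in the listing.
    model = AutoModelForCausalLM.from_pretrained(MODEL_ID, torch_dtype=torch.float32)
    model.eval()

    inputs = tokenizer(prompt, return_tensors="pt")
    with torch.no_grad():
        output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    # The decoded string contains the prompt followed by the continuation.
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `generate_zulu("Sawubona, igama lami ngu")` would return the prompt followed by the model's continuation. Because this is a continued-pretraining checkpoint of a base model, plain text prompts are likely a better fit than chat-style instructions.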
LLM Name: Continued Trained Gemma2 2B
Repository: https://huggingface.co/ChallengerSpaceShuttle/continued-trained-gemma2-2b
Base Model(s): Gemma 2 2B (google/gemma-2-2b)
Model Size: 2b
Required VRAM: 13.4 GB
Updated: 2025-02-05
Maintainer: ChallengerSpaceShuttle
Model Type: gemma2
Model Files: 13.4 GB
Supported Languages: zu
Model Architecture: Gemma2ForCausalLM
License: gemma
Context Length: 8192
Model Max Length: 8192
Transformers Version: 4.42.4
Vocabulary Size: 288256
Torch Data Type: float32
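
The 13.4 GB figure and the float32 data type can be related with simple arithmetic. The sketch below assumes the listed size covers weights only, that 1 GB means 10^9 bytes, and that every stored value uses the listed data type. It does not account for activations or the KV cache, so actual memory use during inference will be higher. The function names are illustrative.

```python
# Rough memory arithmetic from the listed figures (a sketch, not a measurement).
BYTES_PER_VALUE = {"float32": 4, "bfloat16": 2, "float16": 2, "int8": 1}


def stored_values_billions(size_gb: float, dtype: str) -> float:
    """Approximate count of stored weights, in billions, for a checkpoint size."""
    return size_gb / BYTES_PER_VALUE[dtype]


def size_in_dtype_gb(size_gb: float, src_dtype: str, dst_dtype: str) -> float:
    """Approximate size of the same weights if stored in another data type."""
    return size_gb * BYTES_PER_VALUE[dst_dtype] / BYTES_PER_VALUE[src_dtype]


listed_gb = 13.4  # "Model Files" and "Required VRAM" in the listing
print(f"stored values: ~{stored_values_billions(listed_gb, 'float32'):.2f} billion")
print(f"bfloat16 size: ~{size_in_dtype_gb(listed_gb, 'float32', 'bfloat16'):.1f} GB")
```

Under these assumptions, loading the weights in a 16-bit type would roughly halve the footprint.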

Best Alternatives to Continued Trained Gemma2 2B

Model | Context / RAM | Downloads and Likes
SJT 2B | 128K / 5.2 GB | 710
SILMA Kashif 2B Instruct V1.0 | 12K / 5.2 GB | 121712
Gemma 2 2B It | 8K / 5.2 GB | 421819908
Gemma 2 2B | 8K / 10.5 GB | 182986487
Gemma 2 2B Jpn It | 8K / 5.2 GB | 16807158
GWQ2b | 8K / 5.2 GB | 27010
Gemma2Slerp2 2.6B | 8K / 5.3 GB | 3112
2 PRYMMAL ECE 2B SLERP V1 | 8K / 15.8 GB | 8650
...emma 2 2B It Chinese Kyara DPO | 8K / 15.7 GB | 75568
Gemma2Slerp1 2.6B | 8K / 5.3 GB | 1320


Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227