EZO Common T2 2B Gemma 2 It by HODACHI

 ยป  All LLMs  ยป  HODACHI  ยป  EZO Common T2 2B Gemma 2 It   URL Share it on

  Autotrain compatible   Conversational   Endpoints compatible   Gemma2   Region:us   Safetensors   Sharded   Tensorflow

EZO Common T2 2B Gemma 2 It Benchmarks

nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").
EZO Common T2 2B Gemma 2 It (AXCXEPT/EZO-Common-T2-2B-gemma-2-it)

EZO Common T2 2B Gemma 2 It Parameters and Internals

Model Type 
text generation, conversational
Use Cases 
Areas:
Global applications
Considerations:
This model is suitable for global use due to its diverse language training.
Additional Notes 
The model excels in Japanese language tasks but is designed for diverse global needs.
Supported Languages 
Japanese (high proficiency), Other languages (varied proficiency)
Training Details 
Data Sources:
https://huggingface.co/datasets/legacy-datasets/wikipedia, https://huggingface.co/datasets/HuggingFaceFW/fineweb
Data Volume:
Japanese Wikipedia and FineWeb data
Methodology:
Plain instruction tuning
Training Time:
4 hours
Hardware Used:
A100 x 8
Model Architecture:
Enhanced Gemma-2-2B-it with multiple tuning techniques
LLM NameEZO Common T2 2B Gemma 2 It
Repository ๐Ÿค—https://huggingface.co/AXCXEPT/EZO-Common-T2-2B-gemma-2-it 
Model Size2b
Required VRAM5.2 GB
Updated2025-01-16
MaintainerHODACHI
Model Typegemma2
Model Files  5.0 GB: 1-of-2   0.2 GB: 2-of-2
Model ArchitectureGemma2ForCausalLM
Licensegemma
Context Length8192
Model Max Length8192
Transformers Version4.43.3
Tokenizer ClassGemmaTokenizer
Padding Token<pad>
Vocabulary Size256000
Torch Data Typebfloat16

Best Alternatives to EZO Common T2 2B Gemma 2 It

Best Alternatives
Context / RAM
Downloads
Likes
SJT 2B128K / 5.2 GB640
Gemma 2 2B It8K / 5.2 GB363779870
Gemma 2 2B8K / 10.5 GB112126476
Gemma 2 2B Jpn It8K / 5.2 GB13508151
GWQ2b8K / 5.2 GB1579
Gemma 2 Baku 2B It8K / 10.5 GB6940121
Gemma2Slerp1 2.6B8K / 5.3 GB2220
2 PRYMMAL ECE 2B SLERP V18K / 15.8 GB5830
...emma 2 2B It Chinese Kyara DPO8K / 15.7 GB38578
Gemma2Slerp2 2.6B8K / 5.3 GB1090
Note: green Score (e.g. "73.2") means that the model is better than AXCXEPT/EZO-Common-T2-2B-gemma-2-it.

Rank the EZO Common T2 2B Gemma 2 It Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 41418 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227