Gemma 1.1 2B It by google

 ยป  All LLMs  ยป  google  ยป  Gemma 1.1 2B It   URL Share it on

  Arxiv:1705.03551   Arxiv:1804.06876   Arxiv:1809.02789   Arxiv:1811.00937   Arxiv:1904.09728   Arxiv:1905.07830   Arxiv:1905.10044   Arxiv:1907.10641   Arxiv:1911.01547   Arxiv:1911.11641   Arxiv:2009.03300   Arxiv:2107.03374   Arxiv:2108.07732   Arxiv:2110.08193   Arxiv:2110.14168   Arxiv:2206.04615   Arxiv:2304.06364   Arxiv:2312.11805   Autotrain compatible   Conversational   Endpoints compatible   Gemma   Region:us   Safetensors   Sharded   Tensorflow
Model Card on HF ๐Ÿค—: https://huggingface.co/google/gemma-1.1-2b-it 

Gemma 1.1 2B It Benchmarks

nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").
Gemma 1.1 2B It (google/gemma-1.1-2b-it)

Gemma 1.1 2B It Parameters and Internals

Model Type 
text generation, text-to-text, large language model, decoder-only LL
Use Cases 
Areas:
Research, Commercial applications
Applications:
Content Creation and Communication, Research and Education
Primary Use Cases:
Text Generation, Chatbots and Conversational AI, Text Summarization
Limitations:
Model's bias due to training data, Challenges with open-ended tasks
Considerations:
Continuous monitoring and detailed guidelines.
Additional Notes 
Models are state-of-the-art open large language models designed for responsible AI development.
Supported Languages 
English (Fluent)
Training Details 
Data Sources:
Web Documents, Code, Mathematics
Data Volume:
6 trillion tokens
Methodology:
Reinforcement Learning with Human Feedback (RLHF)
Context Length:
2048
Training Time:
Unknown
Hardware Used:
TPUv5e
Model Architecture:
Text-to-text, decoder-only
Safety Evaluation 
Methodologies:
Red-teaming, Automated evaluations, Human evaluation
Findings:
Acceptable thresholds for internal policies, Results on well-known safety benchmarks
Risk Categories:
Child safety, Content safety, Representational harms, Memorization risks, Large-scale harms
Ethical Considerations:
Various safety benchmarks and internal evaluation methods applied.
Responsible Ai Considerations 
Fairness:
Careful pre-processing and evaluation to mitigate biases
Transparency:
Open model with summarized details on architecture and capabilities
Accountability:
Development of mechanisms and guidelines for content safety
Mitigation Strategies:
Techniques for de-biasing and content flagging guidelines
Input Output 
Input Format:
Text string
Accepted Modalities:
text
Output Format:
Generated text
Performance Tips:
Addition of appropriate content context can improve performance.
Release Notes 
Version:
1.1
Date:
Unknown
Notes:
Substantial gains with improved RLHF method and resolved multi-turn conversation bug.
LLM NameGemma 1.1 2B It
Repository ๐Ÿค—https://huggingface.co/google/gemma-1.1-2b-it 
Model Size2b
Required VRAM5.1 GB
Updated2024-12-21
Maintainergoogle
Model Typegemma
Model Files  5.0 GB: 1-of-2   0.1 GB: 2-of-2
Model ArchitectureGemmaForCausalLM
Licensegemma
Context Length8192
Model Max Length8192
Transformers Version4.38.1
Tokenizer ClassGemmaTokenizer
Padding Token<pad>
Vocabulary Size256000
Torch Data Typebfloat16

Quantized Models of the Gemma 1.1 2B It

Model
Likes
Downloads
VRAM
Gemma 1.1 2B It GPTQ1275693 GB
Gemma 1.1 2B It GGUF02541 GB
Gemma 1.1 2B It 4bit3232 GB
Gemma 1.1 2B It AWQ1223 GB

Best Alternatives to Gemma 1.1 2B It

Best Alternatives
Context / RAM
Downloads
Likes
Gemma Ko 1.1 2B It8K / 5.1 GB48291
Octopus V28K / 5.1 GB490868
Codegemma 2B8K / 5.1 GB213773
EMO 2B8K / 5.1 GB44251
Gemma 2B Ko V08K / 5 GB25270
Gemma2b Lungcancerqa8K / 3.1 GB762
Gemma 2B Ko Dev Pbmt1928K / 5 GB25201
Gemma 2B Data Std8K / 5.1 GB25301
Geko28K / 5.1 GB160
2B Or Not 2B8K / 5.1 GB7525

Rank the Gemma 1.1 2B It Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 40013 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241217