Gemma Ko 7B by beomi


Autotrain compatible · DOI: 10.57967/hf/1859 · En · Endpoints compatible · Gemma · Ko · PyTorch · Region: US · Safetensors · Sharded · TensorFlow
Model Card on HF 🤗: https://huggingface.co/beomi/gemma-ko-7b

Gemma Ko 7B Benchmarks

Scores (shown as nn.n%) indicate how the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o"), or GPT-4 ("gpt4").
Gemma Ko 7B (beomi/gemma-ko-7b)

Gemma Ko 7B Parameters and Internals

Model Type 
text generation
Use Cases 
Areas:
Content Creation and Communication, Research and Education
Applications:
Text Generation, NLP Research, Language Learning Tools, Knowledge Exploration
Primary Use Cases:
Question answering, Summarization, Reasoning
Limitations:
Biases or gaps in the training data can lead to limitations in the model's responses.
LLMs may struggle to grasp subtle nuances, sarcasm, or figurative language.
They may generate incorrect or outdated factual statements.
They may lack the ability to apply common-sense reasoning in certain situations.
Considerations:
Developers are encouraged to comply with privacy regulations and to apply privacy-preserving techniques.
Additional Notes 
Open Large Language Models (LLMs) have a wide range of applications across various industries and domains.
Supported Languages 
ko (fluent), en (fluent)
Training Details 
Context Length:
2048
Hardware Used:
TPU
Model Architecture:
Text-to-text decoder-only
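
These architecture details can be cross-checked against the released checkpoint. A minimal sketch, assuming the standard Hugging Face transformers AutoConfig API; the expected values in the comments are taken from the parameters table later on this page, not verified here:

```python
# Sketch: inspect the checkpoint's config to confirm the architecture
# details above. Assumes the standard transformers AutoConfig API.
from transformers import AutoConfig

cfg = AutoConfig.from_pretrained("beomi/gemma-ko-7b")
print(cfg.model_type)                # expected: "gemma"
print(cfg.architectures)             # expected: ["GemmaForCausalLM"]
print(cfg.vocab_size)                # expected: 256000
print(cfg.max_position_embeddings)   # expected: 8192 (model max length)
```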
Responsible AI Considerations 
Fairness:
LLMs trained on large-scale, real-world text data can reflect socio-cultural biases embedded in the training material.
Transparency:
This model card summarizes details of the model's architecture, capabilities, limitations, and evaluation processes.
Accountability:
Mechanisms and guidelines for content safety are essential. Developers are encouraged to exercise caution and implement appropriate content safety safeguards based on their specific product policies and application use cases.
Mitigation Strategies:
Educational resources and reporting mechanisms for users to flag misuse are provided.
Input Output 
Input Format:
Text string
Accepted Modalities:
text
Output Format:
Generated Korean/English-language text
Performance Tips:
Providing more context in the prompt generally improves output quality, up to a point.
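
A minimal generation sketch, assuming the standard Hugging Face transformers API (the card lists transformers 4.38.1, bfloat16 weights, and a GemmaForCausalLM architecture); the Korean prompt is an illustrative placeholder, not taken from the model card:

```python
# Sketch: Korean/English text generation with beomi/gemma-ko-7b.
# Assumes the standard transformers generate() API; the prompt is a made-up example.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "beomi/gemma-ko-7b"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # matches the card's Torch Data Type
    device_map="auto",           # needs accelerate; places the ~17 GB of sharded weights
)

prompt = "한국의 수도는"  # "The capital of Korea is"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```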
Release Notes 
Version:
7B
Date:
2024-03-08
Notes:
First release of Gemma-Ko 7B model
LLM Name: Gemma Ko 7B
Repository 🤗: https://huggingface.co/beomi/gemma-ko-7b
Model Size: 7B
Required VRAM: 17 GB
Updated: 2024-12-22
Maintainer: beomi
Model Type: gemma
Model Files: 2.9 GB (1-of-6), 2.9 GB (2-of-6), 3.0 GB (3-of-6), 2.9 GB (4-of-6), 2.9 GB (5-of-6), 2.4 GB (6-of-6)
Supported Languages: ko, en
Model Architecture: GemmaForCausalLM
License: other
Context Length: 8192
Model Max Length: 8192
Transformers Version: 4.38.1
Tokenizer Class: GemmaTokenizer
Padding Token: <pad>
Vocabulary Size: 256000
Torch Data Type: bfloat16
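
The 17 GB VRAM figure is consistent with simple arithmetic: the six safetensors shards above sum to 17.0 GB, and bfloat16 stores two bytes per parameter. A back-of-the-envelope sketch; the ~8.5B total parameter count (embeddings included) is an assumption based on the published Gemma 7B architecture, not a number on this card:

```python
# Sketch: estimate weight memory for a bfloat16 checkpoint.
# params is an assumed total (~8.5e9 for Gemma 7B incl. its 256k-token
# embedding table); only the 17 GB result is stated on this page.
params = 8.5e9
bytes_per_param = 2  # bfloat16
weights_gb = params * bytes_per_param / 1e9
print(f"~{weights_gb:.1f} GB for weights alone")  # ~17.0 GB, before KV cache/activations
```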

Best Alternatives to Gemma Ko 7B

Best Alternatives                         Context / RAM    Downloads  Likes
Kaggle Math Model Gemma V1                12K / 17.1 GB           25      0
Gemma 1.1 7B It                           8K / 17.1 GB         15458    267
SeaLLM 7B V2.5                            8K / 17.1 GB         14852     49
SauerkrautLM Gemma 7B                     8K / 17.1 GB          7714     13
Zephyr 7B Gemma V0.1                      8K / 17.1 GB          7121    121
... Codegemma 2 7B It Alpaca V1.3         8K / 17.1 GB            19      0
Codegemma 7B                              8K / 17.1 GB          3119    170
Codegemma 7B It                           8K / 17.1 GB          1887    205
DiscoPOP Zephyr 7B Gemma                  8K / 17.1 GB          7060     36
Gemma 7B Openhermes V0.80                 8K / 17 GB            7407      1
Note: a green score (e.g. "73.2") means that model outperforms beomi/gemma-ko-7b.



Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241217