Linkbricks Llama3.2 Korean Cpt 3B by Saxo



Linkbricks Llama3.2 Korean Cpt 3B Benchmarks

Scores are reported as nn.n%, showing how the model compares to the reference models: Anthropic Claude 3.5 Sonnet ("so35"), GPT-4o ("gpt4o"), or GPT-4 ("gpt4").
Model evaluated: Linkbricks Llama3.2 Korean Cpt 3B (Saxo/Linkbricks-Llama3.2-Korean-cpt-3b)

Linkbricks Llama3.2 Korean Cpt 3B Parameters and Internals

Model Type: text-generation
Supported Languages: ko (Korean), en (English), jp (Japanese), cn (Chinese)
Training Details
Data Sources: Saxo/ko_cn_translation_tech_social_science_linkbricks_single_dataset, Saxo/ko_jp_translation_tech_social_science_linkbricks_single_dataset, Saxo/en_ko_translation_tech_science_linkbricks_single_dataset_with_prompt_text_huggingface, Saxo/en_ko_translation_social_science_linkbricks_single_dataset_with_prompt_text_huggingface, Saxo/ko_aspect_sentiment_sns_mall_sentiment_linkbricks_single_dataset_with_prompt_text_huggingface, Saxo/ko_summarization_linkbricks_single_dataset_with_prompt_text_huggingface, Saxo/OpenOrca_cleaned_kor_linkbricks_single_dataset_with_prompt_text_huggingface, Saxo/ko_government_qa_total_linkbricks_single_dataset_with_prompt_text_huggingface_sampled, Saxo/ko-news-corpus-1, Saxo/ko-news-corpus-2, Saxo/ko-news-corpus-3, Saxo/ko-news-corpus-4, Saxo/ko-news-corpus-5, Saxo/ko-news-corpus-6, Saxo/ko-news-corpus-7, Saxo/ko-news-corpus-8, Saxo/ko-news-corpus-9, maywell/ko_Ultrafeedback_binarized, youjunhyeok/ko-orca-pair-and-ultrafeedback-dpo, lilacai/glaive-function-calling-v2-sharegpt, kuotient/gsm8k-ko
Data Volume: 50 million Korean news articles plus other Korean corpora
Methodology: Continued Pre-Training (CPT)
Context Length: 128,000 tokens
Hardware Used: 8x H100 80GB GPUs
Model Architecture: re-tuning of 35% of the model parameters
Input/Output
Accepted Modalities: text (see the usage sketch below)
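For reference, here is a minimal usage sketch (not part of the original model card) showing how a text-generation checkpoint like this is typically loaded with Hugging Face transformers. It assumes the chat template inherited from the Llama-3.2-3B-Instruct base model; the Korean prompt and generation settings are purely illustrative.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "Saxo/Linkbricks-Llama3.2-Korean-cpt-3b"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # matches the listed Torch Data Type
    device_map="auto",           # about 7.2 GB of VRAM required per the card
)

# Hypothetical Korean instruction, for illustration only.
messages = [{"role": "user", "content": "한국의 수도와 그 특징을 간단히 설명해 주세요."}]
input_ids = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

output_ids = model.generate(input_ids, max_new_tokens=128)
print(tokenizer.decode(output_ids[0][input_ids.shape[-1]:], skip_special_tokens=True))
```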
LLM Name: Linkbricks Llama3.2 Korean Cpt 3B
Repository: https://huggingface.co/Saxo/Linkbricks-Llama3.2-Korean-cpt-3b
Base Model(s): meta-llama/Llama-3.2-3B-Instruct
Model Size: 3B
Required VRAM: 7.2 GB
Updated: 2024-12-21
Maintainer: Saxo
Model Type: llama
Instruction-Based: Yes
Model Files: 5.0 GB (shard 1 of 2), 2.2 GB (shard 2 of 2)
Supported Languages: ko, en, jp, cn
Model Architecture: LlamaForCausalLM
License: apache-2.0
Context Length: 131072
Model Max Length: 131072
Transformers Version: 4.43.2
Tokenizer Class: PreTrainedTokenizerFast
Padding Token: <|eot_id|>
Vocabulary Size: 128256
Torch Data Type: bfloat16
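The configuration values listed above can be checked programmatically. Below is a minimal sanity-check sketch, assuming only the standard transformers APIs and that the repository is publicly accessible; the expected values in the comments are taken from the table above.

```python
from transformers import AutoConfig, AutoTokenizer

model_id = "Saxo/Linkbricks-Llama3.2-Korean-cpt-3b"

config = AutoConfig.from_pretrained(model_id)
tokenizer = AutoTokenizer.from_pretrained(model_id)

print(config.model_type)               # expected: "llama"
print(config.architectures)            # expected: ["LlamaForCausalLM"]
print(config.max_position_embeddings)  # expected: 131072
print(config.vocab_size)               # expected: 128256
print(config.torch_dtype)              # expected: torch.bfloat16
print(tokenizer.model_max_length)      # expected: 131072
print(tokenizer.pad_token)             # expected: "<|eot_id|>"
```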

Best Alternatives to Linkbricks Llama3.2 Korean Cpt 3B

| Best Alternatives | Context / RAM | Downloads / Likes |
|---|---|---|
| Llama 3.2 3B Instruct | 128K / 6.5 GB | 2165514808 |
| Llama 3.2 3B Instruct | 128K / 6.4 GB | 6393735 |
| Llama Song Stream 3B Instruct | 128K / 6.5 GB | 299 |
| Llama Chat Summary 3.2 3B | 128K / 6.5 GB | 577 |
| ...lama 3.2 Rabbit Ko 3B Instruct | 128K / 6.5 GB | 15538 |
| Llama 3.2 3B Instruct | 128K / 6.5 GB | 554272 |
| SummLlama3.2 3B | 128K / 6.5 GB | 482135 |
| FinMatcha 3B Instruct | 128K / 6.5 GB | 3590 |
| ...3.2 Rabbit Ko 3B Instruct 2412 | 128K / 6.5 GB | 270 |
| ... Instruct JankMixBread V0.1 3B | 128K / 6.4 GB | 3430 |
Note: a green score (e.g. "73.2") indicates that the listed model outperforms Saxo/Linkbricks-Llama3.2-Korean-cpt-3b.

Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241217