Yunsung Llama 2 Koen 13B Linkbricks Sft Basic V1 by Saxo



Yunsung Llama 2 Koen 13B Linkbricks Sft Basic V1 Benchmarks

Benchmark scores (shown as percentages) indicate how Yunsung Llama 2 Koen 13B Linkbricks Sft Basic V1 (Saxo/yunsung-llama-2-koen-13b-linkbricks-sft-basic-v1) compares with the reference models Anthropic Claude 3.5 Sonnet ("so35"), GPT-4o ("gpt4o"), and GPT-4 ("gpt4").

Yunsung Llama 2 Koen 13B Linkbricks Sft Basic V1 Parameters and Internals

Model Type 
text-generation
Additional Notes 
Flash Attention is disabled. Fine-tuning used PEFT with int4 quantization and LoRA (r=64, alpha=16, dropout=0.1); see the configuration sketch below.
Supported Languages 
ko (Korean), en (English)
Training Details 
Data Sources:
Saxo/total_ko_train_set_small_basic, beomi/KoAlpaca-v1.1a, kyujinpy/KOR-OpenOrca-Platypus-v2, nlpai-lab/databricks-dolly-15k-ko
Methodology:
Instruction tuning via supervised fine-tuning (SFT)
Context Length:
2048
Training Time:
4 hours
Hardware Used:
4× NVIDIA A100 40 GB GPUs on GCP
Input Output 
Input Format:
Alpaca-format prompt text (see the inference example after the model details below)
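
The notes above describe a QLoRA-style setup (int4 quantization with PEFT and LoRA r=64, alpha=16, dropout=0.1). The sketch below shows how such a configuration could look with the Hugging Face transformers, bitsandbytes, and peft libraries; the base model name and the LoRA target modules are illustrative assumptions, not the author's actual training script.

```python
# Minimal QLoRA-style sketch matching the notes above
# (int4 quantization, LoRA r=64/alpha=16/dropout=0.1, Flash Attention off).
# Base model and target modules are assumptions for illustration only.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model, prepare_model_for_kbit_training

base_model = "beomi/llama-2-koen-13b"  # assumed Korean/English Llama 2 base

bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,                      # "int4" quantization from the notes
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,  # matches the listed bfloat16 dtype
)

tokenizer = AutoTokenizer.from_pretrained(base_model)
model = AutoModelForCausalLM.from_pretrained(
    base_model,
    quantization_config=bnb_config,
    attn_implementation="eager",  # Flash Attention disabled, per the notes
    device_map="auto",
)
model = prepare_model_for_kbit_training(model)

lora_config = LoraConfig(
    r=64,
    lora_alpha=16,
    lora_dropout=0.1,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],  # assumed targets
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()
```

From here, supervised fine-tuning on the datasets listed above could be run with a standard trainer (for example TRL's SFTTrainer) using 2048-token sequences, matching the stated context length.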
LLM Name: Yunsung Llama 2 Koen 13B Linkbricks Sft Basic V1
Repository: https://huggingface.co/Saxo/yunsung-llama-2-koen-13b-linkbricks-sft-basic-v1
Model Size: 13b
Required VRAM: 26.3 GB
Updated: 2025-03-19
Maintainer: Saxo
Model Type: llama
Model Files: 5.0 GB (1-of-6), 4.9 GB (2-of-6), 4.9 GB (3-of-6), 4.9 GB (4-of-6), 5.0 GB (5-of-6), 1.6 GB (6-of-6)
Supported Languages: ko, en
Model Architecture: LlamaForCausalLM
License: apache-2.0
Context Length: 2048
Model Max Length: 2048
Transformers Version: 4.38.2
Tokenizer Class: LlamaTokenizer
Padding Token: </s>
Vocabulary Size: 46336
Torch Data Type: bfloat16
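
Given the details above (bfloat16 weights, 2048-token context, LlamaTokenizer with </s> padding), a minimal inference sketch might look like the following. The exact Alpaca prompt template is an assumption based on the Input Format note and should be checked against the repository.

```python
# Minimal inference sketch, assuming the standard Alpaca prompt template.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

repo = "Saxo/yunsung-llama-2-koen-13b-linkbricks-sft-basic-v1"

tokenizer = AutoTokenizer.from_pretrained(repo)   # LlamaTokenizer, pad token </s>
model = AutoModelForCausalLM.from_pretrained(
    repo,
    torch_dtype=torch.bfloat16,  # matches the listed torch data type
    device_map="auto",
)

# Assumed Alpaca-format prompt; the instruction is Korean for
# "Briefly explain the difference between deep learning and machine learning."
prompt = (
    "Below is an instruction that describes a task. "
    "Write a response that appropriately completes the request.\n\n"
    "### Instruction:\n딥러닝과 머신러닝의 차이를 간단히 설명해 주세요.\n\n"
    "### Response:\n"
)

inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
output = model.generate(
    **inputs,
    max_new_tokens=256,  # keep prompt + output within the 2048-token context
    do_sample=True,
    temperature=0.7,
    pad_token_id=tokenizer.eos_token_id,
)
print(tokenizer.decode(output[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True))
```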

Best Alternatives to Yunsung Llama 2 Koen 13B Linkbricks Sft Basic V1

Best Alternatives | Context / RAM | Downloads | Likes
Luminaura RP 13B | 128K / 26 GB | 13 | 0
Yarn Llama 2 13B 128K | 128K / 26 GB | 2404 | 112
Agent Llama2 13B 80K | 80K / 26.4 GB | 11 | 0
Chat Llama2 13B 80K | 80K / 52.8 GB | 11 | 0
LongAlign 13B 64K | 64K / 26 GB | 30 | 13
LongAlign 13B 64K Base | 64K / 26 GB | 18 | 3
Yarn Llama 2 13B 64K | 64K / 26 GB | 4303 | 17
Openbuddy Llama2 13B V15p1 64K | 64K / 26.1 GB | 11 | 4
Openbuddy Llama2 13b64k V15 | 64K / 26.1 GB | 7 | 1
Airoboros L2 13B 2.1 YaRN 64K | 64K / 26 GB | 33 | 7


Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227