YuLan Mini by yulan-team

 ยป  All LLMs  ยป  yulan-team  ยป  YuLan Mini   URL Share it on

  Arxiv:2412.17743   Autotrain compatible   Code   Conversational   Dataset:ai-mo/numinamath-cot   Dataset:allenai/dolma   Dataset:bigcode/the-stack-v2   Dataset:deepmind/math dataset Dataset:deepseek-ai/deepseek-p... Dataset:gair-prox/open-web-mat... Dataset:huggingfacefw/fineweb-... Dataset:huggingfacetb/cosmoped... Dataset:huggingfacetb/smollm-c...   Dataset:internlm/lean-github   Dataset:internlm/lean-workbook   Dataset:liwu/mnbvc   Dataset:manu/project gutenberg   Dataset:math-ai/automathtext Dataset:microsoft/orca-math-wo... Dataset:mlfoundations/dclm-bas... Dataset:mrfakename/basic-math-...   Dataset:mu-nlpc/calc-ape210k Dataset:opencoder-llm/opc-anne... Dataset:opencoder-llm/opc-sft-... Dataset:opencoder-llm/opc-sft-... Dataset:opencsg/chinese-finewe... Dataset:ruc-aibox/long form th... Dataset:scalablemath/lean-cot-... Dataset:scalablemath/lean-cot-... Dataset:scalablemath/lean-star... Dataset:scalablemath/lean-star... Dataset:storytracer/loc-pd-boo... Dataset:vikp/textbook quality ... Dataset:xinyaohu/amps mathemat... Dataset:yulan-team/yulan-mini-...   En   Endpoints compatible   Llama   Math   Model-index   Region:us   Safetensors   Zh
Model Card on HF ๐Ÿค—: https://huggingface.co/yulan-team/YuLan-Mini 

YuLan Mini Benchmarks

nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").
YuLan Mini (yulan-team/YuLan-Mini)

YuLan Mini Parameters and Internals

LLM NameYuLan Mini
Repository ๐Ÿค—https://huggingface.co/yulan-team/YuLan-Mini 
Model Size2b
Required VRAM4.8 GB
Updated2025-01-07
Maintaineryulan-team
Model Typellama
Model Files  4.8 GB
Supported Languagesen zh
Model ArchitectureLlamaForCausalLM
Licensemit
Context Length28723
Model Max Length28723
Transformers Version4.47.1
Tokenizer ClassLlamaTokenizer
Padding Token<pad>
Vocabulary Size99000
Torch Data Typebfloat16

Best Alternatives to YuLan Mini

Best Alternatives
Context / RAM
Downloads
Likes
Llama 2B Hf 32768 Fpf32K / 3.8 GB2831
...icpm 2B Sft Bf16 Llamafied 16K16K / 6 GB5601
SmolLM2 MedIT Upscale 2B8K / 4.2 GB1434
Salamandra 2B8K / 4.5 GB331019
Sarvam 2B V0.58K / 5.1 GB35882
Llama3 2B Base8K / 4.7 GB2131
Test Quantized8K / 5.8 GB240
EPFL TA Meister Quantized V18K / 5.8 GB220
Llama3 Rommie8K / 5.8 GB240
...ta Llama 3 2B Mlp Layer Pruned8K / 5.1 GB150

Rank the YuLan Mini Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 41028 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227