Airoboros L2 13B 2 1 YaRN 64K AWQ by TheBloke

 ยป  All LLMs  ยป  TheBloke  ยป  Airoboros L2 13B 2 1 YaRN 64K AWQ   URL Share it on

  4-bit   Autotrain compatible   Awq Base model:bhenrym14/airoboros... Base model:quantized:bhenrym14...   Custom code Dataset:jondurbin/airoboros-2....   Llama   Quantized   Region:us   Safetensors   Yarn

Airoboros L2 13B 2 1 YaRN 64K AWQ Benchmarks

nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").
Airoboros L2 13B 2 1 YaRN 64K AWQ (TheBloke/Airoboros-L2-13B-2_1-YaRN-64K-AWQ)

Airoboros L2 13B 2 1 YaRN 64K AWQ Parameters and Internals

Model Type 
llama
Additional Notes 
To use with Transformers, pass trust_remote_code=True. Performance at long context lengths needs further exploration.
Training Details 
Data Sources:
jondurbin/airoboros-2.1
Methodology:
Instruction tuning with additional pretraining done with YaRN scaling.
Context Length:
65536
Training Time:
~16 hours
Hardware Used:
1x RTX 6000 Ada
Model Architecture:
Extended context finetune of Llama-2-13b using YaRN scaling.
Input Output 
Input Format:
A chat.\nUSER: {prompt}\nASSISTANT: \n
LLM NameAiroboros L2 13B 2 1 YaRN 64K AWQ
Repository ๐Ÿค—https://huggingface.co/TheBloke/Airoboros-L2-13B-2_1-YaRN-64K-AWQ 
Model NameAiroboros L2 13B 2.1 YaRN 64K
Model Creatorbhenrym14
Base Model(s)  bhenrym14/airoboros-l2-13b-2.1-YaRN-64k   bhenrym14/airoboros-l2-13b-2.1-YaRN-64k
Model Size13b
Required VRAM7.2 GB
Updated2024-12-21
MaintainerTheBloke
Model Typellama
Model Files  7.2 GB
AWQ QuantizationYes
Quantization Typeawq
Model ArchitectureLlamaForCausalLM
Licensellama2
Context Length65536
Model Max Length65536
Transformers Version4.33.0.dev0
Tokenizer ClassLlamaTokenizer
Beginning of Sentence Token<s>
End of Sentence Token</s>
Unk Token<unk>
Vocabulary Size32000
Torch Data Typefloat16

Best Alternatives to Airoboros L2 13B 2 1 YaRN 64K AWQ

Best Alternatives
Context / RAM
Downloads
Likes
Yarn Llama 2 13B 128K AWQ128K / 7.2 GB202
LongAlign 13B 64K AWQ64K / 7.2 GB272
OrcaMaid V3 13B 32K AWQ32K / 7.2 GB234
OrcaMaid V2 FIX 13B 32K AWQ32K / 7.2 GB191
NexusRaven V2 13B AWQ16K / 7.2 GB143
NexusRaven V2 13B AWQ16K / 7.2 GB113
...th CodeLlama 13B Python Hf AWQ16K / 7.5 GB50
WhiteRabbitNeo 13B AWQ16K / 7.2 GB174
NexusRaven V2 13B AWQ16K / 7.2 GB231
Ramgpt 13B AWQ Gemm16K / 7.2 GB01
Note: green Score (e.g. "73.2") means that the model is better than TheBloke/Airoboros-L2-13B-2_1-YaRN-64K-AWQ.

Rank the Airoboros L2 13B 2 1 YaRN 64K AWQ Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 40013 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241217