TinyLlama 1.1B Chat V1.0 AWQ by TheBloke

 ยป  All LLMs  ยป  TheBloke  ยป  TinyLlama 1.1B Chat V1.0 AWQ   URL Share it on

  4-bit   Autotrain compatible   Awq Base model:quantized:tinyllama... Base model:tinyllama/tinyllama...   Conversational   Dataset:bigcode/starcoderdata Dataset:cerebras/slimpajama-62... Dataset:openassistant/oasst to...   En   Llama   Quantized   Region:us   Safetensors

TinyLlama 1.1B Chat V1.0 AWQ Benchmarks

nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").
TinyLlama 1.1B Chat V1.0 AWQ (TheBloke/TinyLlama-1.1B-Chat-v1.0-AWQ)

TinyLlama 1.1B Chat V1.0 AWQ Parameters and Internals

Model Type 
tinyllama
Additional Notes 
This model can be used in open-source projects built upon the Llama architecture. It employs techniques like AWQ for efficient and fast inference.
Training Details 
Data Sources:
cerebras/SlimPajama-627B, bigcode/starcoderdata, OpenAssistant/oasst_top1_2023-08-25
Methodology:
finetuned using HF's Zephyr's training recipe and aligned with TRL's DPOTrainer
Hardware Used:
16 A100-40G GPUs
Model Architecture:
Llama 2
LLM NameTinyLlama 1.1B Chat V1.0 AWQ
Repository ๐Ÿค—https://huggingface.co/TheBloke/TinyLlama-1.1B-Chat-v1.0-AWQ 
Model NameTinyllama 1.1B Chat v1.0
Model CreatorTinyLlama
Base Model(s)  TinyLlama/TinyLlama-1.1B-Chat-v1.0   TinyLlama/TinyLlama-1.1B-Chat-v1.0
Model Size1.1b
Required VRAM0.8 GB
Updated2025-02-22
MaintainerTheBloke
Model Typetinyllama
Model Files  0.8 GB
Supported Languagesen
AWQ QuantizationYes
Quantization Typeawq
Model ArchitectureLlamaForCausalLM
Licenseapache-2.0
Context Length2048
Model Max Length2048
Transformers Version4.37.0.dev0
Tokenizer ClassLlamaTokenizer
Padding Token</s>
Vocabulary Size32000
Torch Data Typefloat16

Best Alternatives to TinyLlama 1.1B Chat V1.0 AWQ

Best Alternatives
Context / RAM
Downloads
Likes
Medicine Chat AWQ4K / 3.9 GB713
Finance Chat AWQ4K / 3.9 GB693
Law Chat AWQ4K / 3.9 GB683
AdaptLLM Finance Chat AWQ4K / 3.9 GB631
Gorilla Openfunctions V1 AWQ4K / 3.9 GB810
TinyLlama 1.1B Chat V1.0 AWQ2K / 0.8 GB780
Tinyllama AWQ Marlin2K / 0.8 GB431
TinyLlama 1.1B Chat V0.3 AWQ2K / 0.8 GB85403
Medicine LLM AWQ2K / 3.9 GB513
Finance LLM AWQ2K / 3.9 GB685
Note: green Score (e.g. "73.2") means that the model is better than TheBloke/TinyLlama-1.1B-Chat-v1.0-AWQ.

Rank the TinyLlama 1.1B Chat V1.0 AWQ Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 43470 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227