LLM Explorer: A Curated Large Language Model Directory and Analytics  // 

Tinyllama 1B AWQ Gemv by casperhansen

What open-source LLMs or SLMs are you in search of? 18857 in total.

 ยป  All LLMs  ยป  casperhansen  ยป  Tinyllama 1B AWQ Gemv   URL Share it on

  4-bit   Autotrain compatible   Awq   Endpoints compatible   License:apache-2.0   Llama   Pytorch   Quantized   Region:us

Rank the Tinyllama 1B AWQ Gemv Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  
Tinyllama 1B AWQ Gemv (casperhansen/tinyllama-1b-awq-gemv)

Best Alternatives to Tinyllama 1B AWQ Gemv

Best Alternatives
HF Rank
Tinyllama 2 1B Miniguanaco AWQ2K / 0.8 GB42
Tinyllama 1B AWQ2K / 0.8 GB90
...les 32K Bf16 V1.4.0bpw H6 EXL232K / 18.1 GB52
...les 32K Bf16 V1.5.0bpw H6 EXL232K / 22.2 GB22
...les 32K Bf16 V1.6.0bpw H6 EXL232K / 26.5 GB51
...1B Chat V1.0 AQLM 2Bit 1x16 Hf2K / 0.7 GB220
LWM Text Chat 1M GPTQ1024K / 4.3 GB171
LWM Text Chat 1M1024K / 13.5 GB443155
LWM Text 1M1024K / 13.5 GB4620
JOSIE 1M Base1024K / 13.5 GB11

Tinyllama 1B AWQ Gemv Parameters and Internals

LLM NameTinyllama 1B AWQ Gemv
RepositoryOpen on ๐Ÿค— 
Model Size1b
Required VRAM0.8 GB
Model Typellama
Model Files  0.8 GB
AWQ QuantizationYes
Quantization Typeawq
Model ArchitectureLlamaForCausalLM
Context Length2048
Model Max Length2048
Transformers Version4.33.2
Tokenizer ClassLlamaTokenizer
Beginning of Sentence Token<s>
End of Sentence Token</s>
Unk Token<unk>
Vocabulary Size32000
Initializer Range0.02
Torch Data Typefloat16
Original data from HuggingFace, OpenCompass and various public git repos.
Release v2024022003