LLM Explorer: A Curated Large Language Model Directory and Analytics

Tukan 1.1B Chat Reasoning Sft COLA by alexredna


Tags: Autotrain compatible, Base model:tinyllama/tinyllama..., Conversational, Dataset:generator, Endpoints compatible, Generated from trainer, License:apache-2.0, Llama, Region:us, Safetensors, Sft, Tensorboard, Trl

Tukan 1.1B Chat Reasoning Sft COLA (alexredna/Tukan-1.1B-Chat-reasoning-sft-COLA)

Best Alternatives to Tukan 1.1B Chat Reasoning Sft COLA

Model | HF Rank | Context / RAM | Downloads and Likes (digits run together in source)
Finance Chat AWQ | 46.7 | 4K / 3.9 GB | 173
Finance Chat GPTQ | 46.7 | 4K / 3.9 GB | 33
Medicine Chat GPTQ | 46.5 | 4K / 3.9 GB | 235
Medicine Chat AWQ | 46.5 | 4K / 3.9 GB | 4203
Law Chat GPTQ | 46.4 | 4K / 3.9 GB | 234
Law Chat AWQ | 46.4 | 4K / 3.9 GB | 332
Tinyllama 3T 64K JSONExtractor | - | 64K / 2.2 GB | 250
570M Tinyllama | - | 32K / 1.1 GB | 220
TinyLlama 1.1B 32K | - | 32K / 2.2 GB | 49425
TinyLlama 1.1B 32K Instruct | - | 32K / 2.2 GB | 6996

Tukan 1.1B Chat Reasoning Sft COLA Parameters and Internals

LLM Name: Tukan 1.1B Chat Reasoning Sft COLA
Repository: alexredna/Tukan-1.1B-Chat-reasoning-sft-COLA (Hugging Face)
Base Model(s): TinyLlama 1.1B Chat V1.0 (TinyLlama/TinyLlama-1.1B-Chat-v1.0)
Model Size: 1.1B
Required VRAM: 4.4 GB
Updated: 2024-02-28
Maintainer: alexredna
Model Type: llama
Model Files: 4.4 GB, 0.0 GB
Model Architecture: LlamaForCausalLM
License: apache-2.0
Context Length: 2048
Model Max Length: 2048
Transformers Version: 4.36.2
Tokenizer Class: LlamaTokenizer
Padding Token: </s>
Vocabulary Size: 32000
Initializer Range: 0.02
Torch Data Type: float32
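
The listed Required VRAM of 4.4 GB is consistent with storing roughly 1.1 billion parameters in float32, at 4 bytes per parameter. The following is a minimal sketch of that arithmetic, not the directory's own method. It assumes the rounded parameter count from the listing and counts weights only (activations and the KV cache are excluded). The function name weight_memory_gb and the constant LISTED_PARAMS are hypothetical, introduced for illustration.

```python
# Bytes per parameter for common tensor dtypes (standard sizes).
BYTES_PER_PARAM = {"float32": 4, "float16": 2, "bfloat16": 2, "int8": 1}


def weight_memory_gb(num_params: float, dtype: str = "float32") -> float:
    """Approximate size of the weights alone, in decimal gigabytes."""
    return num_params * BYTES_PER_PARAM[dtype] / 1e9


# Assumption: 1.1e9 is the rounded "Model Size" from the listing,
# not an exact parameter count.
LISTED_PARAMS = 1.1e9

if __name__ == "__main__":
    for dtype in ("float32", "float16", "int8"):
        print(f"{dtype}: {weight_memory_gb(LISTED_PARAMS, dtype):.1f} GB")
```

Under these assumptions the float32 estimate comes to about 4.4 GB, matching the listed value. Loading the same weights in a 16-bit type would roughly halve that figure; actual memory use at inference time will be higher than the weights-only estimate.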
Original data from HuggingFace, OpenCompass and various public git repos.