TinyLLama V0 by Maykeye

 ยป  All LLMs  ยป  Maykeye  ยป  TinyLLama V0   URL Share it on

  Autotrain compatible   Endpoints compatible   Llama   Pytorch   Region:us   Safetensors
Model Card on HF ๐Ÿค—: https://huggingface.co/Maykeye/TinyLLama-v0 

TinyLLama V0 Benchmarks

nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").
TinyLLama V0 (Maykeye/TinyLLama-v0)

TinyLLama V0 Parameters and Internals

Model Type 
Causal Language Model
Additional Notes 
Extremely PoC version with simplified training and caching mechanics. Backup and validation scripts provided. Training data constraints exist due to context length handling.
Training Details 
Data Sources:
TinyStoriesV2-GPT4-train.txt, TinyStoriesV2-GPT4-valid.txt
Methodology:
Training truncates stories longer than context size without sliding window. Uses open_llama_3b tokenizer.
Training Time:
9 hours total, 3 hours per epoch
Hardware Used:
40GB A100 GPU with ~30GB VRAM
Model Architecture:
Llama
Release Notes 
Version:
1.0
Notes:
First version PoC with basic function to train and validate with specified dataset files.
LLM NameTinyLLama V0
Repository ๐Ÿค—https://huggingface.co/Maykeye/TinyLLama-v0 
Model Size4.6m
Required VRAM0 GB
Updated2024-12-21
MaintainerMaykeye
Model Typellama
Model Files  0.0 GB   0.0 GB
Model ArchitectureLlamaForCausalLM
Licenseapache-2.0
Context Length2048
Model Max Length2048
Transformers Version4.30.2
Tokenizer ClassLlamaTokenizer
Beginning of Sentence Token<s>
End of Sentence Token</s>
Unk Token<unk>
Vocabulary Size32000
Torch Data Typebfloat16

Best Alternatives to TinyLLama V0

Best Alternatives
Context / RAM
Downloads
Likes
SimpleLlamaSentences2K / 0 GB190
UniversalNER TinyLLama2K / 0 GB211
Q4 Llama 38K / 6.1 GB120
BNB 4Bit Llama3 Finetune8K / 6.1 GB130
Llama3 Finetune 4bit8K / 6.1 GB00

Rank the TinyLLama V0 Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 40013 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241217