Baby Llama by bbunzeck

Tags: Autotrain compatible, Dataset:nilq/babylm-10m, En, Endpoints compatible, Llama, Pytorch, Region:us
Model Card on HF: https://huggingface.co/bbunzeck/baby_llama

Baby Llama Parameters and Internals

Model Type 
autoregressive language model
Additional Notes 
The model is part of a series of small language models. Other models in the series have variations in terms of training data volume and parameter sizes.
Supported Languages 
en (unknown)
Training Details 
Data Sources:
BabyLM data
Data Volume:
10M tokens
Context Length:
128
Model Architecture:
unknown
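
The 128-token context length listed above means longer inputs have to be truncated or split before they reach the model. Below is a minimal sketch of one way to split a sequence of token ids into windows of at most 128 tokens. The helper name split_into_windows and the non-overlapping windowing strategy are illustrative assumptions, not something taken from the model card.

```python
from typing import List

# Context length as stated in the listing; everything else here is illustrative.
CONTEXT_LENGTH = 128


def split_into_windows(token_ids: List[int], max_len: int = CONTEXT_LENGTH) -> List[List[int]]:
    """Split token ids into consecutive, non-overlapping windows of at most max_len tokens."""
    if max_len <= 0:
        raise ValueError("max_len must be positive")
    return [token_ids[i:i + max_len] for i in range(0, len(token_ids), max_len)]


# Hypothetical input: 300 token ids standing in for a tokenized document.
windows = split_into_windows(list(range(300)))
print([len(w) for w in windows])  # [128, 128, 44]
```

Non-overlapping windows are the simplest choice; an overlapping stride may preserve more context across boundaries, at the cost of processing some tokens twice.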
LLM Name: Baby Llama
Repository: https://huggingface.co/bbunzeck/baby_llama
Model Size: 10m
Required VRAM: 0 GB
Updated: 2025-02-22
Maintainer: bbunzeck
Model Type: llama
Model Files: 0.0 GB, 0.0 GB
Supported Languages: en
Model Architecture: LlamaForCausalLM
Context Length: 128
Model Max Length: 128
Transformers Version: 4.32.1
Tokenizer Class: PreTrainedTokenizerFast
Padding Token: <pad>
Vocabulary Size: 16000
Torch Data Type: float32
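
The near-zero VRAM and file sizes in the listing follow from the small parameter count and the float32 data type. The sketch below is a rough back-of-the-envelope estimate of the weight footprint. It assumes that "10m" means roughly 10 million parameters, and it ignores activations, optimizer state, and file-format overhead. The function name estimate_weight_bytes is hypothetical.

```python
# Bytes needed to store one parameter for a few common torch dtypes.
BYTES_PER_PARAM = {"float32": 4, "float16": 2, "bfloat16": 2}


def estimate_weight_bytes(num_params: int, dtype: str = "float32") -> int:
    """Estimate the memory needed to hold the model weights alone."""
    return num_params * BYTES_PER_PARAM[dtype]


# Assumption: "Model Size 10m" is read as about 10 million parameters.
params = 10_000_000
size_gb = estimate_weight_bytes(params, "float32") / 1e9
print(f"{size_gb:.2f} GB")  # 0.04 GB, which displays as 0.0 GB at one decimal place
```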

Best Alternatives to Baby Llama

Best Alternatives
Context / RAM
Downloads
Likes
...enbuddy Falcon3 10B V24.2 131K128K / 20.7 GB60
HelpingAI2.5 10B128K / 20.5 GB122864
Priya 10B128K / 20.5 GB1181
HelpingAI2.5 10B128K / 20.5 GB662
L3.1 Mochav2 10B128K / 42.8 GB210
HELVETE X128K / 20.5 GB924
Yarn Solar 10B 64K64K / 21.4 GB547715
StoryTeller 10B 2e V258K / 21.4 GB41
Falcon3 10B Instruct32K / 20.5 GB3217093
Virtuoso Lite32K / 20.5 GB203933

Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227