Baby Llama By bbunzeck: Benchmarks, Features and Detailed Analysis. Insights on Baby Llama.

Autotrain compatible Dataset:nilq/babylm-10m En Endpoints compatible Llama Pytorch Region:us

Model Card on HF 🤗: https://huggingface.co/bbunzeck/baby_llama

Baby Llama Benchmarks

^nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").

What is the LLM Explorer Rank (Score)

Baby Llama Parameters and Internals

Model Type

autoregressive language model

Additional Notes

The model is part of a series of small language models. Other models in the series have variations in terms of training data volume and parameter sizes.

Supported Languages

en (unknown)

Training Details

Data Sources:

BabyLM data

Data Volume:

10M tokens

Context Length:

128

Model Architecture:

unknown

LLM Name	Baby Llama
Repository 🤗	https://huggingface.co/bbunzeck/baby_llama
Model Size	10m
Required VRAM	0 GB
Updated	2025-02-22
Maintainer	bbunzeck
Model Type	llama
Model Files	0.0 GB 0.0 GB
Supported Languages	en
Model Architecture	LlamaForCausalLM
Context Length	128
Model Max Length	128
Transformers Version	4.32.1
Tokenizer Class	PreTrainedTokenizerFast
Padding Token	<pad>
Vocabulary Size	16000
Torch Data Type	float32

Best Alternatives to Baby Llama

Best Alternatives	Context / RAM	Downloads	Likes
...enbuddy Falcon3 10B V24.2 131K	128K / 20.7 GB	6	0
HelpingAI2.5 10B	128K / 20.5 GB	12286	4
Priya 10B	128K / 20.5 GB	118	1
HelpingAI2.5 10B	128K / 20.5 GB	66	2
L3.1 Mochav2 10B	128K / 42.8 GB	21	0
HELVETE X	128K / 20.5 GB	92	4
Yarn Solar 10B 64K	64K / 21.4 GB	5477	15
StoryTeller 10B 2e V2	58K / 21.4 GB	4	1
Falcon3 10B Instruct	32K / 20.5 GB	32170	93
Virtuoso Lite	32K / 20.5 GB	2039	33

Note: green Score (e.g. "73.2") means that the model is better than bbunzeck/baby_llama.

Rank the Baby Llama Capabilities

🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟

Instruction Following and Task Automation
Factuality and Completeness of Knowledge
Censorship and Alignment
Data Analysis and Insight Generation
Text Generation
Text Summarization and Feature Extraction
Code Generation
Multi-Language Support and Translation

What open-source LLMs or SLMs are you in search of? 43470 in total.

Email us: info@extractum.io. Our Privacy Policy | Terms and Conditions | Suggest an improvement.

Our Social Media →

Original data from HuggingFace, OpenCompass and various public git repos.

Release v20241227

Support LLM Explorer

Baby Llama by bbunzeck

» All LLMs » bbunzeck » Baby Llama URL Share it on

Baby Llama Benchmarks

Baby Llama Parameters and Internals

Best Alternatives to Baby Llama

Rank the Baby Llama Capabilities

What open-source LLMs or SLMs are you in search of? 43470 in total.