Cosmo 1B By HuggingFaceTB: Benchmarks, Features and Detailed Analysis. Insights on Cosmo 1B.

Cosmopedia, non-synthetic sources like code-python-0.60-to-1.00 and web-0.50-to-1.00 subsets of AutoMathText, The Stack's Jupyter Notebooks, ultrachat

Data Volume:

180 billion tokens

Methodology:

Used UltraChat data formatted for LlaMa models, to avoid post pre-training instruction-tuning, and upsampled data from seed sources.

Context Length:

2048

Training Time:

15 hours

Hardware Used:

160 H100 GPUs

Model Architecture:

Llama-2

Input Output

Input Format:

Chat and regular text prompts

Accepted Modalities:

text

Output Format:

Generated text sequence

Performance Tips:

Use chat format for better instructional adherence.

LLM Name	Cosmo 1B
Repository 🤗	https://huggingface.co/HuggingFaceTB/cosmo-1b
Model Size	1b
Required VRAM	7 GB
Updated	2025-02-22
Maintainer	HuggingFaceTB
Model Type	llama
Model Files	5.0 GB: 1-of-2 2.0 GB: 2-of-2
Supported Languages	en
Model Architecture	LlamaForCausalLM
License	apache-2.0
Context Length	2048
Model Max Length	2048
Transformers Version	4.37.2
Tokenizer Class	LlamaTokenizer
Vocabulary Size	32000
Torch Data Type	bfloat16

Best Alternatives to Cosmo 1B

Best Alternatives	Context / RAM	Downloads	Likes
LWM Text Chat 1M	1024K / 13.5 GB	2084	175
LWM Text 1M	1024K / 13.5 GB	491	28
JOSIE 1M Base	1024K / 13.5 GB	12	1
JOSIE 1M Base	1024K / 13.5 GB	6	1
Llama 3.2 1B	128K / 2.5 GB	10850287	1584
Llama 3.2 1B Instruct	128K / 2.5 GB	1716586	773
Llama 3.2 1B Instruct	128K / 2.5 GB	112714	63
MiniThinky V2 1B Llama 3.2	128K / 4.9 GB	7294	38
Lancer 1 1B Instruct	128K / 2.5 GB	110	2
Llama Express.1 Math	128K / 2.5 GB	405	7

Note: green Score (e.g. "73.2") means that the model is better than HuggingFaceTB/cosmo-1b.

Rank the Cosmo 1B Capabilities

🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟

Instruction Following and Task Automation
Factuality and Completeness of Knowledge
Censorship and Alignment
Data Analysis and Insight Generation
Text Generation
Text Summarization and Feature Extraction
Code Generation
Multi-Language Support and Translation

What open-source LLMs or SLMs are you in search of? 43470 in total.

Email us: info@extractum.io. Our Privacy Policy | Terms and Conditions | Suggest an improvement.

Our Social Media →

Original data from HuggingFace, OpenCompass and various public git repos.

Release v20241227

Support LLM Explorer

Cosmo 1B by HuggingFaceTB

» All LLMs » HuggingFaceTB » Cosmo 1B URL Share it on

Cosmo 1B Benchmarks

Cosmo 1B Parameters and Internals

Best Alternatives to Cosmo 1B

Rank the Cosmo 1B Capabilities

What open-source LLMs or SLMs are you in search of? 43470 in total.