Llama 161M 100B by abacaj


Tags: Autotrain compatible, Endpoints compatible, Llama, ONNX, Region: US, Safetensors
Model Card on HF 🤗: https://huggingface.co/abacaj/llama-161M-100B

Llama 161M 100B Benchmarks

nn.n% — how the model compares to the reference models: Anthropic Claude 3.5 Sonnet ("so35"), GPT-4o ("gpt4o"), or GPT-4 ("gpt4").
Llama 161M 100B (abacaj/llama-161M-100B)

Llama 161M 100B Parameters and Internals

Model Type: pretrained, text generation
Use Cases:
  Primary Use Cases: base pretrained model requiring fine-tuning
  Limitations: requires further fine-tuning to be useful
Additional Notes: This is a base pretrained model and requires further fine-tuning to be useful.
Training Details:
  Data Sources: 80% code, 10% natural language, 10% instruction data
  Data Volume: 100B tokens
  Methodology: WSD (warmup-stable-decay) learning-rate schedule with 10% decay (see the sketch below)
  Training Time: 110 hours
  Hardware Used: 8x NVIDIA RTX 3090
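The training methodology is given only as a WSD schedule with 10% decay; the warmup length and peak learning rate are not listed. Below is a minimal Python sketch of a warmup-stable-decay schedule for reference; the warmup fraction and peak learning rate are illustrative assumptions, not values from the model card.

def wsd_lr(step, total_steps, peak_lr=3e-4, warmup_frac=0.01, decay_frac=0.10):
    """Warmup-stable-decay LR schedule: linear warmup, constant plateau,
    then linear decay over the final decay_frac (here 10%) of steps.
    peak_lr and warmup_frac are illustrative assumptions, not card values."""
    warmup_steps = max(int(total_steps * warmup_frac), 1)
    decay_start = int(total_steps * (1.0 - decay_frac))
    if step < warmup_steps:                 # linear warmup to peak_lr
        return peak_lr * step / warmup_steps
    if step < decay_start:                  # stable (constant) phase
        return peak_lr
    remaining = total_steps - step          # linear decay to zero
    return peak_lr * remaining / max(total_steps - decay_start, 1)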
LLM Name: Llama 161M 100B
Repository 🤗: https://huggingface.co/abacaj/llama-161M-100B
Model Size: 161m
Required VRAM: 0.3 GB
Updated: 2025-01-24
Maintainer: abacaj
Model Type: llama
Model Files: 0.3 GB
Model Architecture: LlamaForCausalLM
License: apache-2.0
Context Length: 1024
Model Max Length: 1024
Transformers Version: 4.40.2
Tokenizer Class: LlamaTokenizer
Padding Token: <unk>
Vocabulary Size: 32000
Torch Data Type: bfloat16
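These settings correspond to a standard Hugging Face Transformers setup: LlamaForCausalLM weights in bfloat16 (161M parameters × 2 bytes ≈ 0.32 GB, matching the listed VRAM) with a 1024-token context. A minimal loading sketch follows; the prompt and sampling settings are illustrative, and since this is a base pretrained model the output will be a raw continuation rather than a chat-style answer.

import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "abacaj/llama-161M-100B"

# LlamaTokenizer with a 32,000-token vocabulary and <unk> as the padding token
tokenizer = AutoTokenizer.from_pretrained(model_id)

# LlamaForCausalLM weights loaded in bfloat16 (~0.3 GB)
model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype=torch.bfloat16)
model = model.to("cuda" if torch.cuda.is_available() else "cpu")

prompt = "def fibonacci(n):"   # the pretraining mix is code-heavy, so a code prompt is a natural test
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(
    **inputs,
    max_new_tokens=128,        # keep prompt + output within the 1024-token context window
    do_sample=True,
    temperature=0.8,
)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))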

Best Alternatives to Llama 161M 100B

Best Alternatives                    Context / RAM      Downloads / Likes
Stockmark 100B                       4K / 191.9 GB      15333
Saily 100b                           4K / 235.5 GB      7427
Plankton 100M                        4K / 0.4 GB        1320
...lisLM 100M Layer Hidden Pruned    2K / 0.2 GB        8200
Reglu 100B                           2K / 2.6 GB        121
...ephyr Smol Llama 100M DPO Full    1K / 0.2 GB        133
...ephyr Smol Llama 100M DPO Full    1K / n/a           231
...yr Smol Llama 100M DPO 1 Epoch    1K / 0.2 GB        100
Babylama Hidden Sizes768             0.3K / 0.7 GB      80
Babyllama 100M 2024                  0.3K / 0.2 GB      24214
Note: a green score (e.g. "73.2") means that the model is better than abacaj/llama-161M-100B.

Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227