Poe 4B by FourOhFour

Tags: Autotrain compatible, Base model (finetune): IntervitensInc/Llama-3.1-Minitron-4B-Width-Base-chatml, Conversational, Endpoints compatible, Generated from trainer, Llama, Region: US, Safetensors, Sharded, TensorFlow
Model Card on HF 🤗: https://huggingface.co/FourOhFour/Poe_4B
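To try the model locally, the full repository (config, tokenizer, and both safetensors shards) can be fetched with the huggingface_hub client. A minimal sketch; the local directory is an illustrative choice, not something the model card specifies:

from huggingface_hub import snapshot_download

# Download config, tokenizer files, and both weight shards in one call.
# local_dir is an arbitrary example path; omit it to use the HF cache.
path = snapshot_download(repo_id="FourOhFour/Poe_4B", local_dir="./Poe_4B")
print(path)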

Poe 4B Benchmarks

Scores (nn.n%) show how the model compares to the reference models: Anthropic Claude 3.5 Sonnet ("so35"), GPT-4o ("gpt4o"), or GPT-4 ("gpt4"). No benchmark scores are listed yet for Poe 4B (FourOhFour/Poe_4B).

Poe 4B Parameters and Internals

Model Type: AutoModelForCausalLM
Training Details
Data Sources:
- PocketDoc/Dans-MemoryCore-CoreCurriculum-Small
- NewEden/Kalo-Opus-Instruct-22k-Refusal-Murdered
- Epiculous/Synthstruct-Gens-v1.1-Filtered-n-Cleaned
- NewEden/Gryphe-Sonnet-3.5-35k-Subset
- anthracite-org/stheno-filtered-v1.1
- Epiculous/SynthRP-Gens-v1.1-Filtered-n-Cleaned
- ResplendentAI/bluemoon
- openerotica/freedom-rp
- jeiku/Nitral_Medical_Dialog_Fixed
- MinervaAI/Aesir-Preview
- jeiku/jeikutxt
- ResplendentAI/Sissification_Hypno_1k
- ResplendentAI/theory_of_mind_fixed_output
- ResplendentAI/Synthetic_Soul_1k
Methodology: Fine-tuned with Axolotl v0.4.1 on ChatML- and Alpaca-format conversations at a sequence length of 8192, using multi-GPU distributed training (see the prompt-formatting sketch below).
Training Context Length: 8192
Hardware Used: multi-GPU (2 devices)
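Because the model was tuned on ChatML-style conversations, prompts are most naturally built with the tokenizer's chat template. A minimal sketch, assuming the repository ships a ChatML template in its tokenizer config (the conversation itself is illustrative):

from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("FourOhFour/Poe_4B")

# Illustrative conversation in the role/content schema ChatML expects
messages = [
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "Summarize 'The Raven' in two sentences."},
]

# Render the turns with the template stored in the tokenizer config;
# add_generation_prompt=True appends the assistant header so generation
# continues as a fresh assistant turn
prompt = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
print(prompt)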
LLM Name: Poe 4B
Repository 🤗: https://huggingface.co/FourOhFour/Poe_4B
Base Model(s): IntervitensInc/Llama-3.1-Minitron-4B-Width-Base-chatml
Model Size: 4B
Required VRAM: 9 GB
Updated: 2025-02-22
Maintainer: FourOhFour
Model Type: llama
Model Files: 2 safetensors shards (5.0 GB + 4.0 GB)
Model Architecture: LlamaForCausalLM
License: other
Context Length: 131072
Model Max Length: 131072
Transformers Version: 4.45.0.dev0
Tokenizer Class: PreTrainedTokenizerFast
Padding Token: <|finetune_right_pad_id|>
Vocabulary Size: 128256
Torch Data Type: bfloat16
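With the listed architecture (LlamaForCausalLM), stored dtype (bfloat16), and two-shard safetensors layout, loading follows the standard transformers pattern. A minimal sketch, assuming a GPU with around the listed 9 GB of VRAM (device_map="auto" also requires the accelerate package):

import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# from_pretrained resolves the two shards (5.0 GB + 4.0 GB) via the shard
# index; bfloat16 matches the checkpoint's stored dtype
model = AutoModelForCausalLM.from_pretrained(
    "FourOhFour/Poe_4B",
    torch_dtype=torch.bfloat16,
    device_map="auto",  # place weights on the available GPU(s)
)
tokenizer = AutoTokenizer.from_pretrained("FourOhFour/Poe_4B")

inputs = tokenizer("Once upon a midnight dreary", return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))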

Best Alternatives to Poe 4B

Best Alternatives                        Context / RAM     Downloads  Likes
SJT 4B                                   146K / 7.6 GB            17      0
Loxa 4B                                  128K / 16 GB             64      0
Aura 4B                                  128K / 9 GB             271      0
Llama 3.1 Minitron 4B Depth Base         128K / 9.1 GB          2853     21
Nemotron W 4b MagLight 0.1               128K / 9.2 GB            16      2
Llama 3.1 Minitron 4B Width Base         128K / 9 GB            3382    188
...5 MINI 4B SFTxORPO HESSIAN AI         128K / 7.7 GB            16      0
...5 MINI 4B ORPOxSFT HESSIAN AI         128K / 7.7 GB            15      0
...5 MINI 4B SFTxORPO HESSIAN AI         128K / 7.7 GB            13      0

Original data from Hugging Face, OpenCompass, and various public git repos.
Release v20241227