OpenChat 3.5 0106 9.86B 44Layers Appended By Pretergeek: Benchmarks, Features and Detailed Analysis. Insights on OpenChat 3.5 0106 9.86B 44Layers Appended.

Merged Model Arxiv:2401.02415 Autotrain compatible Base model:finetune:openchat/o... Base model:openchat/openchat-3... Conversational Endpoints compatible Mistral Model-index Region:us Safetensors Sharded Tensorflow

Model Card on HF 🤗: https://huggingface.co/Pretergeek/OpenChat-3.5-0106_9.86B_44Layers-Appended

OpenChat 3.5 0106 9.86B 44Layers Appended Benchmarks

MMLU Pro: 25.44

GPQA: 7.61

MUSR: 11.78

BBH: 24.06

IFEval: 59.61 vs 88 (so35)^-32.3%

MATH Lvl 5: 7.93

LLME Score: 0.26796

^nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").

What is the LLM Explorer Rank (Score)

OpenChat 3.5 0106 9.86B 44Layers Appended (Pretergeek/OpenChat-3.5-0106_9.86B_44Layers-Appended)

OpenChat 3.5 0106 9.86B 44Layers Appended Parameters and Internals

Model Type

text generation

Additional Notes

This model is created using [mergekit] passing through merge method and employing Block Expansion method enhancing the understanding of existing tasks.

Training Details

Methodology:

The model employs a variation of the Block Expansion method. New layers are added to the end of the model while original layers are frozen, allowing the new layers to be trained or fine-tuned separately, minimizing catastrophic forgetting.

Model Architecture:

Passthrough merge method with Block Expansion, adding new layers to the end of the model.

LLM Name	OpenChat 3.5 0106 9.86B 44Layers Appended
Repository 🤗	https://huggingface.co/Pretergeek/OpenChat-3.5-0106_9.86B_44Layers-Appended
Base Model(s)	openchat/openchat-3.5-0106 openchat/openchat-3.5-0106
Merged Model	Yes
Model Size	9.86b
Required VRAM	19.7 GB
Updated	2025-02-22
Maintainer	Pretergeek
Model Type	mistral
Model Files	4.9 GB: 1-of-4 4.9 GB: 2-of-4 4.9 GB: 3-of-4 5.0 GB: 4-of-4
Model Architecture	MistralForCausalLM
License	apache-2.0
Context Length	8192
Model Max Length	8192
Transformers Version	4.42.4
Tokenizer Class	LlamaTokenizer
Vocabulary Size	32002
Torch Data Type	bfloat16

Rank the OpenChat 3.5 0106 9.86B 44Layers Appended Capabilities

🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟

Instruction Following and Task Automation
Factuality and Completeness of Knowledge
Censorship and Alignment
Data Analysis and Insight Generation
Text Generation
Text Summarization and Feature Extraction
Code Generation
Multi-Language Support and Translation

What open-source LLMs or SLMs are you in search of? 43470 in total.

Email us: info@extractum.io. Our Privacy Policy | Terms and Conditions | Suggest an improvement.

Our Social Media →

Original data from HuggingFace, OpenCompass and various public git repos.

Release v20241227

Support LLM Explorer

OpenChat 3.5 0106 9.86B 44Layers Appended by Pretergeek

» All LLMs » Pretergeek » OpenChat 3.5 0106 9.86B 44Layers Appended URL Share it on

OpenChat 3.5 0106 9.86B 44Layers Appended Benchmarks

OpenChat 3.5 0106 9.86B 44Layers Appended Parameters and Internals

Rank the OpenChat 3.5 0106 9.86B 44Layers Appended Capabilities

What open-source LLMs or SLMs are you in search of? 43470 in total.