Yi 34B 200K AEZAKMI RAW 2901 by adamo1139


Tags: Autotrain compatible, Dataset: adamo1139/aezakmi_v2, Dataset: adamo1139/rawrr_v1, Endpoints compatible, Llama, Region: us, Safetensors, Sharded, Tensorflow


Yi 34B 200K AEZAKMI RAW 2901 (adamo1139/Yi-34B-200K-AEZAKMI-RAW-2901)

Yi 34B 200K AEZAKMI RAW 2901 Parameters and Internals

Model Type: chat model
Use Cases
Applications: chat
Primary Use Cases: chat assistance
Limitations: not good at math or riddles; known issues with repetition
Considerations: a repetition penalty is recommended to avoid repetition issues
Additional Notes: Experimental model, not final. It has some issues and has not been tested much yet.
Training Details
Data Sources: adamo1139/AEZAKMI_v2, adamo1139/rawrr_v1
Methodology: fine-tuned via DPO and SFT using Unsloth (see the sketch below)
Context Length: 200000
Training Time: DPO: 6 hours, SFT: 25 hours
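
The card only names the method (DPO and SFT with Unsloth) and the datasets, so the following is a minimal, hedged sketch of what the DPO stage could look like with Unsloth plus TRL. The base checkpoint, sequence length, LoRA settings, DPO hyperparameters, and the assumption that rawrr_v1 supplies the preference pairs are all illustrative, not published details; TRL's DPOTrainer signature also varies between versions.

```python
# Hedged sketch of a DPO stage with Unsloth + TRL.
# Assumptions (not from the card): base model, max_seq_length, LoRA settings,
# DPO hyperparameters, and that adamo1139/rawrr_v1 holds the preference pairs.
from datasets import load_dataset
from trl import DPOConfig, DPOTrainer
from unsloth import FastLanguageModel

# Load the base model with Unsloth's patched loader.
model, tokenizer = FastLanguageModel.from_pretrained(
    model_name="01-ai/Yi-34B-200K",  # assumed base checkpoint
    max_seq_length=4096,             # assumed training sequence length
    load_in_4bit=True,
)

# Attach LoRA adapters so only a small set of weights is trained.
model = FastLanguageModel.get_peft_model(
    model,
    r=16,
    lora_alpha=16,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
)

# Preference dataset; TRL's DPOTrainer expects prompt/chosen/rejected columns.
train_dataset = load_dataset("adamo1139/rawrr_v1", split="train")

trainer = DPOTrainer(
    model=model,
    ref_model=None,  # with PEFT adapters, TRL uses the frozen base as reference
    args=DPOConfig(
        output_dir="yi-34b-dpo",
        beta=0.1,                        # assumed DPO temperature
        per_device_train_batch_size=1,
        gradient_accumulation_steps=16,
    ),
    train_dataset=train_dataset,
    tokenizer=tokenizer,  # older TRL arg name; newer versions use processing_class
)
trainer.train()
```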
Input Output
Input Format: standard ChatML format (see the prompt example below)
Accepted Modalities: text
Performance Tips: a repetition penalty around 1.05 is recommended.
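
Since the input format is standard ChatML, a prompt is just the `<|im_start|>`/`<|im_end|>` turn structure rendered as a string. A minimal sketch follows; the system message text is illustrative, not taken from the card.

```python
# Build a single-turn ChatML prompt; the system message is illustrative.
def build_chatml_prompt(system: str, user: str) -> str:
    return (
        f"<|im_start|>system\n{system}<|im_end|>\n"
        f"<|im_start|>user\n{user}<|im_end|>\n"
        f"<|im_start|>assistant\n"  # generation continues from here
    )

prompt = build_chatml_prompt(
    "You are a helpful assistant.",
    "Explain what a repetition penalty does during decoding.",
)
print(prompt)
```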
LLM Name: Yi 34B 200K AEZAKMI RAW 2901
Repository 🤗: https://huggingface.co/adamo1139/Yi-34B-200K-AEZAKMI-RAW-2901
Model Size: 34b
Required VRAM: 69.2 GB
Updated: 2025-02-22
Maintainer: adamo1139
Model Type: llama
Model Files: 4.8 GB (1-of-15), 4.8 GB (2-of-15), 5.0 GB (3-of-15), 4.8 GB (4-of-15), 4.8 GB (5-of-15), 5.0 GB (6-of-15), 4.8 GB (7-of-15), 4.8 GB (8-of-15), 5.0 GB (9-of-15), 4.8 GB (10-of-15), 4.8 GB (11-of-15), 5.0 GB (12-of-15), 4.8 GB (13-of-15), 4.8 GB (14-of-15), 1.2 GB (15-of-15)
Model Architecture: LlamaForCausalLM
License: apache-2.0
Context Length: 200000
Model Max Length: 200000
Transformers Version: 4.36.2
Tokenizer Class: LlamaTokenizer
Padding Token: <unk>
Vocabulary Size: 64000
Torch Data Type: float16
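
Putting the table's values together (LlamaForCausalLM architecture, float16 weights, sharded safetensors, ~69 GB footprint), a hedged transformers loading sketch might look like the following. device_map="auto" and the generation settings other than the card's recommended 1.05 repetition penalty are assumptions.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "adamo1139/Yi-34B-200K-AEZAKMI-RAW-2901"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.float16,  # matches the card's Torch Data Type
    device_map="auto",          # assumption: spread the ~69 GB across GPUs
)

# ChatML-formatted prompt, per the card's input format.
prompt = (
    "<|im_start|>system\nYou are a helpful assistant.<|im_end|>\n"
    "<|im_start|>user\nWrite a haiku about long context windows.<|im_end|>\n"
    "<|im_start|>assistant\n"
)

inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(
    **inputs,
    max_new_tokens=256,        # assumed value
    repetition_penalty=1.05,   # recommended on the card to curb repetition
)
print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True))
```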

Best Alternatives to Yi 34B 200K AEZAKMI RAW 2901

Best Alternatives | Context / RAM | Downloads | Likes
Casual Magnum 34B | 195K / 68.8 GB | 14 | 1
34B Beta | 195K / 69.2 GB | 3729 | 63
Bagel 34B V0.2 | 195K / 68.7 GB | 6790 | 40
Bagel Hermes 34B Slerp | 195K / 68.9 GB | 3894 | 1
Smaug 34B V0.1 | 195K / 69.2 GB | 3672 | 60
Yi 34B 200K | 195K / 68.9 GB | 6015 | 318
Yi 34B 200K AEZAKMI V2 | 195K / 69.2 GB | 2007 | 12
Faro Yi 34B | 195K / 69.2 GB | 3612 | 6
Smaug 34B V0.1 ExPO | 195K / 69.2 GB | 1972 | 0
Mergekit Slerp Anaazls | 195K / 69.2 GB | 9 | 0

Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227