Yi 34B 200K XLCTX RAW ORPO 0805 GaLore by adamo1139: Benchmarks, Features and Detailed Analysis

Tags: Autotrain compatible, Endpoints compatible, Llama, Region: us, Safetensors, Sharded, Tensorflow

Model Card on HF 🤗: https://huggingface.co/adamo1139/Yi-34B-200K-XLCTX-RAW-ORPO-0805-GaLore


Yi 34B 200K XLCTX RAW ORPO 0805 GaLore Parameters and Internals

Use Cases

Applications:

Serves as a base for further finetuning

Primary Use Cases:

Further finetuning for specialized tasks

Considerations:

The fine-tune is intended to reduce behaviors inherited from training on AI-generated content.

Additional Notes

The model is not intended for chat applications.

Training Details

Data Sources:

adamo1139/rawrr_v2_2_stage1

Methodology:

Fine-tuned on the dataset above using ORPO and GaLore, with the weights loaded in 4-bit via bitsandbytes (bnb).
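
For readers unfamiliar with ORPO, the objective it optimizes can be sketched in a few lines. This is a minimal illustration of the published ORPO loss (a supervised term plus a weighted odds-ratio term), not the maintainer's training script. The function names, the example log-probabilities, and the default weight `lam=0.1` are assumptions chosen for illustration; GaLore and 4-bit loading are not shown.

```python
import math


def log_odds(avg_logprob: float) -> float:
    """Return log(p / (1 - p)), where p = exp(avg_logprob).

    avg_logprob is the length-normalised log-likelihood of a response
    and must be negative so that p < 1.
    """
    p = math.exp(avg_logprob)
    return avg_logprob - math.log1p(-p)


def orpo_loss(chosen_avg_logprob: float,
              rejected_avg_logprob: float,
              lam: float = 0.1) -> float:
    """ORPO objective for one preference pair (illustrative sketch).

    lam is a hypothetical weight for the odds-ratio term.
    """
    # Supervised term: negative log-likelihood of the chosen response.
    nll = -chosen_avg_logprob
    # Log odds ratio between the chosen and the rejected response.
    log_odds_ratio = log_odds(chosen_avg_logprob) - log_odds(rejected_avg_logprob)
    # -log(sigmoid(x)) written as log(1 + exp(-x)).
    odds_ratio_term = math.log1p(math.exp(-log_odds_ratio))
    return nll + lam * odds_ratio_term


# The loss is lower when the chosen response is the more likely one.
good = orpo_loss(chosen_avg_logprob=-0.5, rejected_avg_logprob=-2.0)
bad = orpo_loss(chosen_avg_logprob=-2.0, rejected_avg_logprob=-0.5)
print(good < bad)
```

In an actual run the two log-probabilities would come from the model's token-level outputs on a chosen/rejected pair from the preference dataset.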

LLM Name	Yi 34B 200K XLCTX RAW ORPO 0805 GaLore
Repository 🤗	https://huggingface.co/adamo1139/Yi-34B-200K-XLCTX-RAW-ORPO-0805-GaLore
Model Size	34B
Required VRAM	69.2 GB
Updated	2025-02-22
Maintainer	adamo1139
Model Type	llama
Model Files	15 shards: 4.8 GB (1-of-15), 4.8 GB (2-of-15), 5.0 GB (3-of-15), 4.8 GB (4-of-15), 4.8 GB (5-of-15), 5.0 GB (6-of-15), 4.8 GB (7-of-15), 4.8 GB (8-of-15), 5.0 GB (9-of-15), 4.8 GB (10-of-15), 4.8 GB (11-of-15), 5.0 GB (12-of-15), 4.8 GB (13-of-15), 4.8 GB (14-of-15), 1.2 GB (15-of-15)
Model Architecture	LlamaForCausalLM
License	apache-2.0
Context Length	200000
Model Max Length	200000
Transformers Version	4.36.2
Tokenizer Class	LlamaTokenizer
Padding Token	<unk>
Vocabulary Size	64000
Torch Data Type	float16
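
The "Required VRAM" figure in the table can be cross-checked against the listed shard sizes. The short sketch below sums the 15 shard sizes and, assuming 2 bytes per parameter for float16 and treating 1 GB as 10^9 bytes (the page does not say which convention it uses), derives a rough parameter count. The match suggests the figure covers the weights only; memory for the KV cache and activations at long context is presumably extra.

```python
# Shard sizes in GB, copied from the "Model Files" row (rounded on the page).
shard_sizes_gb = [4.8, 4.8, 5.0, 4.8, 4.8, 5.0, 4.8, 4.8,
                  5.0, 4.8, 4.8, 5.0, 4.8, 4.8, 1.2]

# Total on-disk size of the float16 weights.
total_gb = round(sum(shard_sizes_gb), 1)

# float16 stores each parameter in 2 bytes (assumption: 1 GB = 1e9 bytes).
BYTES_PER_PARAM_FP16 = 2
approx_params_billion = total_gb / BYTES_PER_PARAM_FP16

print(f"{len(shard_sizes_gb)} shards, {total_gb} GB total")
print(f"roughly {approx_params_billion:.1f}B parameters at float16")
```

The total comes to 69.2 GB, the same value the table lists as Required VRAM.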

Best Alternatives to Yi 34B 200K XLCTX RAW ORPO 0805 GaLore

Best Alternatives	Context / RAM	Downloads	Likes
Casual Magnum 34B	195K / 68.8 GB	14	1
34B Beta	195K / 69.2 GB	3729	63
Bagel 34B V0.2	195K / 68.7 GB	6790	40
Bagel Hermes 34B Slerp	195K / 68.9 GB	3894	1
Smaug 34B V0.1	195K / 69.2 GB	3672	60
Yi 34B 200K	195K / 68.9 GB	6015	318
Yi 34B 200K AEZAKMI V2	195K / 69.2 GB	2007	12
Faro Yi 34B	195K / 69.2 GB	3612	6
Smaug 34B V0.1 ExPO	195K / 69.2 GB	1972	0
Mergekit Slerp Anaazls	195K / 69.2 GB	9	0

Original data from HuggingFace, OpenCompass and various public git repos.
