Yi 34B 200K DARE Megamerge V8 by brucethemoose

Tags: Merged Model, Arxiv:2306.01708, Arxiv:2311.03099, Autotrain compatible, En, Endpoints compatible, Llama, Model-index, Region:us, Safetensors, Sharded, Tensorflow, Yi

Yi 34B 200K DARE Megamerge V8 Benchmarks

Yi 34B 200K DARE Megamerge V8 (brucethemoose/Yi-34B-200K-DARE-megamerge-v8)

Yi 34B 200K DARE Megamerge V8 Parameters and Internals

Model Type 
text-generation
Additional Notes 
A merge of multiple Yi models made with the DARE TIES method, intended to improve performance while handling contexts of up to 200,000 tokens.
Input Output 
Input Format:
Orca-Vicuna template
Accepted Modalities:
text
Output Format:
text
Performance Tips:
Run at a lower temperature with MinP of 0.1 or higher, a slight repetition penalty, and possibly Mirostat with a low tau.
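
As a rough illustration of the Input Format and Performance Tips entries above, the sketch below builds a prompt and applies a MinP filter to a toy distribution. The Orca-Vicuna layout used here (SYSTEM, USER and ASSISTANT lines) is an assumption about the template, not something this page spells out, so check the model repository for the exact wording. The helper names build_prompt and min_p_filter are hypothetical, and the filter follows the common definition of MinP (keep tokens whose probability is at least min_p times the top token's probability), which may differ in detail from a given inference backend.

```python
def build_prompt(user_message, system_message=""):
    """Assumed Orca-Vicuna layout: SYSTEM / USER / ASSISTANT lines."""
    return (
        f"SYSTEM: {system_message}\n"
        f"USER: {user_message}\n"
        "ASSISTANT:"
    )


def min_p_filter(probs, min_p=0.1):
    """Keep tokens with prob >= min_p * (top prob), then renormalize."""
    threshold = min_p * max(probs.values())
    kept = {tok: p for tok, p in probs.items() if p >= threshold}
    total = sum(kept.values())
    return {tok: p / total for tok, p in kept.items()}


# Toy next-token distribution (made-up numbers, for illustration only).
toy_probs = {"the": 0.60, "a": 0.30, "zebra": 0.05, "qux": 0.05}
filtered = min_p_filter(toy_probs, min_p=0.1)

prompt = build_prompt(
    "Summarize this document.",
    system_message="You are a helpful assistant.",
)
```

With min_p set to 0.1 the cutoff scales with the model's confidence: when one token dominates, unlikely tokens are dropped, and when the distribution is flat, more candidates survive.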
LLM Name: Yi 34B 200K DARE Megamerge V8
Repository: https://huggingface.co/brucethemoose/Yi-34B-200K-DARE-megamerge-v8
Merged Model: Yes
Model Size: 34B
Required VRAM: 68.8 GB
Updated: 2024-12-21
Maintainer: brucethemoose
Model Type: llama
Model Files: 1-of-7 (9.8 GB), 2-of-7 (9.8 GB), 3-of-7 (9.8 GB), 4-of-7 (10.0 GB), 5-of-7 (9.8 GB), 6-of-7 (9.8 GB), 7-of-7 (9.8 GB)
Supported Languages: en
Model Architecture: LlamaForCausalLM
License: other
Context Length: 200000
Model Max Length: 200000
Transformers Version: 4.35.2
Tokenizer Class: LlamaTokenizer
Padding Token: <unk>
Vocabulary Size: 64002
Torch Data Type: bfloat16
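
The listed 68.8 GB requirement is consistent with the Model Files sizes and the bfloat16 data type above. A small sketch of that arithmetic, assuming the requirement is simply the sum of the seven weight files, that bfloat16 stores 2 bytes per parameter, and that 1 GB means 1e9 bytes (the implied parameter count is an estimate, not a figure from this page):

```python
# Shard sizes in GB as listed under Model Files (1-of-7 through 7-of-7).
shard_sizes_gb = [9.8, 9.8, 9.8, 10.0, 9.8, 9.8, 9.8]

# Total size of the weights, rounded to the page's one-decimal precision.
total_gb = round(sum(shard_sizes_gb), 1)

# bfloat16 uses 2 bytes per parameter (1 GB taken as 1e9 bytes: assumption).
BYTES_PER_PARAM_BF16 = 2
implied_params_billion = round(total_gb / BYTES_PER_PARAM_BF16, 1)

print(f"total weights: {total_gb} GB")
print(f"implied parameters: ~{implied_params_billion}B")
```

This covers the weights only; the KV cache for long contexts and other runtime overhead would add to the figure.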

Quantized Models of the Yi 34B 200K DARE Megamerge V8

Model                                Likes/Downloads/VRAM
...4B 200K DARE Megamerge V8 GGUF    135209 GB
...4B 200K DARE Megamerge V8 GPTQ    39718 GB
...34B 200K DARE Megamerge V8 AWQ    24219 GB

Best Alternatives to Yi 34B 200K DARE Megamerge V8

Best Alternatives            Context / RAM      Downloads/Likes
Casual Magnum 34B            195K / 68.8 GB     131
34B Beta                     195K / 69.2 GB     354862
Yi 34B 200K                  195K / 68.9 GB     4525317
Smaug 34B V0.1               195K / 69.2 GB     371860
Bagel Hermes 34B Slerp       195K / 68.9 GB     39861
Bagel 34B V0.2               195K / 68.7 GB     395439
Yi 34B 200K AEZAKMI V2       195K / 69.2 GB     121012
Smaug 34B V0.1 ExPO          195K / 69.2 GB     29660
Faro Yi 34B                  195K / 69.2 GB     37496
Bagel DPO 34B V0.5           195K / 68.7 GB     300817

Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241217