SG Raccoon Yi 55B 200K by mlinmg

 ยป  All LLMs  ยป  mlinmg  ยป  SG Raccoon Yi 55B 200K   URL Share it on

  Autotrain compatible   Conversational   Endpoints compatible   Llama   Region:us   Safetensors   Sharded   Tensorflow

SG Raccoon Yi 55B 200K Benchmarks

nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").
SG Raccoon Yi 55B 200K (mlinmg/SG-Raccoon-Yi-55B-200k)

SG Raccoon Yi 55B 200K Parameters and Internals

Model Type 
Auto-regressive Causal LM
Additional Notes 
This is a retired model, merged with Capybara due to issues with a missing eos_token.
Supported Languages 
en (English)
Input Output 
Input Format:
SYSTEM: USER: ASSISTANT:
Performance Tips:
Try disabling the BOS token and/or running a lower temperature with MinP for better output. Yi tends to run "hot" by default. Add ~~ as an additional stopping condition if needed.
LLM NameSG Raccoon Yi 55B 200K
Repository ๐Ÿค—https://huggingface.co/mlinmg/SG-Raccoon-Yi-55B-200k 
Model Size55b
Required VRAM111.4 GB
Updated2025-02-22
Maintainermlinmg
Model Typellama
Model Files  10.0 GB: 1-of-12   9.9 GB: 2-of-12   9.8 GB: 3-of-12   9.8 GB: 4-of-12   9.9 GB: 5-of-12   9.9 GB: 6-of-12   9.8 GB: 7-of-12   9.9 GB: 8-of-12   9.9 GB: 9-of-12   9.8 GB: 10-of-12   9.8 GB: 11-of-12   2.9 GB: 12-of-12
Model ArchitectureLlamaForCausalLM
Licenseother
Context Length200000
Model Max Length200000
Transformers Version4.36.0.dev0
Tokenizer ClassLlamaTokenizer
Padding Token<unk>
Vocabulary Size64002
Torch Data Typefloat16

Quantized Models of the SG Raccoon Yi 55B 200K

Model
Likes
Downloads
VRAM
SG Raccoon Yi 55B 200K GGUF39223 GB

Best Alternatives to SG Raccoon Yi 55B 200K

Best Alternatives
Context / RAM
Downloads
Likes
Etheria 55B V0.1195K / 111.2 GB619
Etheria 55B V0.1195K / 111.2 GB2010
SG Raccoon Yi 55B 200K 2.0195K / 111.3 GB326
SG Raccoon Yi 55B4K / 111.2 GB566
...theria 55B V0.1 3.0bpw H6 EXL2195K / 21.9 GB42
...theria 55B V0.1 3.5bpw H6 EXL2195K / 25.3 GB41
Etheria 55B V0.1 GPTQ195K / 29.2 GB264
Etheria 55B V0.1 AWQ195K / 30.2 GB141
SG Raccoon Yi 55B GPTQ4K / 29.2 GB281
SG Raccoon Yi 55B AWQ4K / 30.2 GB112
Note: green Score (e.g. "73.2") means that the model is better than mlinmg/SG-Raccoon-Yi-55B-200k.

Rank the SG Raccoon Yi 55B 200K Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 43508 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227