Dark Miqu 120B By jukofyork: Benchmarks, Features and Detailed Analysis. Insights on Dark Miqu 120B.

Merged Model Autotrain compatible Base model:finetune:jukofyork/... Base model:jukofyork/dark-miqu... Endpoints compatible Llama Region:us Safetensors Sharded Tensorflow

Model Card on HF 🤗: https://huggingface.co/jukofyork/Dark-Miqu-120B

Dark Miqu 120B Benchmarks

LLME Score: 0.19883

^nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").

What is the LLM Explorer Rank (Score)

Dark Miqu 120B (jukofyork/Dark-Miqu-120B)

Dark Miqu 120B Parameters and Internals

Model Type

creative writing

Training Details

Methodology:

Created using 'Mergekit' from Dark-Miqu-70B model

Context Length:

32000

Input Output

Input Format:

Supports 'Vicuna', 'Mistral', and 'Alpaca' formats

LLM Name	Dark Miqu 120B
Repository 🤗	https://huggingface.co/jukofyork/Dark-Miqu-120B
Base Model(s)	Dark Miqu 70B Dark Miqu 70B jukofyork/Dark-Miqu-70B jukofyork/Dark-Miqu-70B
Merged Model	Yes
Model Size	70b
Required VRAM	240.7 GB
Updated	2024-12-03
Maintainer	jukofyork
Model Type	llama
Model Files	9.6 GB: 1-of-25 10.0 GB: 2-of-25 9.9 GB: 3-of-25 9.8 GB: 4-of-25 9.7 GB: 5-of-25 9.7 GB: 6-of-25 9.8 GB: 7-of-25 9.8 GB: 8-of-25 9.8 GB: 9-of-25 10.0 GB: 10-of-25 10.0 GB: 11-of-25 9.7 GB: 12-of-25 9.8 GB: 13-of-25 9.9 GB: 14-of-25 9.7 GB: 15-of-25 9.8 GB: 16-of-25 9.7 GB: 17-of-25 9.8 GB: 18-of-25 9.9 GB: 19-of-25 10.0 GB: 20-of-25 9.9 GB: 21-of-25 9.9 GB: 22-of-25 10.0 GB: 23-of-25 9.8 GB: 24-of-25 4.7 GB: 25-of-25
Model Architecture	LlamaForCausalLM
License	other
Context Length	32764
Model Max Length	32764
Transformers Version	4.39.3
Tokenizer Class	LlamaTokenizer
Padding Token	<unk>
Vocabulary Size	32000
Torch Data Type	float16

Best Alternatives to Dark Miqu 120B

Best Alternatives	Context / RAM	Downloads	Likes
... Chat 1048K Chinese Llama3 70B	1024K / 141.9 GB	4035	5
... Chat 1048K Chinese Llama3 70B	1024K / 141.9 GB	2524	5
... 3 70B Instruct Gradient 1048K	1024K / 141.9 GB	199	121
Llama3 Function Calling 1048K	1024K / 141.9 GB	6	1
...a 3 70B Instruct Gradient 524K	512K / 141.9 GB	233	23
...a 3 70B Instruct Gradient 262K	256K / 141.9 GB	186	55
...ama 3 70B Arimas Story RP V1.5	256K / 141.2 GB	321	2
...ama 3 70B Arimas Story RP V2.0	256K / 141.1 GB	65	3
...ama 3 70B Arimas Story RP V1.6	256K / 141.2 GB	12	0
Yi 70B 200K RPMerge Franken	195K / 142.4 GB	10	1

Note: green Score (e.g. "73.2") means that the model is better than jukofyork/Dark-Miqu-120B.

Rank the Dark Miqu 120B Capabilities

🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟

Instruction Following and Task Automation
Factuality and Completeness of Knowledge
Censorship and Alignment
Data Analysis and Insight Generation
Text Generation
Text Summarization and Feature Extraction
Code Generation
Multi-Language Support and Translation

What open-source LLMs or SLMs are you in search of? 43470 in total.

Email us: info@extractum.io. Our Privacy Policy | Terms and Conditions | Suggest an improvement.

Our Social Media →

Original data from HuggingFace, OpenCompass and various public git repos.

Release v20241227

Support LLM Explorer

Dark Miqu 120B by jukofyork

» All LLMs » jukofyork » Dark Miqu 120B URL Share it on

Dark Miqu 120B Benchmarks

Dark Miqu 120B Parameters and Internals

Best Alternatives to Dark Miqu 120B

Rank the Dark Miqu 120B Capabilities

What open-source LLMs or SLMs are you in search of? 43470 in total.