Deep Miqu 103B By jukofyork: Benchmarks, Features and Detailed Analysis. Insights on Deep Miqu 103B.

Merged Model Autotrain compatible Base model:jukofyork/dark-miqu... Base model:jukofyork/dawn-miqu... Endpoints compatible Llama Region:us Safetensors Sharded Tensorflow

Model Card on HF 🤗: https://huggingface.co/jukofyork/Deep-Miqu-103B

Deep Miqu 103B Benchmarks

LLME Score: 0.18535

^nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").

What is the LLM Explorer Rank (Score)

Deep Miqu 103B (jukofyork/Deep-Miqu-103B)

Deep Miqu 103B Parameters and Internals

Model Type

creative writing

Additional Notes

Created using Mergekit from two 70B parameter miqu-based models. Suitable for non-commercial, personal use only.

Input Output

Input Format:

Vicuna format is preferred. Mistral and Alpaca formats are also supported.

LLM Name	Deep Miqu 103B
Repository 🤗	https://huggingface.co/jukofyork/Deep-Miqu-103B
Base Model(s)	Dark Miqu 70B Dawn Miqu 70B jukofyork/Dark-Miqu-70B jukofyork/Dawn-Miqu-70B
Merged Model	Yes
Model Size	70b
Required VRAM	206.5 GB
Updated	2024-12-03
Maintainer	jukofyork
Model Type	llama
Model Files	9.6 GB: 1-of-21 10.0 GB: 2-of-21 10.0 GB: 3-of-21 10.0 GB: 4-of-21 10.0 GB: 5-of-21 9.9 GB: 6-of-21 9.6 GB: 7-of-21 9.7 GB: 8-of-21 10.0 GB: 9-of-21 9.9 GB: 10-of-21 9.9 GB: 11-of-21 9.9 GB: 12-of-21 9.7 GB: 13-of-21 9.7 GB: 14-of-21 9.8 GB: 15-of-21 9.6 GB: 16-of-21 9.8 GB: 17-of-21 10.0 GB: 18-of-21 9.6 GB: 19-of-21 9.8 GB: 20-of-21 10.0 GB: 21-of-21
Model Architecture	LlamaForCausalLM
License	other
Context Length	32764
Model Max Length	32764
Transformers Version	4.39.3
Tokenizer Class	LlamaTokenizer
Padding Token	<unk>
Vocabulary Size	32000
Torch Data Type	float16

Best Alternatives to Deep Miqu 103B

Best Alternatives	Context / RAM	Downloads	Likes
... Chat 1048K Chinese Llama3 70B	1024K / 141.9 GB	4035	5
... Chat 1048K Chinese Llama3 70B	1024K / 141.9 GB	2524	5
... 3 70B Instruct Gradient 1048K	1024K / 141.9 GB	199	121
Llama3 Function Calling 1048K	1024K / 141.9 GB	6	1
...a 3 70B Instruct Gradient 524K	512K / 141.9 GB	233	23
...a 3 70B Instruct Gradient 262K	256K / 141.9 GB	186	55
...ama 3 70B Arimas Story RP V1.5	256K / 141.2 GB	321	2
...ama 3 70B Arimas Story RP V2.0	256K / 141.1 GB	65	3
...ama 3 70B Arimas Story RP V1.6	256K / 141.2 GB	12	0
Yi 70B 200K RPMerge Franken	195K / 142.4 GB	10	1

Note: green Score (e.g. "73.2") means that the model is better than jukofyork/Deep-Miqu-103B.

Rank the Deep Miqu 103B Capabilities

🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟

Instruction Following and Task Automation
Factuality and Completeness of Knowledge
Censorship and Alignment
Data Analysis and Insight Generation
Text Generation
Text Summarization and Feature Extraction
Code Generation
Multi-Language Support and Translation

What open-source LLMs or SLMs are you in search of? 43470 in total.

Email us: info@extractum.io. Our Privacy Policy | Terms and Conditions | Suggest an improvement.

Our Social Media →

Original data from HuggingFace, OpenCompass and various public git repos.

Release v20241227

Support LLM Explorer

Deep Miqu 103B by jukofyork

» All LLMs » jukofyork » Deep Miqu 103B URL Share it on

Deep Miqu 103B Benchmarks

Deep Miqu 103B Parameters and Internals

Best Alternatives to Deep Miqu 103B

Rank the Deep Miqu 103B Capabilities

What open-source LLMs or SLMs are you in search of? 43470 in total.