Nemo 12B Marlin V8 By UsernameJustAnother: Benchmarks, Features and Detailed Analysis. Insights on Nemo 12B Marlin V8.

Autotrain compatible Base model:finetune:unsloth/mi... Base model:unsloth/mistral-nem... Conversational Dataset:fizzarolli/fallingthro... Dataset:kalomaze/opus instruct... Dataset:nothingiisreal/reddit-... Dataset:sao10k/c2-logs-filtere... En Endpoints compatible Experimental Instruct Long-context Mistral Region:us Rp Safetensors Sharded Tensorflow Trl Unsloth Writing

Model Card on HF 🤗: https://huggingface.co/UsernameJustAnother/Nemo-12B-Marlin-v8

Nemo 12B Marlin V8 Benchmarks

LLME Score: 0.21692

^nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").

What is the LLM Explorer Rank (Score)

Nemo 12B Marlin V8 (UsernameJustAnother/Nemo-12B-Marlin-v8)

Nemo 12B Marlin V8 Parameters and Internals

Model Type

text-generation-inference, transformers, experimental, long-context

Additional Notes

Experimental model for RP/storywriting with a larger context.

Supported Languages

en (English)

Training Details

Data Sources:

kalomaze/Opus_Instruct_25k, Fizzarolli/FallingThroughTheSkies-592k-Filtered-Filtered, Sao10K/c2-Logs-Filtered, nothingiisreal/Reddit-Dirty-And-WritingPrompts

Data Volume:

10K-ish records

Methodology:

Fine-tuned using ChatML with filtering and editing for data curation

Context Length:

16000

Training Time:

7.5 hours

Hardware Used:

A100 80GB from runpod.io

Release Notes

Version:

Notes:

Fine-tuned on Nemo Base instead of Instruct, trained with dataset skill enhancements, and fits in 16GB VRAM.

LLM Name	Nemo 12B Marlin V8
Repository 🤗	https://huggingface.co/UsernameJustAnother/Nemo-12B-Marlin-v8
Base Model(s)	Mistral Nemo Base 2407 unsloth/Mistral-Nemo-Base-2407
Model Size	12b
Required VRAM	24.5 GB
Updated	2025-02-05
Maintainer	UsernameJustAnother
Model Type	mistral
Instruction-Based	Yes
Model Files	4.9 GB: 1-of-5 4.9 GB: 2-of-5 4.9 GB: 3-of-5 4.9 GB: 4-of-5 4.9 GB: 5-of-5
Supported Languages	en
Model Architecture	MistralForCausalLM
License	apache-2.0
Context Length	1024000
Model Max Length	1024000
Transformers Version	4.44.1
Tokenizer Class	PreTrainedTokenizerFast
Padding Token	<pad>
Vocabulary Size	131072
Torch Data Type	bfloat16

Best Alternatives to Nemo 12B Marlin V8

Best Alternatives	Context / RAM	Downloads	Likes
...r Nemo 12B Instruct R 21 09 24	1000K / 24.5 GB	8449	106
SauerkrautLM Nemo 12B Instruct	1000K / 24.5 GB	19527	22
Mistral Nemo Wissenschaft 12B	1000K / 24.5 GB	5216	7
MN Slush	1000K / 24.5 GB	307	19
Magnum V4 12B	1000K / 24.5 GB	844	39
ChatWaifu V1.4	1000K / 24.5 GB	97	19
ChatWaifu 12B V2.0	1000K / 24.5 GB	57	18
...tral Nemo Gutenberg Doppel 12B	1000K / 24.5 GB	61	5
GodSlayer 12B ABYSS	1000K / 24.5 GB	84	5
SAINEMO ReMIX	1000K / 24.5 GB	994	24

Note: green Score (e.g. "73.2") means that the model is better than UsernameJustAnother/Nemo-12B-Marlin-v8.

Rank the Nemo 12B Marlin V8 Capabilities

🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟

Instruction Following and Task Automation
Factuality and Completeness of Knowledge
Censorship and Alignment
Data Analysis and Insight Generation
Text Generation
Text Summarization and Feature Extraction
Code Generation
Multi-Language Support and Translation

What open-source LLMs or SLMs are you in search of? 42577 in total.

Email us: info@extractum.io. Our Privacy Policy | Terms and Conditions | Suggest an improvement.

Our Social Media →

Original data from HuggingFace, OpenCompass and various public git repos.

Release v20241227

Support LLM Explorer

Nemo 12B Marlin V8 by UsernameJustAnother

» All LLMs » UsernameJustAnother » Nemo 12B Marlin V8 URL Share it on

Nemo 12B Marlin V8 Benchmarks

Nemo 12B Marlin V8 Parameters and Internals

Best Alternatives to Nemo 12B Marlin V8

Rank the Nemo 12B Marlin V8 Capabilities

What open-source LLMs or SLMs are you in search of? 42577 in total.