Nemotron W 4B Halo 0.1 by Nexesenex

 ยป  All LLMs  ยป  Nexesenex  ยป  Nemotron W 4B Halo 0.1   URL Share it on

  Merged Model   Arxiv:2403.19522   Autotrain compatible Base model:bunnycore/llama-3.1... Base model:fourohfour/maelstro... Base model:nexesenex/nemotron ...   Conversational   Endpoints compatible   Llama   Model-index   Region:us   Safetensors   Sharded   Tensorflow

Nemotron W 4b Halo 0.1 Benchmarks

nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").
Nemotron W 4B Halo 0.1 (Nexesenex/Nemotron_W_4b_Halo_0.1)

Nemotron W 4B Halo 0.1 Parameters and Internals

LLM NameNemotron W 4b Halo 0.1
Repository ๐Ÿค—https://huggingface.co/Nexesenex/Nemotron_W_4b_Halo_0.1 
Base Model(s)  Nexesenex/Nemotron_W_4b_MagLight_0.1   bunnycore/LLama-3.1-4B-TitanFusion   Maelstrom 4B   Nexesenex/Nemotron_W_4b_MagLight_0.1   bunnycore/LLama-3.1-4B-TitanFusion   FourOhFour/Maelstrom_4B
Merged ModelYes
Model Size4b
Required VRAM9.2 GB
Updated2025-04-23
MaintainerNexesenex
Model Typellama
Model Files  0.8 GB: 1-of-10   1.0 GB: 2-of-10   1.0 GB: 3-of-10   1.0 GB: 4-of-10   1.0 GB: 5-of-10   1.0 GB: 6-of-10   1.0 GB: 7-of-10   1.0 GB: 8-of-10   1.0 GB: 9-of-10   0.4 GB: 10-of-10
Model ArchitectureLlamaForCausalLM
Context Length131072
Model Max Length131072
Transformers Version4.48.2
Tokenizer ClassPreTrainedTokenizerFast
Padding Token<|end_of_text|>
Vocabulary Size128256
Torch Data Typebfloat16

Best Alternatives to Nemotron W 4B Halo 0.1

Best Alternatives
Context / RAM
Downloads
Likes
SJT 4B146K / 7.6 GB70
Nemotron W 4b MagLight 0.1128K / 9.2 GB113
Loxa 4B128K / 16 GB150
...ama 3.1 Minitron 4B Depth Base128K / 9.1 GB753121
Hamanasu 4B Instruct KTO V2128K / 9 GB2011
...ama 3.1 Minitron 4B Width Base128K / 9 GB7519187
Aura 4B128K / 9 GB910
Hamanasu 4B Instruct KTO V1128K / 9 GB1281
Hamanasu KTO 4B128K / 9 GB1420
Hamanasu Magnum 4B128K / 9 GB242
Note: green Score (e.g. "73.2") means that the model is better than Nexesenex/Nemotron_W_4b_Halo_0.1.

Rank the Nemotron W 4B Halo 0.1 Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 46599 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227