Nucleus 22B Token 500B GGUF by TheBloke

 ยป  All LLMs  ยป  TheBloke  ยป  Nucleus 22B Token 500B GGUF   URL Share it on

Base model:nucleusai/nucleus-2... Base model:quantized:nucleusai...   En   Gguf   Llama   Quantized   Region:us

Nucleus 22B Token 500B GGUF Benchmarks

nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").
Nucleus 22B Token 500B GGUF (TheBloke/nucleus-22B-token-500B-GGUF)

Nucleus 22B Token 500B GGUF Parameters and Internals

Model Type 
Causal decoder-only
Use Cases 
Areas:
Research
Primary Use Cases:
summarization, text generation, chatbot
Considerations:
Needs finetuning for specific use cases.
Additional Notes 
It is a raw, pretrained model, intended to be further fine-tuned for specific use cases.
Supported Languages 
en (English)
Training Details 
Data Sources:
RefinedWeb, Books, Code, Technical, Math
Data Volume:
500B tokens
Context Length:
2048
Training Time:
two weeks
Hardware Used:
256 A100 80GB GPUs
Model Architecture:
Causal decoder-only
LLM NameNucleus 22B Token 500B GGUF
Repository ๐Ÿค—https://huggingface.co/TheBloke/nucleus-22B-token-500B-GGUF 
Model NameNucleus 22B Token 500B
Model CreatorNucleusAI
Base Model(s)  Nucleus 22B Token 500B   NucleusAI/nucleus-22B-token-500B
Model Size22b
Required VRAM9.1 GB
Updated2025-02-05
MaintainerTheBloke
Model Typellama
Model Files  9.1 GB   11.6 GB   10.6 GB   9.5 GB   12.3 GB   13.2 GB   12.4 GB   15.0 GB   15.5 GB   15.0 GB   17.9 GB   23.2 GB
Supported Languagesen
GGUF QuantizationYes
Quantization Typegguf
Model ArchitectureAutoModel
Licensemit

Best Alternatives to Nucleus 22B Token 500B GGUF

Best Alternatives
Context / RAM
Downloads
Likes
Codestral 22B V0.1 GGUF0K / 4.8 GB1409
...t2cypher Codestral Q4 K M Gguf0K / 13.3 GB194
...xt2cypher Codestral 16bit Gguf0K / 44.5 GB280
Qwen1.5 22B Chat Merge GGUF0K / 12.6 GB181
Llama2 22B Daydreamer V3 GGUF0K / 9.1 GB1662
Llama2 22B Daydreamer V2 GGUF0K / 9.1 GB1461
Llama2 22B GPLATTY GGUF0K / 9.1 GB1163
Huginn 22B Prototype GGUF0K / 9.1 GB962
Huginn 22B Prototype GGML0K / 9.2 GB31
Llama2 22B GPLATTY GGML0K / 9.2 GB97
Note: green Score (e.g. "73.2") means that the model is better than TheBloke/nucleus-22B-token-500B-GGUF.

Rank the Nucleus 22B Token 500B GGUF Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 42577 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227