NuExtract 1.5 Smol by numind


Base model:finetune:huggingfac... Base model:huggingfacetb/smoll...   Llama   Multilingual   Region:us   Safetensors


NuExtract 1.5 Smol Parameters and Internals

Model Type: text generation, information extraction
Primary Use Cases: structured information extraction
Additional Notes: The model is designed for pure extraction, so inputs should follow the input format given here.
Supported Languages: English (fluent); other languages supported
Training Methodology: fine-tuned from Hugging Face's SmolLM2-1.7B
Input Format: <|input|>\n### Template:\n{template}\n### Text:\n{text}\n\n<|output|>
Accepted Modalities: text
Output Format: structured JSON that follows the template
Performance Tips: use a JSON template for structured extraction
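The input format above can be filled in programmatically. The following is a minimal sketch, not the maintainer's reference code: the helper names `build_prompt` and `parse_output` are hypothetical, the example template is made up for illustration, and the convention of leaving template values empty for the model to fill is an assumption. It also assumes the generated text has been decoded with special tokens stripped.

```python
import json


def build_prompt(template: dict, text: str) -> str:
    """Fill the listed input format with a JSON template and the source text."""
    template_json = json.dumps(template, indent=4)
    return (
        "<|input|>\n### Template:\n"
        f"{template_json}\n### Text:\n{text}\n\n<|output|>"
    )


def parse_output(generated: str) -> dict:
    """Parse the JSON that follows the last <|output|> marker.

    Assumes `generated` is the prompt plus the model's completion,
    decoded without special tokens.
    """
    answer = generated.split("<|output|>")[-1]
    return json.loads(answer.strip())


# Hypothetical template: keys name the fields to extract,
# empty values are placeholders (an assumed convention).
template = {"name": "", "license": "", "languages": []}
prompt = build_prompt(
    template,
    "NuExtract 1.5 Smol is an MIT-licensed model by numind.",
)
```

Because the output is expected to be JSON matching the template, `json.loads` raises `json.JSONDecodeError` when the completion is malformed, which is a convenient place to add retries or validation.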
LLM Name: NuExtract 1.5 Smol
Repository: https://huggingface.co/numind/NuExtract-1.5-smol
Base Model(s): HuggingFaceTB/SmolLM2-1.7B
Model Size: 1.7B
Required VRAM: 3.4 GB
Updated: 2025-04-23
Maintainer: numind
Model Type: llama
Model Files: 3.4 GB
Model Architecture: LlamaForCausalLM
License: MIT
Context Length: 8192
Model Max Length: 8192
Transformers Version: 4.45.2
Tokenizer Class: GPT2Tokenizer
Padding Token: <|endoftext|>
Vocabulary Size: 49152
Torch Data Type: bfloat16
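
The 3.4 GB figure is consistent with storing 1.7 billion parameters at 2 bytes each in bfloat16. The sketch below checks that arithmetic and shows one plausible way to load the checkpoint. It assumes 1 GB = 10^9 bytes, counts weights only (no activations or KV cache), and assumes the standard Hugging Face `transformers` Auto classes apply to this repository; `weight_memory_gb` and `load_model` are hypothetical helper names.

```python
BF16_BYTES = 2  # bfloat16 stores each parameter in 2 bytes


def weight_memory_gb(n_params: float, bytes_per_param: int = BF16_BYTES) -> float:
    """Approximate memory for the weights alone, in GB (10^9 bytes)."""
    return n_params * bytes_per_param / 1e9


def load_model(repo_id: str = "numind/NuExtract-1.5-smol"):
    """Load tokenizer and model in bfloat16 (assumed loading path)."""
    # Imported lazily so the arithmetic above runs without these packages.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(repo_id)
    model = AutoModelForCausalLM.from_pretrained(
        repo_id, torch_dtype=torch.bfloat16
    )
    return tokenizer, model.eval()


estimated_gb = weight_memory_gb(1.7e9)
print(f"Estimated weight memory: {estimated_gb:.1f} GB")
```

Actual usage at inference time will be somewhat higher than this estimate, since it grows with prompt length up to the 8192-token context.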

Quantized Models of the NuExtract 1.5 Smol

Model
Likes
Downloads
VRAM
NuExtract 1.5 Smol GGUF71100 GB

Best Alternatives to NuExtract 1.5 Smol

Best Alternatives | Context / RAM | Downloads, Likes
SmolLM2 1.7B Instruct 16K | 16K / 3.4 GB | 1099
SmolLM2 1.7B Instruct | 8K / 3.4 GB | 82977599
SmolLM2 1.7B | 8K / 3.4 GB | 36108117
Superthoughts Lite V1 | 8K / 3.4 GB | 3342
SmolTulu 1.7B Reinforced | 8K / 3.4 GB | 195
Cllm 1.0.0 340000 Instruct | 8K / 3.5 GB | 1060
SmolLM2 1.7B R1 Distilled | 8K / 3.4 GB | 920
...urtis E1 SmolLM2 1.7B Instruct | 8K / 6.7 GB | 900
SmolLM2 1.7 Persona | 8K / 3.5 GB | 80
SmolLM2 1.7B Instruct | 8K / 3.4 GB | 32804

Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227