OpenMath2 Llama3.1 8B by nvidia

 ยป  All LLMs  ยป  nvidia  ยป  OpenMath2 Llama3.1 8B   URL Share it on

  Arxiv:2410.01560   Autotrain compatible Base model:finetune:meta-llama... Base model:meta-llama/llama-3....   Conversational Dataset:nvidia/openmathinstruc...   En   Endpoints compatible   Llama   Math   Nvidia   Region:us   Safetensors   Sharded   Tensorflow

OpenMath2 Llama3.1 8B Benchmarks

nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").
OpenMath2 Llama3.1 8B (nvidia/OpenMath2-Llama3.1-8B)

OpenMath2 Llama3.1 8B Parameters and Internals

Model Type 
text generation
Use Cases 
Areas:
math domain
Primary Use Cases:
math problem solving
Limitations:
not instructed for general data
Additional Notes 
Pipeline and models are fully open-sourced.
Supported Languages 
en (native)
Training Details 
Data Sources:
OpenMathInstruct-2
Methodology:
Finetuning
Input Output 
Input Format:
System/User/Assistant tokens (chat format)
Accepted Modalities:
text
Output Format:
Generated text
Performance Tips:
Recommended to use instructions in provided repository for inference.
LLM NameOpenMath2 Llama3.1 8B
Repository ๐Ÿค—https://huggingface.co/nvidia/OpenMath2-Llama3.1-8B 
Base Model(s)  meta-llama/Llama-3.1-8B   meta-llama/Llama-3.1-8B
Model Size8b
Required VRAM16.1 GB
Updated2024-12-26
Maintainernvidia
Model Typellama
Model Files  10.0 GB: 1-of-2   6.1 GB: 2-of-2
Supported Languagesen
Model ArchitectureLlamaForCausalLM
Licensellama3.1
Context Length131072
Model Max Length131072
Transformers Version4.42.3
Tokenizer ClassPreTrainedTokenizerFast
Vocabulary Size128256
Torch Data Typebfloat16

Quantized Models of the OpenMath2 Llama3.1 8B

Model
Likes
Downloads
VRAM
OpenMath 8B GGUF75184 GB

Best Alternatives to OpenMath2 Llama3.1 8B

Best Alternatives
Context / RAM
Downloads
Likes
...a 3 8B Instruct Gradient 1048K1024K / 16.1 GB4387678
Thor V1.4 8B DARK FICTION1024K / 16.1 GB9412
MrRoboto ProLong 8B V2b1024K / 16.1 GB1780
MrRoboto ProLong 8B V1n1024K / 16.1 GB1770
MrRoboto ProLong 8B V4i1024K / 16.1 GB571
HEL V0.8 8B LONG DARK1024K / 16.1 GB2300
4834155661024K / 16.1 GB680
MrRoboto ProLong 8B V4h1024K / 16.1 GB960
MrRoboto ProLong 8B V4b1024K / 16.1 GB1070
MrRoboto ProLong 8B V4c1024K / 16.1 GB860
Note: green Score (e.g. "73.2") means that the model is better than nvidia/OpenMath2-Llama3.1-8B.

Rank the OpenMath2 Llama3.1 8B Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 40303 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227