Dart Math Llama3 8B Prop2diff by hkust-nlp


Arxiv:2403.02884, Arxiv:2403.04706, Arxiv:2407.13690, Autotrain compatible, Base model:finetune:meta-llama..., Base model:meta-llama/meta-lla..., Dataset:hkust-nlp/dart-math-ha..., En, Endpoints compatible, Llama, Mathematics, Model-index, Region:us, Safetensors, Sharded, Tensorflow


Dart Math Llama3 8B Prop2diff Parameters and Internals

Model Type 
text-generation
Use Cases 
Areas:
Research, Educational Tools
Applications:
Mathematical Problem-Solving, Educational Math Platforms
Primary Use Cases:
Solving complex mathematical problems, Enhancing educational content in mathematics
Limitations:
May not perform well in non-mathematical contexts
Considerations:
Primarily for use in mathematical and educational contexts
Supported Languages 
en (proficient)
Training Details 
Data Sources:
MATH, GSM8K
Methodology:
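The model is trained on data synthesized with Difficulty-Aware Rejection Sampling (DARS) from the MATH and GSM8K training sets. DARS allocates more sampling trials to difficult queries, countering the bias toward easy queries in standard rejection sampling.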
Context Length:
4096
Hardware Used:
8 A100 GPUs
Model Architecture:
Llama3-8B architecture, fine-tuned on synthetic datasets
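The "Prop2diff" in the model name refers to allocating synthesized responses in proportion to query difficulty. The following is a toy sketch of that allocation idea only, not the authors' implementation: the function name `prop2diff_targets`, the example fail rates, and the floor of one response per query are all assumptions made for illustration.

```python
def prop2diff_targets(fail_rates, max_per_query):
    """Return a target number of correct responses per query.

    Targets scale linearly with each query's fail rate (used here as the
    difficulty score), so the hardest query receives `max_per_query`.
    The floor of 1 is an assumption of this sketch, not taken from the source.
    """
    hardest = max(fail_rates.values())
    if hardest == 0:
        # Every query is trivially easy: fall back to one response each.
        return {query: 1 for query in fail_rates}
    return {
        query: max(1, round(max_per_query * rate / hardest))
        for query, rate in fail_rates.items()
    }


# Hypothetical fail rates (fraction of sampled responses that were wrong).
fail_rates = {"easy": 0.1, "medium": 0.5, "hard": 1.0}
targets = prop2diff_targets(fail_rates, max_per_query=8)
print(targets)
```

In a real pipeline, sampling for each query would continue until its target number of correct responses is collected, which is why hard queries consume far more sampling trials than easy ones.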
Input Output 
Input Format:
Alpaca prompt template
Accepted Modalities:
text
Output Format:
Generated text responses
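As a minimal sketch of the input format, the snippet below wraps a question in the commonly used Alpaca no-input template. The page only names the template, so the exact wording here is an assumption to verify against the model card. The helper names `build_prompt` and `generate_answer` are hypothetical, and `generate_answer` assumes `torch`, `transformers`, and `accelerate` are installed and that enough GPU memory is available.

```python
ALPACA_TEMPLATE = (
    "Below is an instruction that describes a task. "
    "Write a response that appropriately completes the request.\n\n"
    "### Instruction:\n{instruction}\n\n### Response:\n"
)


def build_prompt(question: str) -> str:
    """Wrap a math question in the (assumed) Alpaca no-input template."""
    return ALPACA_TEMPLATE.format(instruction=question.strip())


def generate_answer(
    question: str,
    repo_id: str = "hkust-nlp/dart-math-llama3-8b-prop2diff",
    max_new_tokens: int = 512,
) -> str:
    """Load the model and greedily decode an answer (downloads the weights)."""
    # Imported lazily so the prompt helper works without these packages.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(repo_id)
    model = AutoModelForCausalLM.from_pretrained(
        repo_id, torch_dtype=torch.bfloat16, device_map="auto"
    )
    inputs = tokenizer(build_prompt(question), return_tensors="pt").to(model.device)
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens, do_sample=False)
    # Drop the prompt tokens and decode only the newly generated text.
    new_tokens = output_ids[0][inputs["input_ids"].shape[1]:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True)


prompt = build_prompt("What is 12 * 7?")
print(prompt)
```

Calling `generate_answer("What is 12 * 7?")` would run the full pipeline; it is left uncalled here because it downloads the model weights.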
LLM Name: Dart Math Llama3 8B Prop2diff
Repository: https://huggingface.co/hkust-nlp/dart-math-llama3-8b-prop2diff
Base Model(s): Meta Llama 3 8B (meta-llama/Meta-Llama-3-8B)
Model Size: 8b
Required VRAM: 16.1 GB
Updated: 2024-12-04
Maintainer: hkust-nlp
Model Type: llama
Model Files: 5.0 GB (1-of-4), 5.0 GB (2-of-4), 4.9 GB (3-of-4), 1.2 GB (4-of-4)
Supported Languages: en
Model Architecture: LlamaForCausalLM
License: llama3
Context Length: 8192
Model Max Length: 8192
Transformers Version: 4.41.1
Tokenizer Class: PreTrainedTokenizerFast
Padding Token: [PAD]
Vocabulary Size: 128320
Torch Data Type: bfloat16
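The listed shard sizes, the required VRAM figure, and the bfloat16 data type appear consistent with one another. A rough check, assuming about 8.03 billion parameters (the page only says "8b") and counting weights only, with no activations or KV cache:

```python
# Shard sizes as listed on the page, in GB.
shard_sizes_gb = [5.0, 5.0, 4.9, 1.2]
total_gb = round(sum(shard_sizes_gb), 1)

# Assumption: roughly 8.03e9 parameters; bfloat16 stores 2 bytes per parameter.
approx_params = 8.03e9
bytes_per_param = 2
weights_gb = approx_params * bytes_per_param / 1e9

print(f"Sum of shards: {total_gb} GB")
print(f"Estimated bf16 weights: {weights_gb:.2f} GB")
```

Actual memory use during inference will be higher than the weight footprint, since the estimate leaves out activations and the KV cache.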

Best Alternatives to Dart Math Llama3 8B Prop2diff

Model | Context / RAM | Downloads and Likes
...a 3 8B Instruct Gradient 1048K | 1024K / 16.1 GB | 14288676
Loki V2.8 8B EROTICA | 1024K / 16.1 GB | 192
Thor V1.3a 8B FANTASY 1024K | 1024K / 16.1 GB | 1501
Odin V1.1 8B FICTION 1024K | 1024K / 16.1 GB | 1040
RP Naughty V1.0e 8B | 1024K / 16.1 GB | 561
RP Naughty V1.2 8B | 1024K / 16.1 GB | 401
...or V1.35 8B DARK FANTASY 1024K | 1024K / 16.1 GB | 261
Loki V2.75B 8B EROTICA 1024K | 1024K / 16.1 GB | 191
Loki V2.75 8B EROTICA 1024K | 1024K / 16.1 GB | 171
8B Base Academic 1 | 1024K / 16.1 GB | 61

Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241124