Math Shepherd Mistral 7B Prm by peiyi9979

Tags: Arxiv:2312.08935, Autotrain compatible, Endpoints compatible, Llama, Pytorch, Region:us, Sharded

Math Shepherd Mistral 7B Prm Parameters and Internals

Model Type: reward model
Additional Notes: The reward model is part of the Math-Shepherd system, used to process and evaluate step-by-step mathematical solutions.
Input Output
Input Format: question + step-by-step solutions with a special step tag `ки`
Accepted Modalities: text
Output Format: logits of each solution step
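
The input and output formats above can be illustrated with a minimal, standard-library-only sketch. It assumes that every solution step ends with the step tag `ки`, and that a per-step score is obtained by taking a softmax over two candidate logits (one for a "good" label, one for a "bad" label) at each step-tag position. The helper names `build_prm_input` and `step_scores`, the two-logit reduction, and the token ids in the usage note are hypothetical choices made for illustration; in practice the token ids and logits would come from the model's tokenizer and a forward pass.

```python
import math

STEP_TAG = "ки"  # special step tag named in the input format


def build_prm_input(question, steps, step_tag=STEP_TAG):
    """Join a question and its solution steps, appending the step tag to each step.

    Assumption: the tag closes every step, separated by a single space.
    """
    tagged = [f"{step.strip()} {step_tag}" for step in steps]
    return f"{question.strip()} " + " ".join(tagged)


def step_scores(token_ids, good_bad_logits, step_tag_id):
    """Return one score per step-tag position.

    token_ids:        token ids of the full input sequence
    good_bad_logits:  one (good_logit, bad_logit) pair per token position
    step_tag_id:      id of the step-tag token (hypothetical value in this sketch)

    Each score is the softmax probability of the "good" logit, computed only
    at positions whose token is the step tag.
    """
    scores = []
    for token, (good, bad) in zip(token_ids, good_bad_logits):
        if token == step_tag_id:
            peak = max(good, bad)  # subtract the max for numerical stability
            e_good = math.exp(good - peak)
            e_bad = math.exp(bad - peak)
            scores.append(e_good / (e_good + e_bad))
    return scores
```

As a usage sketch, `build_prm_input("What is 2 + 3?", ["Step 1: 2 + 3 = 5.", "Step 2: The answer is 5."])` produces a single string with two `ки` tags, and `step_scores` then returns two values, one per step. A common way to rank candidate solutions with such scores is to take the minimum or the product over steps, though that choice is not specified on this page.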
LLM Name: Math Shepherd Mistral 7B Prm
Repository: https://huggingface.co/peiyi9979/math-shepherd-mistral-7b-prm
Model Size: 7b
Required VRAM: 15.8 GB
Updated: 2025-04-29
Maintainer: peiyi9979
Model Type: llama
Model Files: 9.5 GB (1-of-2), 6.3 GB (2-of-2)
Model Architecture: LlamaForCausalLM
Context Length: 4096
Model Max Length: 4096
Transformers Version: 4.33.1
Tokenizer Class: LlamaTokenizer
Beginning of Sentence Token: <s>
End of Sentence Token: </s>
Unk Token: <unk>
Vocabulary Size: 32000
Torch Data Type: bfloat16

Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227