MAmmoTH Coder 13B by TIGER-Lab

 ยป  All LLMs  ยป  TIGER-Lab  ยป  MAmmoTH Coder 13B   URL Share it on

  Arxiv:2309.05653   Autotrain compatible   Codegen   Dataset:tiger-lab/mathinstruct   En   Endpoints compatible   Llama   Pytorch   Region:us   Sharded

MAmmoTH Coder 13B Benchmarks

nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").
MAmmoTH Coder 13B (TIGER-Lab/MAmmoTH-Coder-13B)

MAmmoTH Coder 13B Parameters and Internals

Model Type 
Text generation, Math problem-solving
Use Cases 
Applications:
Research, Educational software, Tutoring systems
Primary Use Cases:
General math problem-solving
Limitations:
Performance may vary based on complexity and specifics of the math problem
Considerations:
Not all mathematical fields may be comprehensively covered
Supported Languages 
en (High)
Training Details 
Data Sources:
MathInstruct Dataset
Methodology:
Hybrid Instruction Tuning with Chain-of-Thought and Program-of-Thought rationales
Input Output 
Input Format:
Instruction-based with CoT or PoT rationale
Accepted Modalities:
text
Output Format:
Text - Solution to math problem
LLM NameMAmmoTH Coder 13B
Repository ๐Ÿค—https://huggingface.co/TIGER-Lab/MAmmoTH-Coder-13B 
Model Size13b
Required VRAM52.1 GB
Updated2024-12-22
MaintainerTIGER-Lab
Model Typellama
Model Files  10.0 GB: 1-of-6   9.9 GB: 2-of-6   9.9 GB: 3-of-6   9.9 GB: 4-of-6   9.9 GB: 5-of-6   2.5 GB: 6-of-6
Supported Languagesen
Generates CodeYes
Model ArchitectureLlamaForCausalLM
Licensemit
Context Length16384
Model Max Length16384
Transformers Version4.32.0
Tokenizer ClassLlamaTokenizer
Beginning of Sentence Token<s>
End of Sentence Token</s>
Unk Token<unk>
Vocabulary Size32017
Torch Data Typefloat32

Quantized Models of the MAmmoTH Coder 13B

Model
Likes
Downloads
VRAM
MAmmoTH Coder 13B GGUF42675 GB
MAmmoTH Coder 13B AWQ1377 GB
MAmmoTH Coder 13B GPTQ3187 GB

Best Alternatives to MAmmoTH Coder 13B

Best Alternatives
Context / RAM
Downloads
Likes
CodeLlama 13B MORepair16K / 26 GB26502
NexusRaven V2 13B16K / 26 GB3919465
CodeLlama 13B Instruct Hf16K / 26 GB16223144
CodeLlama 13B Hf16K / 26 GB13785101
CodeLlama 13B Instruct Hf16K / 26 GB99318
...ma 13B Hf Truncated Embeddings16K / 52.3 GB170
CodeLlama 13B Hf16K / 26 GB48160
Tora Code 13B V1.016K / 26 GB116014
CodeLlama 13B Python Hf16K / 26 GB149848
Codellama 13B Oasst Sft V1016K / 25.9 GB194166
Note: green Score (e.g. "73.2") means that the model is better than TIGER-Lab/MAmmoTH-Coder-13B.

Rank the MAmmoTH Coder 13B Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 40066 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241217