Marco O1 by AIDC-AI

 ยป  All LLMs  ยป  AIDC-AI  ยป  Marco O1   URL Share it on

  Arxiv:2411.14405   Autotrain compatible   Conversational   Qwen2   Region:us   Safetensors   Sharded   Tensorflow
Model Card on HF ๐Ÿค—: https://huggingface.co/AIDC-AI/Marco-o1 

Marco O1 Benchmarks

nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").
Marco O1 (AIDC-AI/Marco-o1)

Marco O1 Parameters and Internals

Model Type 
text generation, reasoning
Use Cases 
Areas:
research, complex problem solving, translation
Primary Use Cases:
open-ended problem solving, machine translation, reasoning tasks
Training Details 
Data Sources:
Open-O1 CoT dataset, Marco-o1 CoT dataset, Marco-o1 Instruction dataset
Methodology:
Chain-of-Thought (CoT) fine-tuning, Monte Carlo Tree Search (MCTS), reflection mechanisms, reasoning strategies
LLM NameMarco O1
Repository ๐Ÿค—https://huggingface.co/AIDC-AI/Marco-o1 
Model Size7.6b
Required VRAM15.2 GB
Updated2024-12-26
MaintainerAIDC-AI
Model Typeqwen2
Model Files  4.9 GB: 1-of-4   4.9 GB: 2-of-4   4.3 GB: 3-of-4   1.1 GB: 4-of-4
Model ArchitectureQwen2ForCausalLM
Licenseapache-2.0
Context Length32768
Model Max Length32768
Transformers Version4.41.2
Tokenizer ClassQwen2Tokenizer
Padding Token<|endoftext|>
Vocabulary Size152064
Torch Data Typebfloat16
Errorsreplace

Quantized Models of the Marco O1

Model
Likes
Downloads
VRAM
Marco O1 GGUF1114723 GB

Best Alternatives to Marco O1

Best Alternatives
Context / RAM
Downloads
Likes
Exp 3 Q R128K / 15.2 GB420
T36Model128K / 15.2 GB4450
T21Model128K / 15.2 GB3180
Exp 2 Q R128K / 15.2 GB250
T35Model128K / 15.2 GB500
Arcee Agent128K / 15.2 GB23688
Arcee Spark32K / 15.2 GB287086
Oolel V0.132K / 30.5 GB40213
Kurage Multilingual32K / 15.2 GB3528
Kurage Ru32K / 15.2 GB413

Rank the Marco O1 Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 40248 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241217