Callisto OCR3 2B Instruct by prithivMLmods


Tags: 2b, Base model:finetune:qwen/qwen2..., Base model:qwen/qwen2-vl-2b-in..., Callisto, Conversational, Dataset:linxy/latex ocr, Dataset:mychen76/invoices-and-..., Dataset:prithivmlmods/img2text..., Dataset:prithivmlmods/img2text..., Dataset:unsloth/latex ocr, En, Endpoints compatible, Feature-extraction, Image-text-to-text, Instruct, Key information extraction, Kie, Messy handwriting recognition, Ocr, Ocr#3, Qwen2 vl, Rag, Region:us, Safetensors, Vlm


Callisto OCR3 2B Instruct Parameters and Internals

LLM Name: Callisto OCR3 2B Instruct
Repository: https://huggingface.co/prithivMLmods/Callisto-OCR3-2B-Instruct
Base Model(s): Qwen2 VL 2B Instruct (Qwen/Qwen2-VL-2B-Instruct)
Model Size: 2b
Required VRAM: 4.4 GB
Updated: 2025-04-28
Maintainer: prithivMLmods
Model Type: qwen2_vl
Instruction-Based: Yes
Model Files: 4.4 GB
Supported Languages: en
Model Architecture: Qwen2VLModel
License: apache-2.0
Context Length: 32768
Model Max Length: 32768
Transformers Version: 4.50.2
Tokenizer Class: Qwen2Tokenizer
Padding Token: <|vision_pad|>
Vocabulary Size: 151936
Torch Data Type: float16
Errors: replace
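
The size figures above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes that 1 GB means 10^9 bytes and that the 4.4 GB of model files is almost entirely float16 weights (2 bytes per parameter). The helper names `implied_param_count` and `weight_memory_gb` are hypothetical and not part of any library.

```python
# Rough sanity check of the listed size figures (a sketch under stated
# assumptions, not an official sizing formula).

# Bytes used to store one parameter for a few common dtypes.
BYTES_PER_PARAM = {"float32": 4, "float16": 2, "bfloat16": 2, "int8": 1}


def implied_param_count(file_size_gb: float, dtype: str) -> float:
    """Estimate the parameter count from checkpoint size.

    Assumes 1 GB = 1e9 bytes and ignores file metadata overhead.
    """
    return file_size_gb * 1e9 / BYTES_PER_PARAM[dtype]


def weight_memory_gb(num_params: float, dtype: str) -> float:
    """Memory needed for the weights alone (no activations or KV cache)."""
    return num_params * BYTES_PER_PARAM[dtype] / 1e9


# Values taken from the listing: 4.4 GB of model files stored as float16.
params = implied_param_count(4.4, "float16")
print(f"Implied parameters: {params / 1e9:.1f}B")
print(f"Weights in float32: {weight_memory_gb(params, 'float32'):.1f} GB")
```

Under these assumptions the files imply roughly 2.2 billion parameters, in line with the "2b" size label. Because the listed Required VRAM equals the file size, it most likely covers the weights only; activations and the key/value cache for long contexts would need additional memory.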

Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227