OCRonos Vintage by PleIAs


Tags: Archives, AutoTrain compatible, en, Endpoints compatible, GPT-2, History, OCR, OCR-correction, Pre-train, Region: us, Safetensors, SLM, Text-correction
Model Card on HF 🤗: https://huggingface.co/PleIAs/OCRonos-Vintage

OCRonos Vintage Benchmarks

Scores (nn.n%) indicate how the model compares to the reference models: Anthropic Claude 3.5 Sonnet ("so35"), GPT-4o ("gpt4o"), or GPT-4 ("gpt4").
OCRonos Vintage (PleIAs/OCRonos-Vintage)

OCRonos Vintage Parameters and Internals

Model Type 
text-generation, OCR correction
Use Cases 
Areas:
cultural heritage archives
Applications:
OCR correction
Primary Use Cases:
Historical text correction
Limitations:
Not reliable for correcting text that involves modern concepts
Considerations:
Performs well for documents published between the mid-19th century and mid-20th century
Additional Notes 
Can also simulate historical text generation; pre-training on historical data limits its exposure to modern content.
Supported Languages 
en (OCR-correction quality comparable to GPT-4 or Llama on English-language archives)
Training Details 
Data Sources:
Library of Congress, Internet Archive, Hathi Trust
Data Volume:
18 billion tokens
Methodology:
Pre-trained with llm.c
Context Length:
1024
Training Time:
2.5 hours
Hardware Used:
4 H100s
Model Architecture:
GPT-2 architecture and tokenizer; a custom tokenizer is planned for better performance
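Since the context window is 1024 tokens and the correction is generated as new tokens, long documents have to be split before correction. A minimal sketch follows; the 400-token chunk size is an assumption chosen to leave headroom for the delimiters and the generated output, not a documented value.

```python
# Hypothetical chunking helper: split long OCR text so each prompt plus its
# generated correction fits inside the 1024-token context window.
from transformers import GPT2Tokenizer

tokenizer = GPT2Tokenizer.from_pretrained("PleIAs/OCRonos-Vintage")

def chunk_ocr_text(text: str, max_input_tokens: int = 400) -> list[str]:
    """Split `text` into pieces of at most `max_input_tokens` GPT-2 tokens."""
    ids = tokenizer.encode(text)
    return [
        tokenizer.decode(ids[i : i + max_input_tokens])
        for i in range(0, len(ids), max_input_tokens)
    ]
```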
Input Output 
Input Format:
### Text ###
Accepted Modalities:
text
Output Format:
### Correction ###
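Put together, a correction pass can be scripted with the standard transformers API. In the sketch below, only the `### Text ###` / `### Correction ###` delimiters come from the format above; the sample OCR string and the decoding settings are illustrative assumptions.

```python
# Minimal sketch of an OCR-correction call with OCRonos-Vintage.
import torch
from transformers import GPT2LMHeadModel, GPT2Tokenizer

device = "cuda" if torch.cuda.is_available() else "cpu"
tokenizer = GPT2Tokenizer.from_pretrained("PleIAs/OCRonos-Vintage")
model = GPT2LMHeadModel.from_pretrained("PleIAs/OCRonos-Vintage").to(device)

ocr_text = "Tne Natioual Galery of Arl was foundcd in 1937."  # made-up noisy OCR
prompt = f"### Text ###\n{ocr_text}\n\n### Correction ###\n"

inputs = tokenizer(prompt, return_tensors="pt").to(device)
output = model.generate(
    **inputs,
    max_new_tokens=256,                    # total context is 1024 tokens
    do_sample=False,                       # greedy decoding for corrections
    pad_token_id=tokenizer.eos_token_id,   # GPT-2 has no dedicated pad token
)
corrected = tokenizer.decode(output[0], skip_special_tokens=True)
print(corrected.split("### Correction ###")[-1].strip())
```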
LLM Name: OCRonos Vintage
Repository 🤗: https://huggingface.co/PleIAs/OCRonos-Vintage
Model Size: 124.4M
Required VRAM: 0.2 GB
Updated: 2024-12-21
Maintainer: PleIAs
Model Type: gpt2
Model Files: 0.2 GB
Supported Languages: en
Model Architecture: GPT2LMHeadModel
License: apache-2.0
Model Max Length: 1024
Transformers Version: 4.43.3
Tokenizer Class: GPT2Tokenizer
Vocabulary Size: 50257
Torch Data Type: bfloat16
Activation Function: gelu_new
Errors: replace
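The architecture fields above can be checked against the repository's config without downloading the weights. Expected values (per the table) are noted in the comments; the repo's config.json is authoritative.

```python
# Read only the model config from the Hub and compare with the table above.
from transformers import AutoConfig

cfg = AutoConfig.from_pretrained("PleIAs/OCRonos-Vintage")
print(cfg.model_type)           # expected: "gpt2"
print(cfg.vocab_size)           # expected: 50257
print(cfg.n_positions)          # expected: 1024 (model max length)
print(cfg.activation_function)  # expected: "gelu_new"
print(cfg.torch_dtype)          # expected: torch.bfloat16
```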

Best Alternatives to OCRonos Vintage

Best Alternatives | Context / RAM | Downloads | Likes
PlayPart AI Personal Trainer | 0K / 0.5 GB | 301 | 0
Testmod | 0K / 0.5 GB | 370 | 0
Gpt2 Scratch | 0K / 0.5 GB | 370 | 0
Originos Icn Savant | 0K / 0.5 GB | 367 | 1
DialoGPT Small Garycoleman | 0K / 0.5 GB | 365 | 1
D2nwg Causal Gpt2 V1 | 0K / 0.2 GB | 24 | 0
Quble Test Model V1 Pretrain | 0K / 0.5 GB | 381 | 2
D2nwg Causal Gpt2 | 0K / 0.2 GB | 21 | 0
DialoGPT Medium Loki | 0K / 0.5 GB | 520 | 0
Ftgpt | 0K / 0.2 GB | 22 | 0
Note: a green score (e.g., "73.2") means the listed model is better than PleIAs/OCRonos-Vintage.

Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241217