Git Base Textvqa by microsoft: Benchmarks, Features and Detailed Analysis

Tags: arXiv:2205.14100, AutoTrain compatible, en, git, PyTorch, region:us, safetensors, vision, visual-question-answering

Model Card on HF 🤗: https://huggingface.co/microsoft/git-base-textvqa

Git Base Textvqa Parameters and Internals

Model Type

Transformer, Visual Question Answering

Use Cases

Areas:

Visual Question Answering, Image and Video Captioning, Image Classification

Applications:

Research, Commercial applications

Primary Use Cases:

Visual question answering on the TextVQA dataset

Additional Notes

The checkpoint described here is 'GIT-base', a smaller variant of the GIT model fine-tuned specifically for TextVQA.
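
As a rough illustration of how this checkpoint might be queried, the sketch below uses the Hugging Face `transformers` library. It assumes `transformers`, `torch` and `Pillow` are installed. The function names `build_prompt_ids` and `answer_question` and the image path in the usage note are hypothetical and do not come from the model card.

```python
def build_prompt_ids(question_ids, cls_token_id):
    """Prepend the [CLS] id so the question acts as a text prefix for generation."""
    return [cls_token_id] + list(question_ids)


def answer_question(image_path, question, checkpoint="microsoft/git-base-textvqa"):
    """Generate an answer for one image/question pair.

    The checkpoint is downloaded from the Hugging Face Hub on first use.
    """
    # Heavy dependencies are imported lazily so the helper above works without them.
    import torch
    from PIL import Image
    from transformers import AutoModelForCausalLM, AutoProcessor

    processor = AutoProcessor.from_pretrained(checkpoint)
    model = AutoModelForCausalLM.from_pretrained(checkpoint)

    image = Image.open(image_path).convert("RGB")
    pixel_values = processor(images=image, return_tensors="pt").pixel_values

    # Tokenize the question without special tokens, then add [CLS] manually.
    question_ids = processor(text=question, add_special_tokens=False).input_ids
    input_ids = torch.tensor(
        build_prompt_ids(question_ids, processor.tokenizer.cls_token_id)
    ).unsqueeze(0)

    generated = model.generate(
        pixel_values=pixel_values, input_ids=input_ids, max_length=50
    )
    # The decoded string contains the question followed by the generated answer.
    return processor.batch_decode(generated, skip_special_tokens=True)[0]
```

Calling `answer_question("bus.png", "what does the front of the bus say at the top?")` with a local image (the file name is hypothetical) should return the question text followed by the generated answer.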

Supported Languages

English (Proficient)

Training Details

Data Sources:

COCO, Conceptual Captions (CC3M), SBU, Visual Genome (VG), Conceptual Captions (CC12M), ALT200M, and an additional 0.6B image-text pairs (the sources listed for the full GIT model)

Data Volume:

10 million image-text pairs for the GIT-base variant

Methodology:

Teacher forcing
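
Teacher forcing means that, during training, the decoder is fed the ground-truth previous tokens and not its own predictions. The sketch below shows the resulting input/target alignment in plain Python. The function name `teacher_forcing_pairs` and the token ids are illustrative only and were not produced by the model's tokenizer.

```python
def teacher_forcing_pairs(token_ids):
    """Return (decoder_inputs, targets) for next-token prediction.

    With teacher forcing the decoder always sees the ground-truth prefix,
    so the inputs are the sequence without its last token and the targets
    are the same sequence shifted left by one position.
    """
    if len(token_ids) < 2:
        raise ValueError("need at least two tokens to form an input/target pair")
    return token_ids[:-1], token_ids[1:]


# Illustrative token ids for a short caption (not from a real tokenizer run).
caption = [101, 1037, 2417, 3902, 102]
inputs, targets = teacher_forcing_pairs(caption)
for step, (seen, wanted) in enumerate(zip(inputs, targets)):
    print(f"step {step}: input token {seen} -> target token {wanted}")
```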

Model Architecture:

Transformer decoder conditioned on CLIP image tokens and text tokens.
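
The GIT paper (arXiv:2205.14100) describes this conditioning as a seq2seq attention mask: image tokens attend to each other, while each text token attends to all image tokens and to the text tokens before it. The sketch below builds such a mask in plain Python. It illustrates the pattern only and is not the implementation used in `transformers`. The function name `git_style_attention_mask` is hypothetical.

```python
def git_style_attention_mask(num_image_tokens, num_text_tokens):
    """Build a boolean mask where mask[i][j] is True if position i may attend to j.

    Positions 0..num_image_tokens-1 are image tokens, the rest are text tokens.
    """
    total = num_image_tokens + num_text_tokens
    mask = [[False] * total for _ in range(total)]
    for i in range(total):
        for j in range(total):
            if j < num_image_tokens:
                # Every position (image or text) can see all image tokens.
                mask[i][j] = True
            elif i >= num_image_tokens and j <= i:
                # Text positions see themselves and earlier text tokens only.
                mask[i][j] = True
    return mask


# Print the mask for 2 image tokens followed by 3 text tokens.
for row in git_style_attention_mask(num_image_tokens=2, num_text_tokens=3):
    print("".join("1" if allowed else "0" for allowed in row))
```

Each printed row is one query position: the image rows see only the image columns, and the text rows form a lower-triangular (causal) pattern over the text columns.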

LLM Name	Git Base Textvqa
Repository 🤗	https://huggingface.co/microsoft/git-base-textvqa
Model Name	microsoft/git-base-textvqa
Model Size	177.2M parameters
Required VRAM	0.7 GB
Updated	2024-10-31
Maintainer	microsoft
Model Type	git
Model Files	0.7 GB, 0.7 GB
Supported Languages	en
Model Architecture	GitForCausalLM
License	mit
Context Length	1024
Model Max Length	1024
Tokenizer Class	BertTokenizer
Padding Token	[PAD]
Vocabulary Size	30522
Torch Data Type	float32
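
The listed VRAM figure is consistent with the parameter count and data type in the table: 177.2M parameters at 4 bytes each (float32) is roughly 0.7 GB. The sketch below reproduces that arithmetic. It assumes the figure covers the weights only (no activations or framework overhead), which the listing does not state. The helper name `weight_memory_gb` is hypothetical.

```python
# Bytes used per parameter for a few common torch data types.
BYTES_PER_DTYPE = {"float32": 4, "float16": 2, "bfloat16": 2}


def weight_memory_gb(num_parameters, dtype="float32"):
    """Approximate memory needed for the weights alone, in decimal gigabytes."""
    return num_parameters * BYTES_PER_DTYPE[dtype] / 1e9


params = 177.2e6  # "Model Size" from the table above
print(f"float32: {weight_memory_gb(params):.2f} GB")
print(f"float16: {weight_memory_gb(params, 'float16'):.2f} GB")
```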

Quantized Models of the Git Base Textvqa

Model	Likes	Downloads	VRAM
... Base Textvqa Bnb 4bit Smashed	0	17	0 GB

Original data from HuggingFace, OpenCompass and various public git repos.

Release v20241227
