ConvLLaVA JP 1.3B 768 by toshi456


Tags: Arxiv:2405.15738, Autotrain compatible, Dataset:turing-motors/llava-pr..., Dataset:turing-motors/llava-v1..., Endpoints compatible, Image-captioning, Image-to-text, Instruct, Ja, Llava-jp, Region:us, Safetensors, Sharded, Tensorflow, Vision, Vqa

ConvLLaVA JP 1.3B 768 Benchmarks

(Benchmark scores are shown as percentages ("nn.n%") relative to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o"), or GPT-4 ("gpt4"). Model evaluated: toshi456/ConvLLaVA-JP-1.3b-768.)

ConvLLaVA JP 1.3B 768 Parameters and Internals

Model Type 
vision-language, image-to-text, VQA, image-captioning
Additional Notes 
ConvLLaVA-JP is trained with laion/CLIP-convnext_large_d_320.laion2B-s29B-b131K-ft as the image encoder and llm-jp/llm-jp-1.3b-v1.0 as the text decoder. Input resolution: 768 × 768.
Supported Languages 
ja (Native)
Training Details 
Data Sources:
LLaVA-Pretrain-JA, LLaVA-v1.5-Instruct-620K-JA
Input Output 
Accepted Modalities:
image
Output Format:
text
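
For reference, here is a minimal inference sketch matching this input/output contract. It is an assumption-laden illustration, not the author's published recipe: it assumes the custom LlavaGpt2ForCausalLM code can be loaded through transformers with trust_remote_code=True, that generate() accepts the image tensor via an images keyword, and that a plain resize to 768 × 768 stands in for the real CLIP-ConvNeXt preprocessing. The model repository remains the authoritative reference for the exact pipeline and prompt template.

```python
# Hedged sketch: image captioning / VQA with ConvLLaVA-JP-1.3b-768.
# Assumptions (not confirmed by the model card): the custom architecture
# loads via trust_remote_code=True, generate() takes an `images` kwarg,
# and a naive resize/scale replaces the real CLIP-ConvNeXt preprocessing.
import numpy as np
import torch
from PIL import Image
from transformers import AutoModelForCausalLM, AutoTokenizer

MODEL_ID = "toshi456/ConvLLaVA-JP-1.3b-768"

tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)  # PreTrainedTokenizerFast
model = AutoModelForCausalLM.from_pretrained(
    MODEL_ID,
    torch_dtype=torch.float32,   # checkpoint is shipped in float32
    trust_remote_code=True,      # custom LlavaGpt2ForCausalLM (assumption)
).eval()

# Input resolution is 768 x 768 per the notes above.
image = Image.open("example.jpg").convert("RGB").resize((768, 768))
pixel_values = (
    torch.from_numpy(np.asarray(image)).permute(2, 0, 1).float() / 255.0
).unsqueeze(0)  # shape (1, 3, 768, 768)

# "Describe this image." in Japanese; the real prompt template may differ.
prompt = "<image>\nこの画像について説明してください。"
inputs = tokenizer(prompt, return_tensors="pt")

with torch.no_grad():
    output_ids = model.generate(
        input_ids=inputs["input_ids"],
        images=pixel_values,      # keyword name is an assumption
        max_new_tokens=128,
    )
print(tokenizer.decode(output_ids[0], skip_special_tokens=True))
```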
LLM Name: ConvLLaVA JP 1.3B 768
Repository: https://huggingface.co/toshi456/ConvLLaVA-JP-1.3b-768
Model Size: 1.3b
Required VRAM: 7.1 GB
Updated: 2025-02-22
Maintainer: toshi456
Model Type: llava-jp
Instruction-Based: Yes
Model Files: 5.0 GB (1-of-2), 2.1 GB (2-of-2)
Supported Languages: ja
Model Architecture: LlavaGpt2ForCausalLM
License: cc-by-nc-4.0
Model Max Length: 1532
Transformers Version: 4.38.2
Tokenizer Class: PreTrainedTokenizerFast
Padding Token: <unk|LLM-jp>
Vocabulary Size: 50688
Torch Data Type: float32
Activation Function: gelu
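
The tokenizer fields above are easy to cross-check. The sketch below, assuming only the standard transformers tokenizer API, loads the tokenizer and prints those fields; it also spells out the float32 arithmetic behind the 7.1 GB figure: two shards of 5.0 GB and 2.1 GB at 4 bytes per parameter come to roughly 1.8 billion stored float32 values across the text decoder, vision tower, and projector.

```python
# Hedged sketch: cross-check the metadata listed above against the hosted
# tokenizer. Uses only the standard transformers API; printed values are
# whatever the hosted files actually contain.
from transformers import AutoTokenizer

tok = AutoTokenizer.from_pretrained("toshi456/ConvLLaVA-JP-1.3b-768")

print(type(tok).__name__)    # expected: PreTrainedTokenizerFast
print(tok.model_max_length)  # expected: 1532
print(tok.pad_token)         # expected: <unk|LLM-jp>
print(len(tok))              # expected: 50688 (vocabulary size)

# Back-of-the-envelope check on the 7.1 GB float32 checkpoint:
# (5.0 + 2.1) GB of shards / 4 bytes per float32 parameter.
shard_bytes = (5.0 + 2.1) * 1e9
print(f"~{shard_bytes / 4 / 1e9:.2f}B stored float32 parameters")
```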

Best Alternatives to ConvLLaVA JP 1.3B 768

Best Alternatives | Context / RAM | Downloads | Likes
Llava Jp 1.3B V1.1 | 0K / 6.6 GB | 705 | 11
ConvLLaVA JP 1.3B 1280 | 0K / 7.1 GB | 19 | 1
...3B V1.1 Llava Jp Instruct 108K | 0K / 6.6 GB | 9 | 3
...V1.0 Siglip So400m Patch14 384 | 0K / 6.6 GB | 65 | 0
Llava Jp 1.3B V1.0 | 0K / 6.3 GB | 73 | 5
Note: a green score (e.g. "73.2") indicates that the model is better than toshi456/ConvLLaVA-JP-1.3b-768.

Rank the ConvLLaVA JP 1.3B 768 Capabilities

Have you tried this model? Rate its performance. This feedback helps the ML community identify the most suitable model for their needs. Your contribution really does make a difference!

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  


Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227