ConvLLaVA Pretrain 768 By ConvLLaVA: Benchmarks, Features and Detailed Analysis. Insights on ConvLLaVA Pretrain 768.

Arxiv:2405.15738 Autotrain compatible Dataset:freedomintelligence/al... Dataset:lin-chen/sharegpt4v Dataset:vision-flan/vision-fla... Endpoints compatible Llava Pytorch Region:us Sharded Vision

Model Card on HF 🤗: https://huggingface.co/ConvLLaVA/ConvLLaVA-pretrain-768

ConvLLaVA Pretrain 768 Benchmarks

LLME Score: 0.18865

^nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").

What is the LLM Explorer Rank (Score)

ConvLLaVA Pretrain 768 (ConvLLaVA/ConvLLaVA-pretrain-768)

ConvLLaVA Pretrain 768 Parameters and Internals

Model Type

auto-regressive language model, transformer, multimodal

Use Cases

Primary Use Cases:

Research on large multimodal models and chatbots

Training Details

Data Sources:

1.2M ShareGPT4V-PT caption data, 100K ShareGPT4V caption data, 1.4M ALLaVA caption and instruction data, 186K VFLAN multitask data, 158K GPT-generated multimodal instruction-following data, 500K academic-task-oriented VQA data mixture, 40K ShareGPT data

Methodology:

Fine-tuning LLM on multimodal instruction-following data

Model Architecture:

Transformer

LLM Name	ConvLLaVA Pretrain 768
Repository 🤗	https://huggingface.co/ConvLLaVA/ConvLLaVA-pretrain-768
Required VRAM	14.9 GB
Updated	2025-02-22
Maintainer	ConvLLaVA
Model Type	llava
Model Files	10.0 GB: 1-of-2 4.9 GB: 2-of-2
Model Architecture	LlavaLlamaForCausalLM
Context Length	4096
Model Max Length	4096
Transformers Version	4.33.2
Tokenizer Class	LlamaTokenizer
Beginning of Sentence Token	<s>
End of Sentence Token	</s>
Unk Token	<unk>
Vocabulary Size	32000
Torch Data Type	bfloat16

Best Alternatives to ConvLLaVA Pretrain 768

Best Alternatives	Context / RAM	Downloads	Likes
ConvLLaVA Pretrain 1536	4K / 14.9 GB	23	2
ConvLLaVA Sft 768	4K / 14.9 GB	25	1
ConvLLaVA Sft 1536	4K / 14.9 GB	12	0
Dermatology LLaVA	4K / 14.7 GB	10	0
Uni MoE Speech Base Interval	4K / 54.6 GB	7	1
Uni MoE Speech Base	4K / 80.5 GB	13	1
Uni MoE Audio Base	4K / 53.4 GB	5	1
Qilin Med VL Chat	4K / 26 GB	48	1
Qilin Med VL	4K / 26.6 GB	23	5
...iGraph 100 Percent Lora Merged	4K / 26.6 GB	16	0

Note: green Score (e.g. "73.2") means that the model is better than ConvLLaVA/ConvLLaVA-pretrain-768.

Rank the ConvLLaVA Pretrain 768 Capabilities

🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟

Instruction Following and Task Automation
Factuality and Completeness of Knowledge
Censorship and Alignment
Data Analysis and Insight Generation
Text Generation
Text Summarization and Feature Extraction
Code Generation
Multi-Language Support and Translation

What open-source LLMs or SLMs are you in search of? 43470 in total.

Email us: info@extractum.io. Our Privacy Policy | Terms and Conditions | Suggest an improvement.

Our Social Media →

Original data from HuggingFace, OpenCompass and various public git repos.

Release v20241227

Support LLM Explorer

ConvLLaVA Pretrain 768 by ConvLLaVA

» All LLMs » ConvLLaVA » ConvLLaVA Pretrain 768 URL Share it on

ConvLLaVA Pretrain 768 Benchmarks

ConvLLaVA Pretrain 768 Parameters and Internals

Best Alternatives to ConvLLaVA Pretrain 768

Rank the ConvLLaVA Pretrain 768 Capabilities

What open-source LLMs or SLMs are you in search of? 43470 in total.