LLaMAntino 3 ANITA 8B Inst DPO ITA by swap-uniba


Merged Model, Arxiv:2312.09993, Arxiv:2405.07101, Autotrain compatible, Base model:meta-llama/meta-lla..., Conversational, Dataset:chat-error/wizard alpa..., Dataset:gsarti/clean mc4 it, Dataset:mlabonne/orpo-dpo-mix-..., En, Endpoints compatible, Facebook, Instruct, It, License:llama3, Llama, Llama-3, Llamantino, Meta, Model-index, Moe, PyTorch, Region:us, Safetensors, Sharded, Tensorflow

LLaMAntino 3 ANITA 8B Inst DPO ITA Benchmarks

LLaMAntino 3 ANITA 8B Inst DPO ITA (swap-uniba/LLaMAntino-3-ANITA-8B-Inst-DPO-ITA)

Best Alternatives to LLaMAntino 3 ANITA 8B Inst DPO ITA

Best Alternatives                    HF Rank   Context / RAM
...ama 3 SauerkrautLM 8B Instruct    73.74     8K / 16.1 GB    2941347
Llama 3 8B Instruct V0.8             73.17     8K / 16 GB      28651
Barcenas Llama3 8B ORPO              72.5      8K / 16.1 GB    86096
Llama 3 Stella 8B                    72.17     8K / 16 GB      13271
Llama3 8B Spaetzle V13               71.26     8K / 16 GB      16430
Llama 3 8B Instruct V0.4             70.3      8K / 16 GB      34352
Llama 3 Stinky V2 8B                 70.27     8K / 16 GB      15884
ChimeraLlama 3 8B V3                 70.06     8K / 16 GB      253114
Meta Llama 3 8B Instruct DPO         69.84     8K / 16.1 GB    13724
ChimeraLlama 3 8B V2                 69.69     8K / 16 GB      132614

LLaMAntino 3 ANITA 8B Inst DPO ITA Parameters and Internals

LLM Name: LLaMAntino 3 ANITA 8B Inst DPO ITA
Repository: swap-uniba/LLaMAntino-3-ANITA-8B-Inst-DPO-ITA (Hugging Face)
Model Creator: Marco Polignano - SWAP Research Group
Base Model(s): Meta Llama 3 8B Instruct (meta-llama/Meta-Llama-3-8B-Instruct)
Merged Model: Yes
Model Size: 8B
Required VRAM: 16.1 GB
Model Type: llama
Model Files: 5.0 GB (1-of-4), 5.0 GB (2-of-4), 4.9 GB (3-of-4), 1.2 GB (4-of-4)
Supported Languages: en, it
Model Architecture: LlamaForCausalLM
Context Length: 8192
Model Max Length: 8192
Transformers Version: 4.40.0.dev0
Tokenizer Class: PreTrainedTokenizerFast
Vocabulary Size: 128256
Initializer Range: 0.02
Torch Data Type: bfloat16
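
The figures in this listing can be cross-checked with a little arithmetic: the four model files should add up to the Required VRAM value, and dividing that total by 2 bytes per bfloat16 weight should give roughly the stated 8B model size. The sketch below does that check and also shows one plausible way to load the model in bfloat16 with the Hugging Face transformers library. The helper names (`total_checkpoint_gb`, `approx_param_count`, `load_model`) are hypothetical and exist only for illustration, 1 GB is assumed to mean 10^9 bytes, and the loader assumes `torch` and `transformers` are installed and the repository is reachable.

```python
# Values copied from the parameter listing above.
SHARD_SIZES_GB = [5.0, 5.0, 4.9, 1.2]  # the four model files (1-of-4 .. 4-of-4)
BYTES_PER_PARAM = 2                    # bfloat16 stores each weight in 2 bytes
REPO_ID = "swap-uniba/LLaMAntino-3-ANITA-8B-Inst-DPO-ITA"


def total_checkpoint_gb(shard_sizes_gb):
    """Sum the shard sizes, rounded to one decimal place like the listing."""
    return round(sum(shard_sizes_gb), 1)


def approx_param_count(total_gb, bytes_per_param=BYTES_PER_PARAM):
    """Rough parameter count implied by the checkpoint size.

    Assumption: 1 GB = 1e9 bytes and the files contain only weights.
    """
    return total_gb * 1e9 / bytes_per_param


def load_model(repo_id=REPO_ID):
    """Load tokenizer and model weights in bfloat16 (hypothetical helper).

    Imports are local so the arithmetic above runs without torch installed.
    """
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(repo_id)
    model = AutoModelForCausalLM.from_pretrained(
        repo_id,
        torch_dtype=torch.bfloat16,  # matches "Torch Data Type" in the listing
    )
    return tokenizer, model


if __name__ == "__main__":
    total = total_checkpoint_gb(SHARD_SIZES_GB)
    print(f"checkpoint size: {total} GB")
    print(f"implied parameters: {approx_param_count(total) / 1e9:.2f}B")
```

The shard total comes to 16.1 GB, which matches the Required VRAM entry, so that figure appears to cover the weights only. Activations and the key-value cache for an 8192-token context need additional memory on top of it.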

Original data from HuggingFace, OpenCompass and various public git repos.
Release v2024042801