Tulu 2 DPO 7B by allenai


Tags: Arxiv:2305.18290, Arxiv:2311.10702, Autotrain compatible, Base model:finetune:meta-llama..., Base model:meta-llama/llama-2-..., Conversational, Dataset:allenai/tulu-v2-sft-mi..., Dataset:huggingfaceh4/ultrafee..., En, Endpoints compatible, Llama, Pytorch, Region:us, Sharded
Model Card on HF: https://huggingface.co/allenai/tulu-2-dpo-7b


Tulu 2 DPO 7B Parameters and Internals

Model Type 
instruction, RLHF tuned, chat model
Use Cases 
Areas:
chat assistants, instructional response models
Applications:
assistive artificial intelligence
Primary Use Cases:
handling diverse instructions and queries
Limitations:
not aligned for generating safe completions, potential to produce problematic outputs
Additional Notes 
Model card provides details on alignment, performance, and intended use.
Supported Languages 
en (primary)
Training Details 
Data Sources:
HuggingFaceH4/ultrafeedback_binarized, allenai/tulu-v2-sft-mixture, openbmb/UltraFeedback
Methodology:
Direct Preference Optimization (DPO)
Model Architecture:
fine-tuned version of Llama 2
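
The Direct Preference Optimization objective named under Methodology (Arxiv:2305.18290) can be illustrated for a single preference pair. The following is a minimal sketch of the per-pair loss, not the training code used for this model; the function name, the `beta` value, and the example log-probabilities are illustrative assumptions.

```python
import math

def dpo_pair_loss(policy_chosen_logp: float,
                  policy_rejected_logp: float,
                  ref_chosen_logp: float,
                  ref_rejected_logp: float,
                  beta: float = 0.1) -> float:
    """Per-pair DPO loss: -log(sigmoid(beta * (chosen log-ratio - rejected log-ratio))).

    Each argument is the summed log-probability of a full response under
    either the policy being trained or the frozen reference model.
    beta=0.1 is an illustrative default, not a value taken from this page.
    """
    chosen_logratio = policy_chosen_logp - ref_chosen_logp
    rejected_logratio = policy_rejected_logp - ref_rejected_logp
    margin = beta * (chosen_logratio - rejected_logratio)
    # -log(sigmoid(m)) equals softplus(-m); this form avoids overflow for large |m|.
    return max(-margin, 0.0) + math.log1p(math.exp(-abs(margin)))

# Hypothetical log-probabilities, chosen only to show the behaviour.
loss_at_init = dpo_pair_loss(-12.0, -15.0, -12.0, -15.0)   # policy == reference
loss_improved = dpo_pair_loss(-10.0, -18.0, -12.0, -15.0)  # policy favours the chosen answer more
print(loss_at_init, loss_improved)
```

When the policy matches the reference, the margin is zero and the loss equals log 2; the loss falls as the policy raises the chosen response's likelihood relative to the rejected one, measured against the reference.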
Input Output 
Input Format:
<|user|> Your message here! <|assistant|>
Accepted Modalities:
text
Output Format:
text
Performance Tips:
Include a newline after '<|assistant|>' for better generation quality.
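
The input format and the newline tip above can be combined in a small helper. This is a sketch under one assumption: the separators that appear as spaces in the flattened format line are newlines, as on the upstream model card. The function name `build_tulu_prompt` is hypothetical.

```python
def build_tulu_prompt(user_message: str) -> str:
    """Wrap a single user message in the Tulu chat markers.

    The trailing newline after '<|assistant|>' follows the performance tip
    above; the newlines around the message are an assumption about the
    original, unflattened template.
    """
    return f"<|user|>\n{user_message}\n<|assistant|>\n"

prompt = build_tulu_prompt("Your message here!")
print(repr(prompt))
```

The resulting string can be passed as plain text to whichever inference stack loads the model; generation then continues from the line after `<|assistant|>`.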
LLM Name: Tulu 2 DPO 7B
Repository: https://huggingface.co/allenai/tulu-2-dpo-7b
Base Model(s): Llama 2 7B Hf (meta-llama/Llama-2-7b-hf)
Model Size: 7b
Required VRAM: 13.5 GB
Updated: 2025-02-22
Maintainer: allenai
Model Type: llama
Model Files: 10.0 GB (1-of-2), 3.5 GB (2-of-2)
Supported Languages: en
Model Architecture: LlamaForCausalLM
License: other
Context Length: 8192
Model Max Length: 8192
Transformers Version: 4.33.2
Tokenizer Class: LlamaTokenizer
Beginning of Sentence Token: <s>
End of Sentence Token: </s>
Unk Token: <unk>
Vocabulary Size: 32000
Torch Data Type: bfloat16
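
The Required VRAM figure of 13.5 GB matches both the sum of the two model files and a simple weights-only estimate for bfloat16. The sketch below shows that arithmetic; the parameter count of roughly 6.74 billion is an assumption (the approximate size of Llama 2 7B) and does not appear on this page.

```python
# Bytes used to store one parameter in each data type.
BYTES_PER_PARAM = {"bfloat16": 2, "float16": 2, "float32": 4}

def estimate_weight_gb(num_params: float, dtype: str = "bfloat16") -> float:
    """Memory needed for the weights alone, in decimal gigabytes."""
    return num_params * BYTES_PER_PARAM[dtype] / 1e9

APPROX_PARAMS = 6.74e9  # assumption: approximate Llama 2 7B parameter count

weights_gb = estimate_weight_gb(APPROX_PARAMS)  # bfloat16, as listed above
shard_total_gb = 10.0 + 3.5                     # the two model files listed above

print(f"estimated weights: {weights_gb:.2f} GB, shard total: {shard_total_gb} GB")
```

This covers the weights only; the KV cache and activations at longer context lengths need additional memory on top of it.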

Quantized Models of the Tulu 2 DPO 7B

Model
Likes
Downloads
VRAM
Tulu 2 DPO 7B GGUF32922 GB
Tulu 2 DPO 7B AWQ0843 GB
Tulu 2 DPO 7B GPTQ0823 GB

Best Alternatives to Tulu 2 DPO 7B

Best Alternatives
Context / RAM
Downloads
Likes
2 Very Sci Fi1024K / 16.1 GB3170
...1M 1000000ctx AEZAKMI 3 1 17021024K / 13.5 GB231
... Qwen2.5llamaify 7B V23.1 200K195K / 15.2 GB39433
LlamaStock 8B128K / 16.1 GB111
SuperNeuralDreadDevil 8B128K / 16.1 GB541
Yarn Llama 2 7B 128K128K / 13.5 GB642239
LLaMA 7B PoSE YaRN 128K128K / 13.5 GB73
LLaMA 7B PoSE Linear 96K96K / 27 GB92
LLaMA 7B PoSE YaRN 96K96K / 13.5 GB111
Chat Llama2 7B 80K80K / 13.8 GB80

Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227