Selfbiorag 7B Wo Kqa Golden Iter DPO Step4 Filtered by Minbyul


Tags: Alignment-handbook, Autotrain compatible, Base model (finetune): Minbyul/selfbiorag-7b-wo-kqa_golden-iter-dpo-step3-filtered, Dataset: HuggingFaceH4/ultrafeedback_binarized, DPO, Endpoints compatible, Generated from trainer, Llama, Region: US, Safetensors, Sharded, TensorFlow, TRL

Selfbiorag 7B Wo Kqa Golden Iter DPO Step4 Filtered Benchmarks

Benchmark scores (nn.n%) show how the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o"), or GPT-4 ("gpt4").
Selfbiorag 7B Wo Kqa Golden Iter DPO Step4 Filtered (Minbyul/selfbiorag-7b-wo-kqa_golden-iter-dpo-step4-filtered)

Selfbiorag 7B Wo Kqa Golden Iter DPO Step4 Filtered Parameters and Internals

Additional Notes 
Training procedure and hyperparameters (a sketch mapping these onto a TRL DPO run follows the training data below):
learning_rate: 5e-07
train_batch_size: 8 (per device)
eval_batch_size: 8 (per device)
seed: 42
distributed_type: multi-GPU
num_devices: 4
gradient_accumulation_steps: 2
total_train_batch_size: 64
total_eval_batch_size: 32
optimizer: Adam with betas=(0.9, 0.999) and epsilon=1e-08
lr_scheduler_type: cosine
lr_scheduler_warmup_ratio: 0.1
num_epochs: 1
Training Details 
Data Sources:
HuggingFaceH4/ultrafeedback_binarized
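
The DPO and TRL tags above suggest these hyperparameters describe a trl DPOTrainer run on the listed preference data. Below is a minimal, hypothetical sketch of how they might map onto that API; it assumes a recent trl release (where DPOConfig exists and the tokenizer is passed as processing_class; older releases use tokenizer=), and the maintainer's actual alignment-handbook recipe may differ.

```python
from datasets import load_dataset
from transformers import AutoModelForCausalLM, AutoTokenizer
from trl import DPOConfig, DPOTrainer

# Base checkpoint from the listing; DPO step 4 starts from the step-3 model.
base = "Minbyul/selfbiorag-7b-wo-kqa_golden-iter-dpo-step3-filtered"
tokenizer = AutoTokenizer.from_pretrained(base)
model = AutoModelForCausalLM.from_pretrained(base, torch_dtype="bfloat16")

# Preference data listed under Data Sources.
dataset = load_dataset("HuggingFaceH4/ultrafeedback_binarized")

config = DPOConfig(
    output_dir="selfbiorag-7b-dpo-step4",  # hypothetical output path
    learning_rate=5e-7,
    per_device_train_batch_size=8,  # x 4 GPUs x 2 accumulation steps = 64 total
    per_device_eval_batch_size=8,
    gradient_accumulation_steps=2,
    num_train_epochs=1,
    lr_scheduler_type="cosine",
    warmup_ratio=0.1,
    seed=42,
    bf16=True,  # matches the bfloat16 torch dtype listed below
)

trainer = DPOTrainer(
    model=model,               # ref_model defaults to a frozen copy of the policy
    args=config,
    train_dataset=dataset["train_prefs"],
    eval_dataset=dataset["test_prefs"],
    processing_class=tokenizer,
)
trainer.train()
```

With 4 GPUs, a per-device batch of 8, and 2 accumulation steps, the effective train batch size is 8 × 4 × 2 = 64, matching the total listed above.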
LLM Name: Selfbiorag 7B Wo Kqa Golden Iter DPO Step4 Filtered
Repository: https://huggingface.co/Minbyul/selfbiorag-7b-wo-kqa_golden-iter-dpo-step4-filtered
Base Model(s): Minbyul/selfbiorag-7b-wo-kqa_golden-iter-dpo-step3-filtered
Model Size: 7B
Required VRAM: 13.5 GB
Updated: 2025-02-22
Maintainer: Minbyul
Model Type: llama
Model Files: 4.9 GB (shard 1 of 3), 5.0 GB (shard 2 of 3), 3.6 GB (shard 3 of 3)
Model Architecture: LlamaForCausalLM
Context Length: 4096
Model Max Length: 4096
Transformers Version: 4.39.0.dev0
Tokenizer Class: LlamaTokenizer
Padding Token: <pad>
Vocabulary Size: 32016
Torch Data Type: bfloat16
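
Given the listed metadata (LlamaForCausalLM, bfloat16 weights split across three safetensors shards, 4096-token context), loading the checkpoint follows the standard transformers pattern. A minimal sketch; the biomedical prompt is a made-up example, not from the model card:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

repo = "Minbyul/selfbiorag-7b-wo-kqa_golden-iter-dpo-step4-filtered"

tokenizer = AutoTokenizer.from_pretrained(repo)  # LlamaTokenizer, vocab size 32016
model = AutoModelForCausalLM.from_pretrained(
    repo,
    torch_dtype=torch.bfloat16,  # matches the checkpoint dtype; ~13.5 GB of VRAM
    device_map="auto",           # requires `accelerate`; places shards automatically
)

# Hypothetical prompt: SelfBioRAG is a biomedical model, so a medical QA query.
prompt = "Question: What is the mechanism of action of metformin?\nAnswer:"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=256, do_sample=False)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```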

Best Alternatives to Selfbiorag 7B Wo Kqa Golden Iter DPO Step4 Filtered

Best Alternatives                        Context / RAM      Downloads  Likes
2 Very Sci Fi                            1024K / 16.1 GB    317        0
...1M 1000000ctx AEZAKMI 3 1 1702        1024K / 13.5 GB    23         1
... Qwen2.5llamaify 7B V23.1 200K        195K / 15.2 GB     3943       3
LlamaStock 8B                            128K / 16.1 GB     11         1
SuperNeuralDreadDevil 8B                 128K / 16.1 GB     54         1
Yarn Llama 2 7B 128K                     128K / 13.5 GB     6422       39
LLaMA 7B PoSE YaRN 128K                  128K / 13.5 GB     7          3
LLaMA 7B PoSE Linear 96K                 96K / 27 GB        9          2
LLaMA 7B PoSE YaRN 96K                   96K / 13.5 GB      11         1
Chat Llama2 7B 80K                       80K / 13.8 GB      8          0
Note: a green score (e.g. "73.2") means the model is better than Minbyul/selfbiorag-7b-wo-kqa_golden-iter-dpo-step4-filtered.

Rank the Selfbiorag 7B Wo Kqa Golden Iter DPO Step4 Filtered Capabilities

Have you tried this model? Rate its performance. Your feedback helps the ML community identify the most suitable models for their needs.

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  


Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227