MayankDPOPhi 3 Mini by MayankRaj

 ยป  All LLMs  ยป  MayankRaj  ยป  MayankDPOPhi 3 Mini   URL Share it on

  Autotrain compatible   Custom code   Dataset:intel/orca dpo pairs   De   En   Endpoints compatible   Instruct   License:mit   Phi3   Region:us   Safetensors   Sharded   Tensorflow

MayankDPOPhi 3 Mini Parameters and Internals

Model Type 
Transformer
Use Cases 
Applications:
Text-to-text generation
Primary Use Cases:
Summarizing factual topics, Generating code comments, Creating concise instructions
Limitations:
Potential biases in training data, May generate factually incorrect information
Additional Notes 
The model generates more informative and concise responses compared to out-of-the-box large language models. The DPO approach helps the model adapt to expected response formats, reducing the number of tokens needed for instructions.
Supported Languages 
En (NLP proficiency), De ()
Training Details 
Data Sources:
Intel/orca_dpo_pairs
Methodology:
Direct Preference Optimization (DPO)
LLM NameMayankDPOPhi 3 Mini
Repository ๐Ÿค—https://huggingface.co/MayankRaj/MayankDPOPhi-3-Mini 
Model Size3.8b
Required VRAM7.7 GB
Updated2024-07-04
MaintainerMayankRaj
Model Typephi3
Instruction-BasedYes
Model Files  5.0 GB: 1-of-2   2.7 GB: 2-of-2
Model ArchitecturePhi3ForCausalLM
Licensemit
Context Length4096
Model Max Length4096
Transformers Version4.41.1
Vocabulary Size32064
Torch Data Typefloat16

Best Alternatives to MayankDPOPhi 3 Mini

Best Alternatives
Context / RAM
Downloads
Likes
Phi 3.5 Mini Instruct128K / 7.7 GB721615819
Phi 3 Mini 128K Instruct128K / 7.7 GB1741761636
NuExtract 1.5128K / 7.7 GB115378198
NuExtract V1.5128K / 7.7 GB10851189
Phi 3.5 Mini TitanFusion 0.1128K / 7.7 GB1650
Glider128K / 15.4 GB150436
Saka 3.8B128K / 7.7 GB3091
ECE EIFFEL 3Bv2128K / 7.7 GB100
Samantha2.0 Phi 3.5 Mini ITA128K / 7.7 GB41210
Artemide 3.5128K / 7.7 GB74532
Note: green Score (e.g. "73.2") means that the model is better than MayankRaj/MayankDPOPhi-3-Mini.

Rank the MayankDPOPhi 3 Mini Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 43470 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227