FinetunedModelV2.0 by CitrusBoy

 ยป  All LLMs  ยป  CitrusBoy  ยป  FinetunedModelV2.0   URL Share it on

  4-bit   Adapter Base model:adapter:microsoft/p... Base model:microsoft/phi-3-min...   Bitsandbytes   Custom code   Finetuned   Generated from trainer   Instruct   Lora   Peft   Phi3   Region:us   Safetensors   Sft   Tensorboard   Trl

FinetunedModelV2.0 Benchmarks

nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").
FinetunedModelV2.0 (CitrusBoy/FinetunedModelV2.0)

FinetunedModelV2.0 Parameters and Internals

Additional Notes 
This model is a fine-tuned version. More information is needed on its description, intended uses, limitations, and evaluation data.
LLM NameFinetunedModelV2.0
Repository ๐Ÿค—https://huggingface.co/CitrusBoy/FinetunedModelV2.0 
Base Model(s)  Phi 3 Mini 128K Instruct   microsoft/Phi-3-mini-128k-instruct
Model Size2.1b
Required VRAM0 GB
Updated2025-02-16
MaintainerCitrusBoy
Instruction-BasedYes
Model Files  0.0 GB   2.9 GB   0.0 GB
Model ArchitectureAdapter
Licensemit
Model Max Length131072
Is Biasednone
Tokenizer ClassLlamaTokenizer
Padding Token<|endoftext|>
PEFT TypeLORA
LoRA ModelYes
PEFT Target Modulesk_proj|up_proj|v_proj|gate_proj|o_proj|q_proj|down_proj
LoRA Alpha8
LoRA Dropout0
R Param8

Quantized Models of the FinetunedModelV2.0

Model
Likes
Downloads
VRAM
Phi 3 Mini 4K Instruct Q40662 GB

Rank the FinetunedModelV2.0 Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 43470 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227