Pygmalion 6B Chaicomp by alkahestry


Tags: Autotrain compatible, Endpoints compatible, Gptj, Lora, Pytorch, Region:us, Sharded

Pygmalion 6B Chaicomp (alkahestry/pygmalion-6b-chaicomp)

Pygmalion 6B Chaicomp Parameters and Internals

Model Type: Chatbot
Additional Notes: Model fine-tuned as an entry to the Chai competition
Training Details:
  Data Sources: SODA, TeacherGPT
  Data Volume: 250k samples
  Methodology: QLoRA fine-tuning
  Context Length: 512
  Training Time: 24 hours
  Hardware Used: RTX4090
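
The training figures above allow a rough throughput estimate. The sketch below is only back-of-envelope arithmetic: the page does not say how many epochs were run or how long the samples were, so it assumes a single pass over the data with every sample filling the full 512-token context. The token figures are therefore upper bounds, not measured values.

```python
# Back-of-envelope throughput from the listed training details.
# Assumption (not stated on the page): one pass over the data, with every
# sample filling the full 512-token context, so token figures are upper bounds.
SAMPLES = 250_000        # "250k samples"
CONTEXT_LENGTH = 512     # training context length
TRAINING_HOURS = 24      # "24 hours" on a single RTX4090

seconds = TRAINING_HOURS * 3600
samples_per_second = SAMPLES / seconds
max_tokens = SAMPLES * CONTEXT_LENGTH
max_tokens_per_second = max_tokens / seconds

print(f"samples/s: {samples_per_second:.2f}")
print(f"tokens seen (upper bound): {max_tokens:,}")
print(f"tokens/s (upper bound): {max_tokens_per_second:.0f}")
```
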
LLM Name: Pygmalion 6B Chaicomp
Repository: https://huggingface.co/alkahestry/pygmalion-6b-chaicomp
Model Size: 6b
Required VRAM: 16.3 GB
Updated: 2025-02-05
Maintainer: alkahestry
Model Files: 0.0 GB, 10.4 GB (1-of-2), 5.9 GB (2-of-2)
Model Architecture: AutoModelForCausalLM
Model Max Length: 1024
Is Biased: none
Tokenizer Class: GPT2Tokenizer
Beginning of Sentence Token: <|endoftext|>
End of Sentence Token: <|endoftext|>
Unk Token: <|endoftext|>
PEFT Type: LORA
LoRA Model: Yes
PEFT Target Modules: q_proj|v_proj
LoRA Alpha: 32
LoRA Dropout: 0.05
R Param: 8
Errors: replace
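
The LoRA settings and file sizes listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, not the maintainer's code. It uses the standard LoRA formulation, in which the low-rank update is scaled by alpha / r and each adapted weight matrix of shape (d_out, d_in) gains r * (d_in + d_out) trainable parameters. The hidden size (4096) and layer count (28) are assumptions taken from the published GPT-J 6B configuration; they are not stated on this page.

```python
# Values copied from the listing above.
LORA_R = 8
LORA_ALPHA = 32
TARGET_MODULES = "q_proj|v_proj".split("|")
SHARD_SIZES_GB = [10.4, 5.9]
REQUIRED_VRAM_GB = 16.3

# Assumptions, NOT from this page: dimensions of the published GPT-J 6B config.
HIDDEN_SIZE = 4096
NUM_LAYERS = 28

# LoRA scales the low-rank update B @ A by alpha / r.
scaling = LORA_ALPHA / LORA_R

# Each adapted square projection adds A (r x d) and B (d x r).
params_per_module = LORA_R * (HIDDEN_SIZE + HIDDEN_SIZE)
adapter_params = params_per_module * len(TARGET_MODULES) * NUM_LAYERS

# The two weight shards should add up to the listed VRAM requirement.
shard_total_gb = round(sum(SHARD_SIZES_GB), 1)

print(f"LoRA scaling factor: {scaling}")
print(f"Trainable adapter parameters: {adapter_params:,}")
print(f"Shard total: {shard_total_gb} GB (listed: {REQUIRED_VRAM_GB} GB)")
```

Under these assumptions the adapter is a tiny fraction of the 6B base weights, which is consistent with QLoRA training fitting on a single consumer GPU.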

Best Alternatives to Pygmalion 6B Chaicomp

Model                            Context / RAM    Downloads, Likes
Gpt2 A1 Model                    0K / 0.9 GB      50
Yi 6B 200K AEZAKMI V2 LoRA       0K / 0.1 GB      51
Deci Finetuned Alpaca Cleaned    0K / 11.5 GB     140
Codegen 6B Lora                  0K / 0 GB        102

Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227