Limarpv3 Llama2 70B Qlora by Doctor-Shotgun


Tags: 4-bit, Autotrain compatible, Bitsandbytes, Endpoints compatible, Generated from trainer, Llama, Lora, Region:us


Limarpv3 Llama2 70B Qlora Parameters and Internals

Model Type 
roleplaying, chat
Use Cases 
Primary Use Cases:
Longform-oriented, novel-style roleplaying chat model intended to replicate the experience of 1-on-1 roleplay on Internet forums.
Limitations:
Short-form, IRC/Discord-style RP (aka "Markdown format") is not supported.
The model will show biases similar to those observed in niche roleplaying forums on the Internet, in addition to those exhibited by the base model.
Additional Notes 
General format and functionality inspired by the previously named "Roleplay" preset in SillyTavern.
Training Details 
Data Sources:
LimaRP v3 dataset
Methodology:
Trained without the preliminary pretraining stage on stories. No instruction tuning is included; the training data consists only of manually picked and slightly edited RP conversations with persona and scenario data.
Input Output 
Input Format:
Extended Alpaca format
Performance Tips:
Use a length modifier in the response instruction sequence for better control over response length.
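
The input format and length tip above can be illustrated with a small prompt builder. This is a minimal sketch under assumptions: the listing only says "Extended Alpaca format", so the section labels (`### Instruction:`, `### Input:`, `### Response:`), the persona and scenario field names, the `(length = medium)` suffix, and the function name `build_prompt` are all hypothetical and should be checked against the model card in the repository.

```python
from typing import List, Optional, Tuple


def build_prompt(
    char_persona: str,
    user_persona: str,
    scenario: str,
    turns: List[Tuple[str, str]],
    length: Optional[str] = None,
) -> str:
    """Assemble an Alpaca-style roleplay prompt (hypothetical layout).

    `turns` is a list of (role, text) pairs, where role is "user" or "char".
    The prompt ends with an open response header so the model can write
    the next character turn.
    """
    parts = [
        "### Instruction:",
        f"Character's Persona: {char_persona}",
        "",
        f"User's Persona: {user_persona}",
        "",
        f"Scenario: {scenario}",
        "",
    ]
    for role, text in turns:
        if role == "user":
            parts += ["### Input:", f"User: {text}", ""]
        elif role == "char":
            parts += ["### Response:", f"Character: {text}", ""]
        else:
            raise ValueError(f"unknown role: {role!r}")

    # Assumed syntax: the length modifier is appended to the final
    # response header only, to steer the length of the next reply.
    header = "### Response:"
    if length is not None:
        header += f" (length = {length})"
    parts += [header, "Character:"]
    return "\n".join(parts)


prompt = build_prompt(
    "A stoic knight.",
    "A travelling bard.",
    "They meet at a roadside inn.",
    [("user", "Hello there.")],
    length="medium",
)
print(prompt)
```

Putting the modifier only on the last response header keeps earlier turns unchanged while still steering the reply being generated.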
LLM Name: Limarpv3 Llama2 70B Qlora
Repository: https://huggingface.co/Doctor-Shotgun/limarpv3-llama2-70b-qlora
Model Size: 70b
Required VRAM: 1.7 GB
Updated: 2025-02-22
Maintainer: Doctor-Shotgun
Model Files: 1.7 GB
Model Architecture: AutoModelForCausalLM
License: apache-2.0
Is Biased: none
Tokenizer Class: LlamaTokenizer
Padding Token: </s>
PEFT Type: LORA
LoRA Model: Yes
PEFT Target Modules: o_proj|v_proj|gate_proj|down_proj|k_proj|up_proj|q_proj
LoRA Alpha: 16
LoRA Dropout: 0.05
R Param: 32
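
As a rough sanity check on the LoRA settings listed above, the sketch below computes the standard LoRA scaling factor (alpha divided by rank) and estimates the adapter's parameter count. The layer shapes are assumptions taken from the published Llama 2 70B architecture (hidden size 8192, MLP size 28672, 80 layers, 8 key/value heads of dimension 128) and do not appear in this listing; `LoraSettings` and `adapter_param_count` are illustrative names, not part of any library.

```python
from dataclasses import dataclass
from typing import Dict, Tuple


@dataclass(frozen=True)
class LoraSettings:
    """LoRA hyperparameters as given in the listing."""
    r: int = 32
    alpha: int = 16
    dropout: float = 0.05
    target_modules: Tuple[str, ...] = tuple(
        "o_proj|v_proj|gate_proj|down_proj|k_proj|up_proj|q_proj".split("|")
    )

    @property
    def scaling(self) -> float:
        # Standard LoRA scales the low-rank update by alpha / r.
        return self.alpha / self.r


# Assumed Llama 2 70B shapes as (in_features, out_features).
HIDDEN = 8192
INTERMEDIATE = 28672
KV_DIM = 8 * 128  # grouped-query attention: 8 KV heads of size 128
NUM_LAYERS = 80

SHAPES: Dict[str, Tuple[int, int]] = {
    "q_proj": (HIDDEN, HIDDEN),
    "k_proj": (HIDDEN, KV_DIM),
    "v_proj": (HIDDEN, KV_DIM),
    "o_proj": (HIDDEN, HIDDEN),
    "gate_proj": (HIDDEN, INTERMEDIATE),
    "up_proj": (HIDDEN, INTERMEDIATE),
    "down_proj": (INTERMEDIATE, HIDDEN),
}


def adapter_param_count(cfg: LoraSettings) -> int:
    """Each adapted layer adds A (r x in) and B (out x r): r * (in + out)."""
    per_layer = sum(
        cfg.r * (SHAPES[name][0] + SHAPES[name][1])
        for name in cfg.target_modules
    )
    return per_layer * NUM_LAYERS


cfg = LoraSettings()
params = adapter_param_count(cfg)
print(f"scaling = {cfg.scaling}")
print(f"adapter parameters = {params:,}")
print(f"size at 4 bytes/param = {params * 4 / 1e9:.2f} GB")
```

If the adapter weights are stored as 32-bit floats (an assumption), this estimate comes to about 1.66 GB, which is in line with the 1.7 GB listed for the model files.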

Best Alternatives to Limarpv3 Llama2 70B Qlora

Model                                   Context / RAM    Downloads, Likes
Llama 3.1 Tango 70B                     0K / 3 GB        347
LLama3 70B SWE LLM                      0K / 16.1 GB     71
Nous Hermes Llama2 70B                  0K / 138 GB      179783
Llama 2 70B Chat Longlora 32K           0K / 0.1 GB      129
Llama 2 70B Longlora 32K                0K / 0.1 GB      2718
... 70M Instruct Orca Chkpt 64000       0K / 0.2 GB      1631
NorskGPT Llama 3 70B Adapter            0K / 0.2 GB      946
Llama 3 70B Tagengo                     0K / 141.9 GB    81
...3 70B Instruct Uncensored Lora       0K / 0.8 GB      63

Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227