Gpt2 Chatbot by KnutJaegersberg


Tags: Autotrain compatible, Dataset:knutjaegersberg/deita-..., Endpoints compatible, Gpt2, Model-index, Region:us, Safetensors, Sharded, Tensorflow

Gpt2 Chatbot Parameters and Internals

Model Type: text-generation
Additional Notes: GPT2-XL fine-tuned (SFT) on the Deita dataset to change Sam's mind. Supports multi-turn dialogue.
LLM Name: Gpt2 Chatbot
Repository: https://huggingface.co/KnutJaegersberg/gpt2-chatbot
Model Size: 1.6B
Required VRAM: 6.3 GB
Updated: 2024-12-21
Maintainer: KnutJaegersberg
Model Type: gpt2
Model Files: 5.0 GB (1-of-2), 1.3 GB (2-of-2)
Model Architecture: GPT2LMHeadModel
License: apache-2.0
Model Max Length: 1024
Transformers Version: 4.37.0
Tokenizer Class: GPT2Tokenizer
Padding Token: <|endoftext|>
Vocabulary Size: 50257
Torch Data Type: float32
Activation Function: gelu_new
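
The listed values can be put to use in a short Python sketch. It is an illustration under stated assumptions, not the maintainer's own usage code. The repository ID, maximum length, data type and padding token come from the listing; the helper names (estimate_weight_gb, generate_reply) are hypothetical, and the prompt is passed as plain text because the listing does not state the prompt format the model was trained on. The memory estimate covers weights only and uses the rounded 1.6B parameter count, so it only approximates the listed 6.3 GB.

```python
"""Sketch: load KnutJaegersberg/gpt2-chatbot using values from the listing."""

# Values copied from the listing.
MODEL_ID = "KnutJaegersberg/gpt2-chatbot"
MAX_LENGTH = 1024   # Model Max Length, in tokens
PARAMS = 1.6e9      # Model Size, rounded as listed

# Bytes per parameter for common dtypes (general knowledge, not from the listing).
BYTES_PER_PARAM = {"float32": 4, "float16": 2, "bfloat16": 2}


def estimate_weight_gb(n_params: float, dtype: str = "float32") -> float:
    """Approximate weight memory in decimal GB; ignores activations and caches."""
    return n_params * BYTES_PER_PARAM[dtype] / 1e9


def generate_reply(prompt: str, max_new_tokens: int = 64) -> str:
    """Download the model and continue `prompt` (needs transformers and torch)."""
    # Imported lazily so the estimate above works without the heavy dependencies.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    # If the prompt is too long, drop the oldest text rather than the newest.
    tokenizer.truncation_side = "left"
    model = AutoModelForCausalLM.from_pretrained(MODEL_ID, torch_dtype=torch.float32)

    # Leave room for the reply inside the 1024-token window.
    inputs = tokenizer(
        prompt,
        return_tensors="pt",
        truncation=True,
        max_length=MAX_LENGTH - max_new_tokens,
    )
    with torch.no_grad():
        output = model.generate(
            **inputs,
            max_new_tokens=max_new_tokens,
            # The listing gives <|endoftext|> as the padding token, which is GPT-2's EOS.
            pad_token_id=tokenizer.eos_token_id,
        )
    # Decode only the newly generated tokens, not the echoed prompt.
    new_tokens = output[0][inputs["input_ids"].shape[1]:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True)


if __name__ == "__main__":
    print(f"float32 weights: ~{estimate_weight_gb(PARAMS):.1f} GB")
    print(f"float16 weights: ~{estimate_weight_gb(PARAMS, 'float16'):.1f} GB")
```

Calling generate_reply("Hello, how are you?") downloads both weight shards on first use, so the memory estimate is worth checking first. Loading in float16 instead of float32 would roughly halve the weight memory, at some cost in numerical precision.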

Best Alternatives to Gpt2 Chatbot

Best Alternatives            Context / RAM   Downloads   Likes
Gpt2o Chatbot 07             0K / 3.1 GB     340         0
Gpt2o Chatbot 08             0K / 3.1 GB     338         0
Gpt2o Chatbot 09             0K / 3.1 GB     337         0
BetterGPT2                   0K / 3.1 GB     24          0
Gpt2o Chatbot 02             0K / 3.1 GB     40          0
Gpt2o Chatbot 11             0K / 3.1 GB     23          0
Gpt2o Chatbot 03             0K / 3.1 GB     22          0
Gpt2 Xl Lima                 0K / 3.1 GB     1260        0
GPT 2 Xl Camel Ai Physics    0K / 3.1 GB     1260        0
GPT 2 Xl EvolInstruct        0K / 6.3 GB     1387        0

Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241217