BetterGPT2 by mergekit-community

 ยป  All LLMs  ยป  mergekit-community  ยป  BetterGPT2   URL Share it on

  Merged Model   Arxiv:2306.01708   Autotrain compatible Base model:finetune:openai-com... Base model:openai-community/gp...   Endpoints compatible   Gpt2   Region:us   Safetensors   Sharded   Tensorflow

BetterGPT2 Benchmarks

nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").
BetterGPT2 (mergekit-community/BetterGPT2)

BetterGPT2 Parameters and Internals

Additional Notes 
This model is a merge of pre-trained language models created using mergekit.
Training Details 
Methodology:
The model was created using the TIES merge method.
LLM NameBetterGPT2
Repository ๐Ÿค—https://huggingface.co/mergekit-community/BetterGPT2 
Base Model(s)  Gpt2 Xl   openai-community/gpt2-xl
Merged ModelYes
Model Size1.6b
Required VRAM3.1 GB
Updated2024-12-21
Maintainermergekit-community
Model Typegpt2
Model Files  3.1 GB: 1-of-1
Model ArchitectureGPT2LMHeadModel
Model Max Length1024
Transformers Version4.44.1
Tokenizer ClassGPT2Tokenizer
Vocabulary Size50257
Torch Data Typefloat16
Activation Functiongelu_new

Best Alternatives to BetterGPT2

Best Alternatives
Context / RAM
Downloads
Likes
Gpt2 Chatbot0K / 6.3 GB44713
Gpt2o Chatbot 070K / 3.1 GB3400
Gpt2o Chatbot 080K / 3.1 GB3380
Gpt2o Chatbot 090K / 3.1 GB3370
Gpt2o Chatbot 020K / 3.1 GB400
Gpt2o Chatbot 110K / 3.1 GB230
Gpt2o Chatbot 030K / 3.1 GB220
Gpt2 Xl Lima0K / 3.1 GB12600
GPT 2 Xl Camel Ai Physics0K / 3.1 GB12600
GPT 2 Xl EvolInstruct0K / 6.3 GB13870
Note: green Score (e.g. "73.2") means that the model is better than mergekit-community/BetterGPT2.

Rank the BetterGPT2 Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 40013 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241217