Gpt2023 By crumb: Benchmarks, Features and Detailed Analysis. Insights on Gpt2023.

Autotrain compatible En Endpoints compatible Gpt2 Pytorch Region:us Safetensors

Model Card on HF 🤗: https://huggingface.co/crumb/gpt2023

Gpt2023 Benchmarks

ARC: 21.93 vs 96.7 (so35)^-77.3%

HellaSwag: 31.11 vs 95.3 (gpt4)^-67.4%

MMLU: 25.05 vs 88.3 (so35)^-71.6%

TruthfulQA: 40.71 vs 59 (gpt4)^-31%

WinoGrande: 50.12 vs 87.5 (gpt4)^-42.7%

GSM8K: 0.3 vs 96.4 (so35)^-99.7%

LLME Score: 0.16549

^nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").

What is the LLM Explorer Rank (Score)

Gpt2023 Parameters and Internals

Model Type

causal-lm

Use Cases

Areas:

Research

Limitations:

Lack of awareness of some recent events due to finetuning on a limited dataset

Supported Languages

en (Fluent)

Training Details

Data Sources:

common crawl sites, ArXiv, GitHub

Data Volume:

2.23 billion tokens

Methodology:

Finetuning on existing GPT-2 model with learning rate adjustments

Context Length:

1024

Training Time:

79.32 hours

Hardware Used:

12GB RTX3060

Model Architecture:

Transformer-based architecture, left-to-right causal language model

Input Output

Input Format:

Text input, up to 1024 tokens

Accepted Modalities:

text

Output Format:

Text generation

Performance Tips:

Setting a seed can help achieve reproducible results

LLM Name	Gpt2023
Repository 🤗	https://huggingface.co/crumb/gpt2023
Model Size	137m
Required VRAM	0.3 GB
Updated	2025-03-14
Maintainer	crumb
Model Type	gpt2
Model Files	0.3 GB 0.3 GB
Supported Languages	en
Model Architecture	GPT2LMHeadModel
License	mit
Model Max Length	1024
Transformers Version	4.29.0.dev0
Tokenizer Class	GPT2Tokenizer
Vocabulary Size	50257
Torch Data Type	bfloat16
Activation Function	gelu_new

Best Alternatives to Gpt2023

Best Alternatives	Context / RAM	Downloads	Likes
Phantasor 137M	0K / 0.5 GB	211	1
Phantasor V0.1 137M	0K / 0.5 GB	93	1
Phantasor V0.2 137M	0K / 0.5 GB	73	1
Phantasor V0.3 137M	0K / 0.5 GB	37	1
Gpt2	0K / 0.5 GB	17050068	2617
Gpt2 Auth	0K / 0.5 GB	69	0
My GPT2	0K / 0.5 GB	2065	0
Gpt2 Test	0K / 0.5 GB	1774	0
Gpt2 Alpaca	0K / 0.5 GB	8855	9
Xuanxuan	0K / 0.3 GB	82	0

Note: green Score (e.g. "73.2") means that the model is better than crumb/gpt2023.

Rank the Gpt2023 Capabilities

🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟

Instruction Following and Task Automation
Factuality and Completeness of Knowledge
Censorship and Alignment
Data Analysis and Insight Generation
Text Generation
Text Summarization and Feature Extraction
Code Generation
Multi-Language Support and Translation

What open-source LLMs or SLMs are you in search of? 45019 in total.

Email us: info@extractum.io. Our Privacy Policy | Terms and Conditions | Suggest an improvement.

Our Social Media →

Original data from HuggingFace, OpenCompass and various public git repos.

Release v20241227

Support LLM Explorer

Gpt2023 by crumb

» All LLMs » crumb » Gpt2023 URL Share it on

Gpt2023 Benchmarks

Gpt2023 Parameters and Internals

Best Alternatives to Gpt2023

Rank the Gpt2023 Capabilities

What open-source LLMs or SLMs are you in search of? 45019 in total.