GPT Neo 33M Simplewiki 2048 Scratch By pszemraj: Benchmarks, Features and Detailed Analysis. Insights on GPT Neo 33M Simplewiki 2048 Scratch.

Autotrain compatible Base model:finetune:roneneldan... Base model:roneneldan/tinystor... Dataset:pszemraj/simple wikipe... En Endpoints compatible Generated from trainer Gpt neo Region:us Safetensors

Model Card on HF 🤗: https://huggingface.co/pszemraj/GPT-Neo-33M-simplewiki-2048-scratch

GPT Neo 33M Simplewiki 2048 Scratch Benchmarks

LLME Score: 0.13441

^nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").

What is the LLM Explorer Rank (Score)

GPT Neo 33M Simplewiki 2048 Scratch (pszemraj/GPT-Neo-33M-simplewiki-2048-scratch)

GPT Neo 33M Simplewiki 2048 Scratch Parameters and Internals

Model Type

text-generation

Supported Languages

en (Supported)

Training Details

Data Sources:

pszemraj/simple_wikipedia_LM

Context Length:

2048

LLM Name	GPT Neo 33M Simplewiki 2048 Scratch
Repository 🤗	https://huggingface.co/pszemraj/GPT-Neo-33M-simplewiki-2048-scratch
Base Model(s)	TinyStories 33M roneneldan/TinyStories-33M
Model Size	33m
Required VRAM	0.3 GB
Updated	2025-02-05
Maintainer	pszemraj
Model Type	gpt_neo
Model Files	0.3 GB 0.0 GB
Supported Languages	en
Model Architecture	GPTNeoForCausalLM
License	apache-2.0
Context Length	2048
Model Max Length	2048
Transformers Version	4.33.1
Tokenizer Class	GPT2Tokenizer
Beginning of Sentence Token	<\|endoftext\|>
End of Sentence Token	<\|endoftext\|>
Unk Token	<\|endoftext\|>
Vocabulary Size	50257
Torch Data Type	float32
Activation Function	gelu_new
Errors	replace

Best Alternatives to GPT Neo 33M Simplewiki 2048 Scratch

Best Alternatives	Context / RAM	Downloads	Likes
TinyStories 33M	2K / 0.3 GB	21153	94
TinyStories 33M Ds	2K / GB	7	0
...Tinystories 33M Epoch10 Merged	2K / 0.3 GB	124	2
TinyStories Instruct 33M	2K / 0.3 GB	1708	9
TinyStories 33M Finetuned	2K / 0.3 GB	175	0
TinyStories 2Layers 33M	2K / 0.3 GB	895	5
...nyStories Instruct 2Layers 33M	2K / 0.3 GB	765	7

Note: green Score (e.g. "73.2") means that the model is better than pszemraj/GPT-Neo-33M-simplewiki-2048-scratch.

Rank the GPT Neo 33M Simplewiki 2048 Scratch Capabilities

🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟

Instruction Following and Task Automation
Factuality and Completeness of Knowledge
Censorship and Alignment
Data Analysis and Insight Generation
Text Generation
Text Summarization and Feature Extraction
Code Generation
Multi-Language Support and Translation

What open-source LLMs or SLMs are you in search of? 42577 in total.

Email us: info@extractum.io. Our Privacy Policy | Terms and Conditions | Suggest an improvement.

Our Social Media →

Original data from HuggingFace, OpenCompass and various public git repos.

Release v20241227

Support LLM Explorer

GPT Neo 33M Simplewiki 2048 Scratch by pszemraj

» All LLMs » pszemraj » GPT Neo 33M Simplewiki 2048 Scratch URL Share it on

GPT Neo 33M Simplewiki 2048 Scratch Benchmarks

GPT Neo 33M Simplewiki 2048 Scratch Parameters and Internals

Best Alternatives to GPT Neo 33M Simplewiki 2048 Scratch

Rank the GPT Neo 33M Simplewiki 2048 Scratch Capabilities

What open-source LLMs or SLMs are you in search of? 42577 in total.