EverythingLM 13B 16K by totally-not-an-llm: Benchmarks, Features and Detailed Analysis.

Tags: Autotrain compatible, Dataset:totally-not-an-llm/eve..., Endpoints compatible, Llama, PyTorch, Region:us, Sharded

Model Card on HF 🤗: https://huggingface.co/totally-not-an-llm/EverythingLM-13b-16k

EverythingLM 13B 16K Benchmarks

ARC: 56.57 vs 96.7 (so35), -41.5%

HellaSwag: 80.58 vs 95.3 (gpt4), -15.4%

MMLU: 50.18 vs 88.3 (so35), -43.2%

TruthfulQA: 47.46 vs 59 (gpt4), -19.6%

WinoGrande: 72.77 vs 87.5 (gpt4), -16.8%

GSM8K: 6.44 vs 96.4 (so35), -93.3%

LLME Score: 0.17592

The percentage after each score is the model's relative difference from the named reference model: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").
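The percentages in the benchmark list appear to be relative differences from the reference score, that is, (model - reference) / reference. The following is a minimal sketch of that arithmetic, assuming this formula; the function name `relative_difference` is illustrative and does not come from the site.

```python
def relative_difference(score: float, reference: float) -> float:
    """Return how far `score` is from `reference`, as a percentage of `reference`."""
    return (score - reference) / reference * 100.0


# (model score, reference score) pairs copied from the benchmark list.
benchmarks = {
    "ARC": (56.57, 96.7),
    "HellaSwag": (80.58, 95.3),
    "MMLU": (50.18, 88.3),
    "TruthfulQA": (47.46, 59.0),
    "WinoGrande": (72.77, 87.5),
    "GSM8K": (6.44, 96.4),
}

for name, (score, reference) in benchmarks.items():
    # One decimal place, with an explicit sign, matching the list's notation.
    print(f"{name}: {relative_difference(score, reference):+.1f}%")
```

Rounded to one decimal place, this reproduces the six percentages listed, which supports the assumed formula.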

EverythingLM 13B 16K (totally-not-an-llm/EverythingLM-13b-16k)

EverythingLM 13B 16K Parameters and Internals

Model Type

general-purpose

Use Cases

Primary Use Cases: Creative stories, prompt understanding, chain-of-thought (CoT) reasoning

Limitations: Performs better with more detailed prompts; tends to use numbered lists; tends toward fairy tales in stories; may fall into repetition; limited testing at the full 16k context

Additional Notes

An early test of a new dataset and experimental principles.

Training Details

Data Sources: EverythingLM dataset

Methodology: QLoRA

Context Length: 16000

Training Time: 1 hour

Hardware Used: 1x A100

Input and Output

Input Format: Modified Vicuna format

Output Format: Verbose and detailed replies

Performance Tips: The model performs better with more detailed prompts
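This page names the input format only as a "modified Vicuna format" and does not give the template. The sketch below therefore assumes a common Vicuna-style layout (a system line followed by `USER:` and `ASSISTANT:` turns). The helper `build_prompt` and the default system line are hypothetical; the exact template should be checked against the model card before use.

```python
# Assumed system line; not stated on this page.
DEFAULT_SYSTEM = "You are a helpful AI assistant."


def build_prompt(user_message: str, system: str = DEFAULT_SYSTEM) -> str:
    """Assemble a single-turn Vicuna-style prompt (assumed layout, see lead-in)."""
    # The prompt ends with "ASSISTANT:" so the model continues with its reply.
    return f"{system}\n\nUSER: {user_message.strip()}\nASSISTANT:"


# Per the performance tip, a detailed request is preferred over a terse one.
prompt = build_prompt(
    "Write a short story about a lighthouse keeper who discovers a message "
    "in a bottle. Use a melancholic tone and end on a hopeful note."
)
print(prompt)
```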

LLM Name	EverythingLM 13B 16K
Repository 🤗	https://huggingface.co/totally-not-an-llm/EverythingLM-13b-16k
Model Size	13b
Required VRAM	26 GB
Updated	2025-03-12
Maintainer	totally-not-an-llm
Model Type	llama
Model Files	9.9 GB (1 of 3), 9.9 GB (2 of 3), 6.2 GB (3 of 3)
Model Architecture	LlamaForCausalLM
License	llama2
Context Length	16384
Model Max Length	16384
Transformers Version	4.31.0
Tokenizer Class	LlamaTokenizer
Beginning of Sequence Token	<s>
End of Sequence Token	</s>
Unk Token	<unk>
Vocabulary Size	32000
Torch Data Type	float16
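
The 26 GB VRAM figure is consistent with storing the weights alone in float16. The following is a rough sketch of that arithmetic, assuming a nominal 13 billion parameters, 2 bytes per parameter, and 1 GB = 10^9 bytes. It ignores activations and the KV cache, which add to real memory use, especially at long context lengths. The function name `weight_memory_gb` is illustrative.

```python
def weight_memory_gb(num_params: float, bits_per_param: int) -> float:
    """Approximate memory needed for the weights alone, in GB (10^9 bytes)."""
    return num_params * bits_per_param / 8 / 1e9


NOMINAL_PARAMS = 13e9  # assumption: "13b" taken as exactly 13 billion parameters

fp16_gb = weight_memory_gb(NOMINAL_PARAMS, 16)

# Shard sizes from the "Model Files" row, in GB.
shard_sizes_gb = [9.9, 9.9, 6.2]
shards_total_gb = round(sum(shard_sizes_gb), 1)

print(f"float16 weights: {fp16_gb:.1f} GB")
print(f"sum of shards:   {shards_total_gb:.1f} GB")
```

Both figures come out at about 26 GB, matching the "Required VRAM" row.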

Quantized Models of EverythingLM 13B 16K

Model	Likes	Downloads	VRAM
EverythingLM 13B 16K GPTQ	13	48	7 GB
EverythingLM 13B 16K GGUF	3	435	5 GB
EverythingLM 13B 16K AWQ	1	90	7 GB
EverythingLM 13B 16K GGML	13	16	5 GB

Best Alternatives to EverythingLM 13B 16K

Best Alternatives	Context / RAM	Downloads	Likes
Luminaura RP 13B	128K / 26 GB	32	0
Yarn Llama 2 13B 128K	128K / 26 GB	2637	112
Agent Llama2 13B 80K	80K / 26.4 GB	15	0
Chat Llama2 13B 80K	80K / 52.8 GB	13	0
LongAlign 13B 64K	64K / 26 GB	44	13
LongAlign 13B 64K Base	64K / 26 GB	33	3
Yarn Llama 2 13B 64K	64K / 26 GB	4893	17
Openbuddy Llama2 13B V15p1 64K	64K / 26.1 GB	19	4
Openbuddy Llama2 13b64k V15	64K / 26.1 GB	14	1
Airoboros L2 13B 2.1 YaRN 64K	64K / 26 GB	42	7

Original data from HuggingFace, OpenCompass and various public git repos.

Release v20241227
