LWM 7B 1M 1000000ctx AEZAKMI 3 1 1702 by adamo1139


Tags: Autotrain compatible, Endpoints compatible, Llama, Region: us, Safetensors, Sharded, Tensorflow


LWM 7B 1M 1000000ctx AEZAKMI 3 1 1702 Parameters and Internals

Model Type:
text generation
Additional Notes:
The model is fine-tuned on the AEZAKMI v3.1 dataset. Exl2 quants and the base model will be available soon in safetensors format. Most of the base model's long-context capabilities are expected to remain.
Training Details:
Data Sources:
AEZAKMI v3.1 dataset
Methodology:
Fine-tuned with QLoRA (lora_r = 32) and a cosine learning-rate schedule decaying from 0.00015, using Unsloth with FlashAttention 2 (a rough sketch of this setup follows below).
Context Length:
4000
Training Time:
6 hours
Hardware Used:
A local RTX 3090 Ti
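The recipe above maps onto a fairly standard Unsloth + TRL QLoRA run. The sketch below is an approximation, not the author's actual training script: only lora_r = 32, the 0.00015 cosine learning rate, the 4000-token training context, and the Unsloth/FlashAttention 2 stack come from this page; the base-model and dataset identifiers, LoRA alpha and target modules, batch sizes, and epoch count are placeholder assumptions.

```python
# Hedged reconstruction of the QLoRA fine-tuning setup described above.
from unsloth import FastLanguageModel
from datasets import load_dataset
from transformers import TrainingArguments
from trl import SFTTrainer

BASE_MODEL = "LargeWorldModel/LWM-Text-1M"   # assumed base model; not stated on this page
DATASET = "adamo1139/AEZAKMI_v3-1"           # assumed dataset id for "AEZAKMI v3.1"

# Load the base model in 4-bit for QLoRA; Unsloth uses FlashAttention 2 when available.
model, tokenizer = FastLanguageModel.from_pretrained(
    model_name=BASE_MODEL,
    max_seq_length=4000,          # training context length from the card
    load_in_4bit=True,
)

# Attach LoRA adapters with rank 32 (lora_r from the card).
model = FastLanguageModel.get_peft_model(
    model,
    r=32,
    lora_alpha=32,                                           # assumption
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj",
                    "gate_proj", "up_proj", "down_proj"],    # assumption
    use_gradient_checkpointing=True,
)

dataset = load_dataset(DATASET, split="train")

trainer = SFTTrainer(
    model=model,
    tokenizer=tokenizer,
    train_dataset=dataset,
    dataset_text_field="text",    # assumes a pre-formatted text column
    max_seq_length=4000,
    args=TrainingArguments(
        output_dir="lwm-7b-aezakmi-qlora",
        per_device_train_batch_size=1,     # assumption for a single RTX 3090 Ti
        gradient_accumulation_steps=8,     # assumption
        num_train_epochs=1,                # assumption
        learning_rate=0.00015,             # from the card
        lr_scheduler_type="cosine",        # from the card
        fp16=True,
        logging_steps=10,
    ),
)
trainer.train()
```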
LLM Name: LWM 7B 1M 1000000ctx AEZAKMI 3 1 1702
Repository: https://huggingface.co/adamo1139/LWM-7B-1M-1000000ctx-AEZAKMI-3_1-1702
Model Size: 7b
Required VRAM: 13.5 GB
Updated: 2024-11-19
Maintainer: adamo1139
Model Type: llama
Model Files: 1-of-3 (4.9 GB), 2-of-3 (5.0 GB), 3-of-3 (3.6 GB)
Model Architecture: LlamaForCausalLM
License: llama2
Context Length: 1048576
Model Max Length: 1048576
Transformers Version: 4.36.2
Tokenizer Class: LlamaTokenizer
Vocabulary Size: 32000
Torch Data Type: float16
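Given the architecture and file metadata above, the model should load with stock Transformers. The following is a minimal sketch based only on that metadata; the prompt template and generation settings are assumptions not taken from this page, and the full 1,048,576-token context will not fit in consumer VRAM, so this only demonstrates loading the float16 weights (~13.5 GB) and a short generation.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

repo = "adamo1139/LWM-7B-1M-1000000ctx-AEZAKMI-3_1-1702"

tokenizer = AutoTokenizer.from_pretrained(repo)
model = AutoModelForCausalLM.from_pretrained(
    repo,
    torch_dtype=torch.float16,   # matches the Torch Data Type listed above
    device_map="auto",
)

# Assumed chat-style prompt; the actual prompt format is not documented on this page.
prompt = "A chat.\nUSER: Summarize the idea behind QLoRA in two sentences.\nASSISTANT:"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
output = model.generate(**inputs, max_new_tokens=128, do_sample=True, temperature=0.7)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```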

Best Alternatives to LWM 7B 1M 1000000ctx AEZAKMI 3 1 1702

Best Alternatives | Context / RAM | Downloads / Likes
... Qwen2.5llamaify 7B V23.1 200K | 195K / 15.2 GB | 29240
Yarn Llama 2 7B 128K | 128K / 13.5 GB | 407738
LLaMA 7B PoSE YaRN 128K | 128K / 13.5 GB | 233
LLaMA 7B PoSE Linear 96K | 96K / 27 GB | 222
LLaMA 7B PoSE YaRN 96K | 96K / 13.5 GB | 181
Chat Llama2 7B 80K | 80K / 13.8 GB | 280
Llama2 7B 80K | 80K / 13.8 GB | 110
Lloma Step400 | 64K / 13.5 GB | 730
Lloma Step200 | 64K / 13.5 GB | 730
666 | 64K / 13.5 GB | 2670


Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241110