Mpt 7B Storywriter 4bit 128g by OccamRazor

 ยป  All LLMs  ยป  OccamRazor  ยป  Mpt 7B Storywriter 4bit 128g   URL Share it on

  Arxiv:2108.12409   Arxiv:2205.14135   4bit   Autotrain compatible   Composer   Custom code   Dataset:the pile books3   Llm-foundry   Mosaicml   Mpt   Quantized   Region:us   Safetensors

Mpt 7B Storywriter 4bit 128g Benchmarks

nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").
Mpt 7B Storywriter 4bit 128g (OccamRazor/mpt-7b-storywriter-4bit-128g)

Mpt 7B Storywriter 4bit 128g Parameters and Internals

Model Type 
text generation
Use Cases 
Areas:
Fiction writing, Story generation, Creative writing
Applications:
Personal assistant, Creative content generation
Primary Use Cases:
Long context story writing, Text generation
Limitations:
Can produce factually incorrect output, Can generate biased or offensive outputs
Considerations:
Should not be relied on to produce factually accurate information
Additional Notes 
Utilizes FlashAttention and ALiBi techniques for extended context handling.
Training Details 
Data Sources:
books3 dataset (filtered fiction subset)
Methodology:
Built by finetuning MPT-7B with a context length of 65k tokens using the ALiBi technique
Context Length:
65536
Hardware Used:
8 A100-80GB GPUs
Model Architecture:
Modified decoder-only transformer
Input Output 
Accepted Modalities:
text
Release Notes 
Version:
MPT-7B-StoryWriter-65k+
Date:
2023-05-05
Notes:
Released with a context length of 65k tokens, capable of generating 84k tokens when extrapolating with ALiBi.
LLM NameMpt 7B Storywriter 4bit 128g
Repository ๐Ÿค—https://huggingface.co/OccamRazor/mpt-7b-storywriter-4bit-128g 
Model Size7b
Required VRAM3.9 GB
Updated2024-12-22
MaintainerOccamRazor
Model Typempt
Model Files  3.9 GB
Quantization Type4bit
Model ArchitectureMPTForCausalLM
Licenseapache-2.0
Model Max Length65536
Transformers Version4.28.1
Tokenizer ClassGPTNeoXTokenizer
Vocabulary Size50432
Torch Data Typebfloat16

Best Alternatives to Mpt 7B Storywriter 4bit 128g

Best Alternatives
Context / RAM
Downloads
Likes
Mpt 7B Chat Q80K / 6.9 GB221
Mpt 7B Storywriter Q80K / 6.9 GB242
Mpt 7B Instruct Q80K / 6.9 GB192
Mpt 7B Q80K / 6.9 GB231
...writer 4bit 128g 65kTokens CPU0K / 3.9 GB159
Mpt 7B Storywriter 4bit 128g0K / 3.9 GB3422
Mpt 7B0K / 13.3 GB298221164
Mpt 7B Chat0K / 13.3 GB20372512
Mpt 7B Storywriter0K / 13.3 GB1694824
Mpt 7B Instruct0K / 13.3 GB7997468
Note: green Score (e.g. "73.2") means that the model is better than OccamRazor/mpt-7b-storywriter-4bit-128g.

Rank the Mpt 7B Storywriter 4bit 128g Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 40066 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241217