LWM Text Chat 256K by LargeWorldModel

 ยป  All LLMs  ยป  LargeWorldModel  ยป  LWM Text Chat 256K   URL Share it on

  Autotrain compatible   Llama   Pytorch   Region:us   Sharded

LWM Text Chat 256K Benchmarks

nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").
LWM Text Chat 256K (LargeWorldModel/LWM-Text-Chat-256K)

LWM Text Chat 256K Parameters and Internals

Model Type 
auto-regressive language model
Training Details 
Data Sources:
Books3
Data Volume:
37K subset with 200K to 500K tokens
Training Time:
December 2023
Model Architecture:
transformer
LLM NameLWM Text Chat 256K
Repository ๐Ÿค—https://huggingface.co/LargeWorldModel/LWM-Text-Chat-256K 
Required VRAM13.5 GB
Updated2024-12-22
MaintainerLargeWorldModel
Model Typellama
Model Files  10.0 GB: 1-of-2   3.5 GB: 2-of-2
Model ArchitectureLlamaForCausalLM
Context Length262144
Model Max Length262144
Transformers Version4.29.2
Tokenizer ClassLlamaTokenizer
Beginning of Sentence Token<s>
End of Sentence Token</s>
Unk Token<unk>
Vocabulary Size32000
Torch Data Typefloat16

Quantized Models of the LWM Text Chat 256K

Model
Likes
Downloads
VRAM
LWM Text Chat 256K GPTQ174 GB

Best Alternatives to LWM Text Chat 256K

Best Alternatives
Context / RAM
Downloads
Likes
LWM Text Chat 512K512K / 13.5 GB502
LWM Text 512K512K / 13.5 GB82
LWM Text 256K256K / 13.5 GB3963
Pallas 0.5 LASER 0.1195K / 68.9 GB11472
Ashley3b X 1.2128K / 6.5 GB250
Cyber13128K / 16.1 GB150
Cyber8128K / 16.1 GB170
LWM Text Chat 128K128K / 13.5 GB32220
...Model LWM Text Chat 128K 55bpw128K / 4.8 GB73
LWM Text 128K128K / 13.5 GB431
Note: green Score (e.g. "73.2") means that the model is better than LargeWorldModel/LWM-Text-Chat-256K.

Rank the LWM Text Chat 256K Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 40066 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241217