V3 Gptq by NotoriousH2


Tags: Merged Model, Arxiv:1910.09700, 4-bit, 4bit, Autotrain compatible, Endpoints compatible, Gptq, Llama, Quantized, Region:us

V3 Gptq Benchmarks

Scores are given as percentages (nn.n%) showing how the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o"), or GPT-4 ("gpt4").

V3 Gptq Parameters and Internals

LLM Name: V3 Gptq
Repository: https://huggingface.co/NotoriousH2/v3_gptq
Merged Model: Yes
Required VRAM: 6.1 GB
Updated: 2024-09-18
Maintainer: NotoriousH2
Model Type: llama
Model Files: 6.1 GB
GPTQ Quantization: Yes
Quantization Type: gptq|4bit
Model Architecture: LlamaForCausalLM
Context Length: 4096
Model Max Length: 4096
Transformers Version: 4.38.2
Tokenizer Class: LlamaTokenizer
Padding Token: </s>
Vocabulary Size: 40960
Torch Data Type: float16
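
The parameters above are enough to sketch how the checkpoint might be loaded with the Hugging Face transformers library. This is a minimal sketch, not a tested recipe for this repository. It assumes that the optimum and auto-gptq packages are installed alongside transformers, that a CUDA GPU with roughly the listed 6.1 GB of free VRAM is available, and that the repository ships a standard GPTQ quantization config. The helper names `load_v3_gptq` and `generate` are hypothetical, and the page does not document a prompt format, so the prompt is passed through unchanged.

```python
# Sketch only: loading a GPTQ checkpoint through transformers.
# Assumptions: optimum + auto-gptq are installed, a CUDA GPU is available,
# and the repository contains a standard GPTQ quantization config.

REPO_ID = "NotoriousH2/v3_gptq"  # repository listed in the table above
MAX_CONTEXT = 4096               # context length listed in the table above


def load_v3_gptq(repo_id: str = REPO_ID):
    """Return (tokenizer, model) for the quantized checkpoint (hypothetical helper)."""
    # Heavy dependencies are imported lazily, only when the model is requested.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(repo_id)
    model = AutoModelForCausalLM.from_pretrained(
        repo_id,
        device_map="auto",          # place the weights on the available device(s)
        torch_dtype=torch.float16,  # matches the listed torch data type
    )
    return tokenizer, model


def generate(prompt: str, max_new_tokens: int = 64) -> str:
    """Generate a continuation for `prompt` (hypothetical helper)."""
    tokenizer, model = load_v3_gptq()
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `generate("Hello")` downloads about 6.1 GB of model files on first use. Prompt plus generated tokens should stay within the 4096-token context length.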

Best Alternatives to V3 Gptq

Best Alternatives | Context / RAM | Downloads, Likes
LWM Text Chat 512K GPTQ | 512K / 4.3 GB | 42
LWM Text Chat 256K GPTQ | 256K / 4.3 GB | 51
LWM Text Chat 128K GPTQ | 128K / 4.3 GB | 41
StoryTeller10.7B GPTQ 4Bit | 41K / 6.6 GB | 50
Alpha Merged Gptq | 19K / 9.2 GB | 60
...lama 3 Lima Nsfw 16K Test GPTQ | 16K / 5.7 GB | 684
Taiwan LLaMa V1.0 4bits GPTQ | 4K / 7.3 GB | 61
MythoMax22b Falseblock GPT | 4K / 12 GB | 60
Taiwan LLaMa V1.0 4bits GPTQ | 4K / 7.3 GB | 259
Nous Hermes Llama2 8bit GPTQ | 4K / 13.7 GB | 71
Note: a score shown in green (e.g. "73.2") means that the model scores better than NotoriousH2/v3_gptq.
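
The Context / RAM column can be used to shortlist alternatives that fit a given memory budget. The sketch below copies the RAM figures from the list above and keeps the entries that need no more memory than V3 Gptq itself (6.1 GB). It assumes the listed RAM figure is comparable to the Required VRAM value. The `Alternative` type and `fits_budget` function are hypothetical names used only for illustration.

```python
from typing import List, NamedTuple


class Alternative(NamedTuple):
    name: str
    context: str
    ram_gb: float


# Rows copied from the "Best Alternatives" list; names are kept as listed.
ALTERNATIVES = [
    Alternative("LWM Text Chat 512K GPTQ", "512K", 4.3),
    Alternative("LWM Text Chat 256K GPTQ", "256K", 4.3),
    Alternative("LWM Text Chat 128K GPTQ", "128K", 4.3),
    Alternative("StoryTeller10.7B GPTQ 4Bit", "41K", 6.6),
    Alternative("Alpha Merged Gptq", "19K", 9.2),
    Alternative("...lama 3 Lima Nsfw 16K Test GPTQ", "16K", 5.7),
    Alternative("Taiwan LLaMa V1.0 4bits GPTQ", "4K", 7.3),
    Alternative("MythoMax22b Falseblock GPT", "4K", 12.0),
    Alternative("Taiwan LLaMa V1.0 4bits GPTQ", "4K", 7.3),
    Alternative("Nous Hermes Llama2 8bit GPTQ", "4K", 13.7),
]


def fits_budget(rows: List[Alternative], budget_gb: float) -> List[Alternative]:
    """Keep the rows whose listed RAM does not exceed the budget."""
    return [row for row in rows if row.ram_gb <= budget_gb]


# Budget assumed equal to the Required VRAM listed for V3 Gptq.
within_budget = fits_budget(ALTERNATIVES, 6.1)
for row in within_budget:
    print(f"{row.name}: {row.context} context, {row.ram_gb} GB")
```

With a 6.1 GB budget this keeps the three LWM Text Chat variants and the 16K Lima entry. Raising the budget to 6.6 GB would also admit StoryTeller10.7B GPTQ 4Bit.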

Original data from HuggingFace, OpenCompass and various public git repos.
Release v2024072803