Llama 30B Supercot 3bit 128g Cuda By tsumeone: Benchmarks, Features and Detailed Analysis. Insights on Llama 30B Supercot 3bit 128g Cuda.

3bit Autotrain compatible Endpoints compatible Llama Pytorch Quantized Region:us

Model Card on HF 🤗: https://huggingface.co/tsumeone/llama-30b-supercot-3bit-128g-cuda

Llama 30B Supercot 3bit 128g Cuda Benchmarks

^nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").

What is the LLM Explorer Rank (Score)

Llama 30B Supercot 3bit 128g Cuda (tsumeone/llama-30b-supercot-3bit-128g-cuda)

Llama 30B Supercot 3bit 128g Cuda Parameters and Internals

Model Type

Language Model

Additional Notes

Created using GPTQ quantization with 3-bit weights at the request of a user. Cannot be tested by the developer due to lack of a functioning Ooba setup.

LLM Name	Llama 30B Supercot 3bit 128g Cuda
Repository 🤗	https://huggingface.co/tsumeone/llama-30b-supercot-3bit-128g-cuda
Model Size	30b
Required VRAM	14 GB
Updated	2025-04-28
Maintainer	tsumeone
Model Type	llama
Model Files	14.0 GB
Quantization Type	3bit
Model Architecture	LlamaForCausalLM
Context Length	2048
Model Max Length	2048
Transformers Version	4.28.0
Tokenizer Class	LlamaTokenizer
Beginning of Sentence Token	<s>
End of Sentence Token	</s>
Unk Token	<unk>
Vocabulary Size	32000
Torch Data Type	float16

Best Alternatives to Llama 30B Supercot 3bit 128g Cuda

Best Alternatives	Context / RAM	Downloads	Likes
Tenebra 30B Alpha01 FP16	16K / 65 GB	1833	9
Tenebra 30B Alpha01 4BIT	16K / 19.4 GB	165	1
...nebra 30B Alpha01 EXL2 2 80bpw	16K / 11.9 GB	3	1
...nebra 30B Alpha01 EXL2 2 50bpw	16K / 10.7 GB	5	0
Tenebra 30B Alpha01 EXL2 3bpw	16K / 12.7 GB	5	0
Tenebra 30B Alpha01 3BIT	16K / 12.9 GB	6	0
... 30B Supercot SuperHOT 8K Fp16	8K / 65.2 GB	185	4
Platypus 30B SuperHOT 8K Fp16	8K / 65.2 GB	188	2
GPlatty 30B SuperHOT 8K Fp16	8K / 65.2 GB	179	1
Llama 1 30B E8P 2Bit	2K / 8.9 GB	4	1

Note: green Score (e.g. "73.2") means that the model is better than tsumeone/llama-30b-supercot-3bit-128g-cuda.

Rank the Llama 30B Supercot 3bit 128g Cuda Capabilities

🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟

Instruction Following and Task Automation
Factuality and Completeness of Knowledge
Censorship and Alignment
Data Analysis and Insight Generation
Text Generation
Text Summarization and Feature Extraction
Code Generation
Multi-Language Support and Translation

What open-source LLMs or SLMs are you in search of? 46763 in total.

Email us: info@extractum.io. Our Privacy Policy | Terms and Conditions | Suggest an improvement.

Our Social Media →

Original data from HuggingFace, OpenCompass and various public git repos.

Release v20241227

Support LLM Explorer

Llama 30B Supercot 3bit 128g Cuda by tsumeone

» All LLMs » tsumeone » Llama 30B Supercot 3bit 128g Cuda URL Share it on

Llama 30B Supercot 3bit 128g Cuda Benchmarks

Llama 30B Supercot 3bit 128g Cuda Parameters and Internals

Best Alternatives to Llama 30B Supercot 3bit 128g Cuda

Rank the Llama 30B Supercot 3bit 128g Cuda Capabilities

What open-source LLMs or SLMs are you in search of? 46763 in total.