LLM Name | Llama3.1 Merged |
Repository ๐ค | https://huggingface.co/anthonymeo/llama3.1-merged |
Merged Model | Yes |
Model Size | 4.7b |
Required VRAM | 5.8 GB |
Updated | 2025-02-05 |
Maintainer | anthonymeo |
Model Type | llama |
Model Files | |
Quantization Type | 4bit |
Model Architecture | LlamaForCausalLM |
Context Length | 131072 |
Model Max Length | 131072 |
Transformers Version | 4.44.2 |
Tokenizer Class | PreTrainedTokenizerFast |
Padding Token | <|finetune_right_pad_id|> |
Vocabulary Size | 128256 |
Torch Data Type | float16 |
Best Alternatives |
Context / RAM |
Downloads |
Likes |
---|---|---|---|
EPFL TA Meister 4bit | 8K / 5.8 GB | 95 | 0 |
Book 4bitV5 | 8K / 5.8 GB | 77 | 0 |
Book 4bit | 8K / 5.8 GB | 5 | 0 |
Llama3 Ko 4bit | 8K / 5.8 GB | 77 | 0 |
Tenebra PreAlpha 128g 4BIT | 2K / 17.6 GB | 9 | 0 |
Airoboros C34B 3.1.2 GPTQ | 16K / 17.7 GB | 13 | 1 |
Airoboros C34b 2.2.1 GPTQ | 16K / 17.7 GB | 41 | 2 |
Airoboros C34B 2.2 GPTQ | 16K / 17.7 GB | 23 | 1 |
Airoboros C34B 2.1 GPTQ | 16K / 17.7 GB | 36 | 12 |
Hola | 8K / 5.8 GB | 77 | 0 |
๐ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐