Llama3 AWQ by alejandrovil

 ยป  All LLMs  ยป  alejandrovil  ยป  Llama3 AWQ   URL Share it on

  4-bit   Autotrain compatible   Awq   Axolotl   Chatml   Conversational   Dataset:teknium/openhermes-2.5   Distillation   Dpo   En   Endpoints compatible   Finetuned   Function calling   Gpt4   Instruct   Json mode   Llama   Llama-3   Quantized   Region:us   Rlhf   Safetensors   Sharded   Synthetic data   Tensorflow
Model Card on HF ๐Ÿค—: https://huggingface.co/alejandrovil/llama3-AWQ 

Llama3 AWQ Benchmarks

nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").
Llama3 AWQ (alejandrovil/llama3-AWQ)

Llama3 AWQ Parameters and Internals

Model Type 
text-generation, instruct, finetune, chatml, DPO, RLHF
Use Cases 
Areas:
text generation, chat,
Applications:
function calling, json mode, distillation, synthetic data
Additional Notes 
Hermes-2-Pro-Llama-3-8B is part of the Llama-3 series and is designed with advanced quantization (AWQ) for efficient use.
Supported Languages 
en (proficient)
Input Output 
Input Format:
prompts in string format
Accepted Modalities:
text
Output Format:
generated text
LLM NameLlama3 AWQ
Repository ๐Ÿค—https://huggingface.co/alejandrovil/llama3-AWQ 
Model Size2b
Required VRAM5.8 GB
Updated2025-02-22
Maintaineralejandrovil
Model Typellama
Model Files  4.7 GB: 1-of-2   1.1 GB: 2-of-2
Supported Languagesen
AWQ QuantizationYes
Quantization Typeawq
Model ArchitectureLlamaForCausalLM
Licenseapache-2.0
Context Length8192
Model Max Length8192
Transformers Version4.40.1
Tokenizer ClassPreTrainedTokenizerFast
Padding Token<|end_of_text|>
Vocabulary Size128288
Torch Data Typefloat16

Best Alternatives to Llama3 AWQ

Best Alternatives
Context / RAM
Downloads
Likes
...ents Llama3 4.0.37 DPO 480 AWQ8K / 5.8 GB790
...t Agents Llama3 4.0.34 DPO AWQ8K / 5.8 GB810
...t Agents Llama3 4.0.35 DPO AWQ8K / 5.8 GB810
...t Agents Llama3 4.0.23 DPO AWQ8K / 5.8 GB780
Llama3 AWQ Hermes 2 Theta8K / 5.8 GB790
Drug Profile Llama 3 AWQ8K / 5.8 GB110
EPFL TA Meister AWQ 4bit8K / 5.8 GB840
...t Agents Llama3 4.0.22 DPO AWQ8K / 5.8 GB70
...ot Agents Llama3 4.0.5 SFT AWQ8K / 5.8 GB161
...ot Agents Llama3 4.0.2 SFT AWQ8K / 5.8 GB150
Note: green Score (e.g. "73.2") means that the model is better than alejandrovil/llama3-AWQ.

Rank the Llama3 AWQ Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 43470 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227