LLM Name | Yarn Llama 2 7B 64K AWQ |
Repository 🤗 | https://huggingface.co/TheBloke/Yarn-Llama-2-7B-64K-AWQ |
Model Name | Yarn Llama 2 7B 64K |
Model Creator | NousResearch |
Base Model(s) | |
Model Size | 7b |
Required VRAM | 3.9 GB |
Updated | 2024-12-22 |
Maintainer | TheBloke |
Model Type | llama |
Model Files | |
AWQ Quantization | Yes |
Quantization Type | awq |
Model Architecture | LlamaForCausalLM |
License | llama2 |
Context Length | 65536 |
Model Max Length | 65536 |
Transformers Version | 4.32.0.dev0 |
Tokenizer Class | LlamaTokenizer |
Beginning of Sentence Token | <s> |
End of Sentence Token | </s> |
Unk Token | <unk> |
Vocabulary Size | 32000 |
Torch Data Type | bfloat16 |
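
As a rough sanity check on the "Required VRAM" figure, the weight footprint can be estimated from the architecture. The sketch below is not the listing's own method. It assumes the standard Llama 2 7B dimensions (hidden size 4096, MLP size 11008, 32 layers), none of which appear in the table, and it assumes typical AWQ settings: 4-bit weights, group size 128, one 16-bit scale and one 4-bit zero point per group, with the embeddings and output head left in 16-bit. Only the vocabulary size is taken from the table.

```python
# Assumed Llama 2 7B dimensions (not listed in the table above).
HIDDEN = 4096
INTERMEDIATE = 11008
LAYERS = 32
VOCAB = 32000  # "Vocabulary Size" from the table

# Assumed AWQ settings: 4-bit weights, group size 128,
# one 16-bit scale and one 4-bit zero point per group.
BITS = 4
GROUP_SIZE = 128


def estimate_weight_gb() -> float:
    """Rough size of the quantized weights in GB (10**9 bytes)."""
    # Per block: 4 attention projections plus 3 MLP projections.
    per_layer = 4 * HIDDEN * HIDDEN + 3 * HIDDEN * INTERMEDIATE
    quantized_params = LAYERS * per_layer
    # Effective bits per weight, including per-group scale and zero point.
    bits_per_weight = BITS + (16 + BITS) / GROUP_SIZE
    quantized_bytes = quantized_params * bits_per_weight / 8
    # Token embeddings and output head, assumed to stay at 2 bytes each.
    unquantized_bytes = 2 * VOCAB * HIDDEN * 2
    return (quantized_bytes + unquantized_bytes) / 1e9


print(f"{estimate_weight_gb():.2f} GB")
```

Under these assumptions the estimate lands close to the 3.9 GB listed. It covers weights only: activations and the key/value cache add to this, and the cache grows with the number of tokens in context.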
Best Alternatives | Context / RAM | Downloads | Likes |
---|---|---|---|
Smaugv0.1 AWQ | 195K / 19.3 GB | 10 | 1 |
Calm2 7B Chat AWQ | 32K / 4.4 GB | 61 | 2 |
Llama 2 7B 32K Instruct AWQ | 32K / 3.9 GB | 22 | 2 |
... SWE Llama 7B Updated 4bit AWQ | 16K / 3.9 GB | 16 | 0 |
...Llama 7B Python Hf W4 G128 AWQ | 16K / 3.9 GB | 2359 | 0 |
Pandalyst 7B V1.2 AWQ | 16K / 3.9 GB | 23 | 1 |
Tora Code 7B V1.0 AWQ | 16K / 3.9 GB | 20 | 0 |
...eechless Tora Code 7B V1.0 AWQ | 16K / 3.9 GB | 22 | 1 |
CodeLlama 7B Instruct AWQ | 16K / 3.9 GB | 176 | 4 |
Pandalyst 7B V1.1 AWQ | 16K / 3.9 GB | 32 | 1 |