Model Type |
| |||||||||||||||||||
Use Cases |
| |||||||||||||||||||
Additional Notes |
| |||||||||||||||||||
Supported Languages |
| |||||||||||||||||||
Training Details |
| |||||||||||||||||||
Input Output |
| |||||||||||||||||||
Release Notes |
|
LLM Name | ALMA 13B Pretrain |
Repository ๐ค | https://huggingface.co/haoranxu/ALMA-13B-Pretrain |
Base Model(s) | |
Model Size | 13b |
Required VRAM | 52.1 GB |
Updated | 2025-02-22 |
Maintainer | haoranxu |
Model Type | llama |
Model Files | |
Model Architecture | LlamaForCausalLM |
License | mit |
Context Length | 4096 |
Model Max Length | 4096 |
Transformers Version | 4.30.0.dev0 |
Tokenizer Class | LlamaTokenizer |
Beginning of Sentence Token | <s> |
End of Sentence Token | </s> |
Unk Token | <unk> |
Vocabulary Size | 32000 |
Torch Data Type | float32 |
Model |
Likes |
Downloads |
VRAM |
---|---|---|---|
ALMA 13B Pretrain GGUF | 12 | 280 | 5 GB |
ALMA 13B Pretrain AWQ | 1 | 90 | 7 GB |
ALMA 13B Pretrain GPTQ | 1 | 43 | 7 GB |
Best Alternatives |
Context / RAM |
Downloads |
Likes |
---|---|---|---|
Yarn Llama 2 13B 128K | 128K / 26 GB | 4968 | 113 |
Luminaura RP 13B | 128K / 26 GB | 27 | 0 |
Agent Llama2 13B 80K | 80K / 26.4 GB | 15 | 0 |
Chat Llama2 13B 80K | 80K / 52.8 GB | 13 | 0 |
Yarn Llama 2 13B 64K | 64K / 26 GB | 7190 | 17 |
LongAlign 13B 64K | 64K / 26 GB | 31 | 13 |
LongAlign 13B 64K Base | 64K / 26 GB | 26 | 3 |
Openbuddy Llama2 13B V15p1 64K | 64K / 26.1 GB | 13 | 4 |
Openbuddy Llama2 13b64k V15 | 64K / 26.1 GB | 16 | 1 |
Airoboros L2 13B 2.1 YaRN 64K | 64K / 26 GB | 13 | 7 |
๐ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐