LLM Name | HermesBagel 34B V0.1 |
Repository ๐ค | https://huggingface.co/dfurman/HermesBagel-34B-v0.1 |
Base Model(s) | |
Merged Model | Yes |
Model Size | 34b |
Required VRAM | 68.9 GB |
Updated | 2024-10-07 |
Maintainer | dfurman |
Model Type | llama |
Model Files | |
Model Architecture | LlamaForCausalLM |
License | apache-2.0 |
Context Length | 4096 |
Model Max Length | 4096 |
Transformers Version | 4.36.2 |
Tokenizer Class | LlamaTokenizer |
Padding Token | <unk> |
Vocabulary Size | 64000 |
Torch Data Type | bfloat16 |
Best Alternatives |
Context / RAM |
Downloads |
Likes |
---|---|---|---|
Yi 34B 200K | 195K / 68.9 GB | 4514 | 313 |
34B Beta | 195K / 69.2 GB | 2964 | 61 |
Smaug 34B V0.1 | 195K / 69.2 GB | 2990 | 59 |
Bagel Hermes 34B Slerp | 195K / 68.9 GB | 3334 | 1 |
Bagel 34B V0.2 | 195K / 68.7 GB | 3143 | 39 |
Yi 34B 200K AEZAKMI V2 | 195K / 69.2 GB | 937 | 12 |
Smaug 34B V0.1 ExPO | 195K / 69.2 GB | 2357 | 0 |
Bagel DPO 34B V0.5 | 195K / 68.7 GB | 2382 | 17 |
Faro Yi 34B | 195K / 69.2 GB | 2982 | 6 |
Merged RP Stew V2 34B | 195K / 68.9 GB | 454 | 48 |
๐ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐