LLM Name | Deepseek Coder 6.7B Instruct Trt Int8 G128 Hf |
Repository 🤗 | https://huggingface.co/juewang/deepseek-coder-6.7b-instruct-trt-int8-g128-hf |
Model Size | 6.7b |
Required VRAM | 7.2 GB |
Updated | 2025-02-22 |
Maintainer | juewang |
Model Type | llama |
Instruction-Based | Yes |
Model Files | |
Generates Code | Yes |
Model Architecture | LlamaForCausalLM |
Context Length | 16384 |
Model Max Length | 16384 |
Transformers Version | 4.41.0.dev0 |
Tokenizer Class | LlamaTokenizer |
Padding Token | <|end▁of▁sentence|> |
Vocabulary Size | 32256 |
Torch Data Type | bfloat16 |
Best Alternatives |
Context / RAM |
Downloads |
Likes |
---|---|---|---|
...s Coder6.7b Reflct Adamw Iter1 | 16K / 13.5 GB | 475 | 0 |
...Coder6.7b Reflct Rmsprop Iter1 | 16K / 13.5 GB | 95 | 0 |
...Coder6.7b Reflct Rmsprop Iter1 | 16K / 13.5 GB | 110 | 0 |
...r6.7b Pos Reflct Rmsprop Iter1 | 16K / 13.5 GB | 87 | 0 |
Speechless Coder Ds 6.7B | 16K / 13.5 GB | 1805 | 6 |
...r6.7b Pos Reflct Rmsprop Iter1 | 16K / 13.5 GB | 90 | 0 |
...ir4 Ds Coder6.7b Rmsprop Iter1 | 16K / 13.5 GB | 43 | 0 |
Ds Coder6.7b Rmsprop Iter1 | 16K / 13.5 GB | 67 | 0 |
...Coder6.7b Reflct Rmsprop Iter1 | 16K / 13.5 GB | 62 | 0 |
Datascience Coder 6.7B | 16K / 13.5 GB | 1790 | 3 |
🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟