๐ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐
Best Alternatives |
HF Rank |
Context/RAM |
Downloads |
Likes |
---|---|---|---|---|
Codegen 6B Mono Sharded Bnb | — | 0K / 14.1 GB | 8 | 2 |
Codegen 6B Mono | — | 0K / 14.3 GB | 310 | 36 |
Diff Codegen 6B V2 | — | 0K / 14.3 GB | 8 | 35 |
Codegen 6B Multi | — | 0K / 14.3 GB | 2603 | 18 |
Codegen 6B Nl | — | 0K / 14.3 GB | 3748 | 4 |
...egen 6B Nl Lora Adapter Merged | — | 0K / 14.3 GB | 8 | 1 |
Nsql 6B | — | 0K / 28.4 GB | 62 | 50 |
...n 6B Mono Instruct Py Critique | — | 0K / 28.4 GB | 9 | 2 |
...en 6B Mono Instruct Py Revised | — | 0K / 28.4 GB | 9 | 2 |
...gen 6B Nl Instruct Py Critique | — | 0K / 28.4 GB | 9 | 1 |
LLM Name | Codegen 6B Mono Lora Adapter Merged |
Repository | Open on ๐ค |
Model Size | 6b |
Required VRAM | 14.3 GB |
Updated | 2024-06-24 |
Maintainer | patent |
Model Type | codegen |
Model Files | |
Generates Code | Yes |
Model Architecture | CodeGenForCausalLM |
Transformers Version | 4.28.0.dev0 |
Tokenizer Class | GPT2Tokenizer |
Vocabulary Size | 51200 |
LoRA Model | Yes |
Initializer Range | 0.02 |
Torch Data Type | float16 |
Activation Function | gelu_new |
Attention Dropout | 0 |
Embedding Dropout | 0 |
Layer Norm Epsilon | 1.0E-5 |
Summary First Dropout | 0.1 |
Summary Type | cls_index |