Gemma 7B It GGUF (alokabhishek/gemma-7b-it-GGUF)

Gemma 7B It GGUF Parameters and Internals

LLM Name: Gemma 7B It GGUF
Repository: alokabhishek/gemma-7b-it-GGUF on Hugging Face
Model Size: 7B
Required VRAM: 5.3 GB
Model Type: gemma
Model Files: 17.1 GB, 5.3 GB, 6.1 GB
GGUF Quantization: Yes
Quantization Type: gguf, q4, q4_k, q5_k
Model Architecture: GemmaForCausalLM
Context Length: 8192
Model Max Length: 8192
Transformers Version: 4.38.0.dev0
Tokenizer Class: GemmaTokenizer
Padding Token: <pad>
Vocabulary Size: 256000
Initializer Range: 0.02
Torch Data Type: bfloat16
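
The listed file sizes give a rough feel for how heavily each GGUF file is quantized. The following is a minimal sketch of that arithmetic, not something taken from the repository. It assumes that the 17.1 GB file holds unquantized 16-bit weights (matching the bfloat16 data type listed above), that sizes are decimal gigabytes, and that metadata overhead is negligible. None of these assumptions is confirmed by the listing, and the function names are hypothetical.

```python
# Rough bits-per-weight estimate from the listed "Model Files" sizes.
# Assumptions (not stated in the listing): the largest file stores
# 16-bit weights, sizes are decimal GB, and metadata overhead is ignored.

GB = 1e9  # decimal gigabyte, assumed

FILE_SIZES_GB = [17.1, 5.3, 6.1]  # values from the "Model Files" entry


def estimate_param_count(full_precision_gb: float, bytes_per_weight: int = 2) -> float:
    """Estimate the parameter count from a file assumed to hold 16-bit weights."""
    return full_precision_gb * GB / bytes_per_weight


def bits_per_weight(file_gb: float, param_count: float) -> float:
    """Average number of bits stored per parameter for a file of the given size."""
    return file_gb * GB * 8 / param_count


# Treat the largest file as the 16-bit reference (assumption).
params = estimate_param_count(max(FILE_SIZES_GB))
estimates = {size: round(bits_per_weight(size, params), 2) for size in FILE_SIZES_GB}

if __name__ == "__main__":
    print(f"estimated parameters: {params / 1e9:.2f}B")
    for size, bpw in estimates.items():
        print(f"{size} GB -> ~{bpw} bits per weight")
```

Under these assumptions the 5.3 GB and 6.1 GB files come out at roughly 5.0 and 5.7 bits per weight, which is plausible for the q4_k and q5_k quantization types listed. The 5.3 GB figure also matches the Required VRAM entry, which suggests that entry reflects the smallest file.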

