Zephyr 7B Beta Nf4 Fp16 Upscaled by arnavgrg

 ยป  All LLMs  ยป  arnavgrg  ยป  Zephyr 7B Beta Nf4 Fp16 Upscaled   URL Share it on

  Autotrain compatible   Conversational   Endpoints compatible   Fp16   Mistral   Quantized   Region:us   Safetensors   Sharded   Tensorflow

Zephyr 7B Beta Nf4 Fp16 Upscaled Benchmarks

nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").
Zephyr 7B Beta Nf4 Fp16 Upscaled (arnavgrg/zephyr-7b-beta-nf4-fp16-upscaled)

Zephyr 7B Beta Nf4 Fp16 Upscaled Parameters and Internals

Model Type 
text-generation-inference
Additional Notes 
This is an upscaled fp16 variant of the original model after it has been loaded with nf4 4-bit quantization via bitsandbytes. It aims to upscale linear4bit layers to fp16 to minimize quantization cost per forward pass during inference. Note that the quantization to nf4 is lossy, affecting model performance when compared to the official base model.
LLM NameZephyr 7B Beta Nf4 Fp16 Upscaled
Repository ๐Ÿค—https://huggingface.co/arnavgrg/zephyr-7b-beta-nf4-fp16-upscaled 
Model Size7b
Required VRAM14.4 GB
Updated2025-01-15
Maintainerarnavgrg
Model Typemistral
Model Files  4.9 GB: 1-of-3   5.0 GB: 2-of-3   4.5 GB: 3-of-3
Quantization Typefp16
Model ArchitectureMistralForCausalLM
Licenseapache-2.0
Context Length32768
Model Max Length32768
Transformers Version4.35.2
Tokenizer ClassLlamaTokenizer
Padding Token</s>
Vocabulary Size32000
Torch Data Typefloat16

Quantized Models of the Zephyr 7B Beta Nf4 Fp16 Upscaled

Model
Likes
Downloads
VRAM
...7B Beta Nf4 Fp16 Upscaled GGUF0912 GB

Best Alternatives to Zephyr 7B Beta Nf4 Fp16 Upscaled

Best Alternatives
Context / RAM
Downloads
Likes
...al Nemo Instruct 2407 Bnb 4bit1000K / 8.3 GB1279525
...istral Nemo Base 2407 Bnb 4bit1000K / 8.3 GB683713
...t 3.5 0106 128K 8.0bpw H8 EXL2128K / 7.4 GB151
...t 3.5 0106 128K 4.0bpw H6 EXL2128K / 3.9 GB111
...tral 7B Instruct V0.3 Bnb 4bit32K / 4.1 GB23911916
Mistral 7B V0.3 Bnb 4bit32K / 4.1 GB4154814
User23ContinuedFine32K / 14.5 GB8830
Mistral 7B Instruct V0.2 Fp1632K / 14.4 GB290
Mistral 7B Instruct V0.2 4bit32K / 4.3 GB2361
NaturalLM 7B Instruct32K / 14.5 GB4010
Note: green Score (e.g. "73.2") means that the model is better than arnavgrg/zephyr-7b-beta-nf4-fp16-upscaled.

Rank the Zephyr 7B Beta Nf4 Fp16 Upscaled Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 41363 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227