Zephyr Quiklang 3B 4K by Walmart-the-bag


Base model:finetune:walmart-th... Base model:walmart-the-bag/zep...   Causal lm   Conversational   Custom code   Dataset:teknium/openhermes   Feature-extraction   Pytorch   Region:us   Stablelm epoch

Zephyr Quiklang 3B 4K Benchmarks

Zephyr Quiklang 3B 4K (Walmart-the-bag/zephyr-quiklang-3b-4K)

Zephyr Quiklang 3B 4K Parameters and Internals

Model Type 
text-generation, causal_lm
Additional Notes 
This is the 4K version, trained on 1,000 more openhermes samples than the original model.
Training Details 
Data Sources:
teknium/openhermes, unalignment/toxic-dpo-v0.1
Data Volume:
10000 samples
Methodology:
finetuning
Hardware Used:
1xA6000-48GB
LLM Name: Zephyr Quiklang 3B 4K
Repository: https://huggingface.co/Walmart-the-bag/zephyr-quiklang-3b-4K
Base Model(s): Zephyr Quiklang 3B (Walmart-the-bag/zephyr-quiklang-3b)
Model Size: 3b
Required VRAM: 5.9 GB
Updated: 2025-03-12
Maintainer: Walmart-the-bag
Model Type: stablelm_epoch
Model Files: 5.9 GB, 0.0 GB
Model Architecture: StableLMEpochForCausalLM
License: other
Context Length: 4096
Model Max Length: 4096
Transformers Version: 4.34.1
Tokenizer Class: GPTNeoXTokenizer
Padding Token: <|endoftext|>
Vocabulary Size: 50304
Torch Data Type: float16
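
The fields in this listing map fairly directly onto load-time settings, so a minimal sketch may help. It assumes the Hugging Face `transformers` and `torch` packages are installed, and that `trust_remote_code=True` is needed because the listing tags the model as "Custom code"; whether the model loads will depend on the installed library versions. The helper names (`estimate_weight_memory_gb`, `max_new_tokens_for`, `load_model`) are hypothetical and are not part of the model repository. The memory figure is a rough estimate from a nominal 3e9 parameter count, not the listing's measured 5.9 GB.

```python
# Values copied from the listing; everything else is illustrative.
REPO_ID = "Walmart-the-bag/zephyr-quiklang-3b-4K"
CONTEXT_LENGTH = 4096
TORCH_DTYPE = "float16"

# Bytes used to store one parameter for common dtypes.
BYTES_PER_PARAM = {"float32": 4, "float16": 2, "bfloat16": 2}


def estimate_weight_memory_gb(num_params: float, dtype: str = TORCH_DTYPE) -> float:
    """Approximate weight memory in decimal GB (1 GB = 1e9 bytes).

    Covers the weights only; activations and the KV cache need extra memory.
    """
    return num_params * BYTES_PER_PARAM[dtype] / 1e9


def max_new_tokens_for(prompt_tokens: int, context_length: int = CONTEXT_LENGTH) -> int:
    """Tokens left for generation once the prompt is counted against the context."""
    return max(context_length - prompt_tokens, 0)


def load_model(repo_id: str = REPO_ID):
    """Load tokenizer and model; downloads the weights on first use.

    Imports are local so the helpers above work without torch installed.
    trust_remote_code=True executes code from the repository (an assumption
    based on the "Custom code" tag), so review that code before enabling it.
    """
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(repo_id, trust_remote_code=True)
    model = AutoModelForCausalLM.from_pretrained(
        repo_id,
        torch_dtype=getattr(torch, TORCH_DTYPE),
        trust_remote_code=True,
    )
    return tokenizer, model


if __name__ == "__main__":
    # Nominal "3b" size at 2 bytes per parameter.
    print(f"Estimated weights: {estimate_weight_memory_gb(3e9):.1f} GB")
    print(f"Room left after a 4000-token prompt: {max_new_tokens_for(4000)} tokens")
```

Calling `load_model()` is left out of the main block on purpose, since it downloads several gigabytes. The estimate lands close to the listed Required VRAM, which suggests that figure reflects the float16 weights rather than total inference memory.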

Quantized Models of the Zephyr Quiklang 3B 4K

Model   Likes   Downloads   VRAM
Zephyr Quiklang 3B 4K GGUF   41901 GB
Zephyr Quiklang 3B 4K GPTQ   2281 GB

Best Alternatives to Zephyr Quiklang 3B 4K

Best Alternatives   Context / RAM   Downloads   Likes
Stable Code 3B Mlx   16K / 5.6 GB   181
Aura 3B   4K / 5.6 GB   162
Slim Extract   4K / 5.6 GB   5312
Slim Sa Ner   4K / 5.6 GB   666
Slim Boolean   4K / 5.6 GB   444
Slim Tags 3B   4K / 5.6 GB   534
Slim Summary   4K / 5.6 GB   597
Slim Xsum   4K / 5.6 GB   506
Memphis CoT 3B   4K / 5.6 GB   19629
Fett Uccine Mini 3B   4K / 5.6 GB   1423

Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227