Arabic DeepSeek R1 Distill 8B by Omartificial-Intelligence-Space

Tags: Arxiv:2501.12948, 4bit, Adapter, Ar, Arabic, Base model:adapter:unsloth/dee..., Base model:unsloth/deepseek-r1..., Conversational, Dataset:omartificial-intellige..., Deepseek-r1, Finetuned, Lora, Peft, Quantized, Region:us, Safetensors, Unsloth

Arabic DeepSeek R1 Distill 8B Benchmarks

Scores (nn.n%) show how the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o"), or GPT-4 ("gpt4").
Arabic DeepSeek R1 Distill 8B (Omartificial-Intelligence-Space/Arabic-DeepSeek-R1-Distill-8B)

Arabic DeepSeek R1 Distill 8B Parameters and Internals

LLM Name: Arabic DeepSeek R1 Distill 8B
Repository: https://huggingface.co/Omartificial-Intelligence-Space/Arabic-DeepSeek-R1-Distill-8B
Base Model(s): ...till Llama 8B Unsloth Bnb 4bit (unsloth/DeepSeek-R1-Distill-Llama-8B-unsloth-bnb-4bit)
Model Size: 8b
Required VRAM: 0 GB
Updated: 2025-02-22
Maintainer: Omartificial-Intelligence-Space
Model Files: 0.0 GB, 0.0 GB, 0.0 GB, 0.0 GB
Supported Languages: ar
Quantization Type: 4bit
Model Architecture: Adapter
License: apache-2.0
Model Max Length: 131072
Is Biased: none
Tokenizer Class: LlamaTokenizer
Padding Token: <|finetune_right_pad_id|>
PEFT Type: LORA
LoRA Model: Yes
PEFT Target Modules: o_proj|k_proj|q_proj|v_proj
LoRA Alpha: 16
LoRA Dropout: 0
R Param: 4
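
To make the LoRA settings listed above more concrete, the following is a minimal sketch of what they imply under the standard LoRA formulation, where the low-rank update is scaled by alpha / r and each adapted linear layer adds r * (in_features + out_features) weights. The rank, alpha, and target modules are taken from the listing. The projection shapes and layer count are assumptions based on the usual Llama 3.1 8B layout and are not stated on this page, so the parameter count is only an estimate.

```python
# Values copied from the listing above.
LORA_R = 4
LORA_ALPHA = 16
TARGET_MODULES = "o_proj|k_proj|q_proj|v_proj".split("|")

# ASSUMPTION: (in_features, out_features) of the attention projections in a
# Llama-3.1-8B-style decoder. These shapes are not given in the listing.
ASSUMED_SHAPES = {
    "q_proj": (4096, 4096),
    "k_proj": (4096, 1024),
    "v_proj": (4096, 1024),
    "o_proj": (4096, 4096),
}
# ASSUMPTION: number of decoder layers, also not given in the listing.
ASSUMED_NUM_LAYERS = 32


def lora_scaling(alpha: int, r: int) -> float:
    """Scaling applied to the low-rank update in standard LoRA (alpha / r)."""
    return alpha / r


def lora_param_count(r, modules, shapes, num_layers):
    """Count adapter weights: each module adds A (r x in) and B (out x r)."""
    per_layer = sum(r * (shapes[m][0] + shapes[m][1]) for m in modules)
    return per_layer * num_layers


scaling = lora_scaling(LORA_ALPHA, LORA_R)
adapter_params = lora_param_count(
    LORA_R, TARGET_MODULES, ASSUMED_SHAPES, ASSUMED_NUM_LAYERS
)

print(f"LoRA scaling factor: {scaling}")
print(f"Estimated adapter parameters: {adapter_params:,}")
```

Under these assumptions the adapter is a few million weights on top of the 4-bit base model. The figure is an estimate only and does not come from the listing.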

Best Alternatives to Arabic DeepSeek R1 Distill 8B

Best Alternatives
Context / RAM
Downloads
Likes
Llama 3 8B Muxue 1.0 Lora0K / 0.2 GB250
...ma3 Instruct Chat FCAIBylaw V10K / 0.2 GB50
BlueMoon Llama30K / 0.7 GB100
Llama3 RP ORPO LoRA0K / 0.7 GB360
Theory Of Mind Llama30K / 1.3 GB70
Llama3 Aesir Preview LoRA 1280K / 1.3 GB380
Llama3 RP ORPO LoRA0K / 0.7 GB50
RP Format QuoteAsterisk Llama30K / 1.3 GB50
Lora Model0K / 0.2 GB60
Wally FS AI0K / 5.8 GB50

Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227