Dictalm2.0 GPTQ by dicta-il

 ยป  All LLMs  ยป  dicta-il  ยป  Dictalm2.0 GPTQ   URL Share it on

  Arxiv:2407.07080   4-bit   Autotrain compatible   En   Gptq   He   Mistral   Pretrained   Quantized   Region:us   Safetensors

Dictalm2.0 GPTQ Benchmarks

nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").
Dictalm2.0 GPTQ (dicta-il/dictalm2.0-GPTQ)

Dictalm2.0 GPTQ Parameters and Internals

Model Type 
text generation
Use Cases 
Areas:
Research, Commercial applications involving Hebrew text
Applications:
Hebrew text processing, Instruction capabilities in Hebrew
Primary Use Cases:
Text generation in Hebrew
Limitations:
No moderation mechanisms
Additional Notes 
Default version does not include moderation mechanisms.
Supported Languages 
en (Supported), he (Primary support)
Training Details 
Data Sources:
Hebrew text, General text data for foundational training
Data Volume:
Not specified
Methodology:
Extended tokenizer with 1,000 Hebrew-specific tokens
Model Architecture:
Based on the Mistral-7B-v0.1 with extended tokenizer for Hebrew
Input Output 
Accepted Modalities:
text
LLM NameDictalm2.0 GPTQ
Repository ๐Ÿค—https://huggingface.co/dicta-il/dictalm2.0-GPTQ 
Model Size1.2b
Required VRAM4.2 GB
Updated2024-12-26
Maintainerdicta-il
Model Typemistral
Model Files  4.2 GB
Supported Languagesen he
GPTQ QuantizationYes
Quantization Typegptq
Model ArchitectureMistralForCausalLM
Licenseapache-2.0
Context Length32768
Model Max Length32768
Transformers Version4.36.2
Tokenizer ClassLlamaTokenizer
Vocabulary Size33152
Torch Data Typebfloat16

Best Alternatives to Dictalm2.0 GPTQ

Best Alternatives
Context / RAM
Downloads
Likes
... Finetune 16bit Ver9 Main GPTQ32K / 4.2 GB130
Dictalm2.0 Instruct GPTQ32K / 4.2 GB1010
Multi Verse Model GPTQ32K / 4.2 GB781
Turdus GPTQ32K / 4.2 GB315
Garrulus GPTQ32K / 4.2 GB223
HamSter 0.1 GPTQ32K / 4.2 GB312
Phoenix GPTQ32K / 4.2 GB271
...hat 3.5 1210 Seraph Slerp GPTQ32K / 4.2 GB322
Mistral Ft Optimized 1227 GPTQ32K / 4.2 GB182
...h Openchat 3.5 1210 Slerp GPTQ32K / 4.2 GB191
Note: green Score (e.g. "73.2") means that the model is better than dicta-il/dictalm2.0-GPTQ.

Rank the Dictalm2.0 GPTQ Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 40303 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227