LoKuS 13B by JoSw-14

 ยป  All LLMs  ยป  JoSw-14  ยป  LoKuS 13B   URL Share it on

  Autotrain compatible   Endpoints compatible   Instruct   Llama   Region:us   Safetensors   Sharded   Tensorflow
Model Card on HF ๐Ÿค—: https://huggingface.co/JoSw-14/LoKuS-13B 

LoKuS 13B Benchmarks

nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").
LoKuS 13B (JoSw-14/LoKuS-13B)

LoKuS 13B Parameters and Internals

Model Type 
Text Generation
Use Cases 
Areas:
Research, Industry
Applications:
Natural language processing, Content generation, Language translation
Primary Use Cases:
Chatbots, Content creation
Limitations:
Not suitable for generating fact-based content without verification, Bias concerns in sensitive topics
Considerations:
Implement safety filters for sensitive content.
Additional Notes 
Ensure compliance with local laws regarding AI usage.
Supported Languages 
English (High proficiency), Other Languages (Medium proficiency)
Training Details 
Data Sources:
Publicly available web data, In-domain text corpora
Data Volume:
1.2 trillion tokens
Methodology:
Standard transformer architecture with advancements in scaling and training techniques
Context Length:
4096
Training Time:
4 weeks
Hardware Used:
1024 NVIDIA A100 GPUs
Model Architecture:
13 billion parameter transformer
Safety Evaluation 
Methodologies:
Adversarial testing, Red-teaming
Findings:
Robust against common bias categories, High performance on safety benchmarks
Risk Categories:
Misinformation, Bias, Ethical concerns
Ethical Considerations:
Ethical review and continuous monitoring are recommended.
Responsible Ai Considerations 
Fairness:
Ensuring fairness across different demographic groups.
Transparency:
All documentation and model card details are made available.
Accountability:
Meta AI is responsible for the model's outputs.
Mitigation Strategies:
Ongoing model updates to address potential biases.
Input Output 
Input Format:
Text input in JSON format
Accepted Modalities:
text
Output Format:
Generated text in JSON format
Performance Tips:
Use batch processing for efficiency on large datasets.
Release Notes 
Version:
2.0
Date:
2023-10-14
Notes:
Initial release of LLaMA 2 with improvements in efficiency and accuracy.
LLM NameLoKuS 13B
Repository ๐Ÿค—https://huggingface.co/JoSw-14/LoKuS-13B 
Model Size13b
Required VRAM26.2 GB
Updated2025-03-14
MaintainerJoSw-14
Model Typellama
Instruction-BasedYes
Model Files  5.5 GB: 1-of-5   5.4 GB: 2-of-5   5.5 GB: 3-of-5   5.5 GB: 4-of-5   4.3 GB: 5-of-5
Model ArchitectureLlamaForCausalLM
Licensellama2
Context Length4096
Model Max Length4096
Transformers Version4.32.1
Tokenizer ClassLlamaTokenizer
Beginning of Sentence Token<s>
End of Sentence Token</s>
Unk Token<unk>
Vocabulary Size32001
Torch Data Typefloat16

Quantized Models of the LoKuS 13B

Model
Likes
Downloads
VRAM
LoKuS 13B GGUF15225 GB
LoKuS 13B AWQ1837 GB
LoKuS 13B GPTQ2227 GB
LoKuS 13B GGML185 GB

Best Alternatives to LoKuS 13B

Best Alternatives
Context / RAM
Downloads
Likes
NexusRaven V2 13B16K / 26 GB3836466
CodeLlama 13B Instruct Hf16K / 26 GB22189145
CodeLlama 13B MORepair16K / 26 GB312
CodeLlama 13B Instruct Hf16K / 26 GB104721
TableLLM 13B16K / 26 GB33726
NexusRaven 13B16K / 26 GB182102
Panda Coder 13B16K / 26 GB16913
... Llama 2 13B Instruct Text2sql16K / 26 GB15627
Gen Sim16K / 0.3 GB472
Llama 3 13B Instruct Ft8K / 26.1 GB242
Note: green Score (e.g. "73.2") means that the model is better than JoSw-14/LoKuS-13B.

Rank the LoKuS 13B Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 45019 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227