Llama 3 5B Sheard by raincandy-u

 ยป  All LLMs  ยป  raincandy-u  ยป  Llama 3 5B Sheard   URL Share it on

  Autotrain compatible   Conversational   Dataset:jeankaddour/minipile Dataset:raincandy-u/slimorca-l...   En   Endpoints compatible   Facebook   Llama   Llama-3   Meta   Pytorch   Region:us   Safetensors

Llama 3 5B Sheard Benchmarks

Llama 3 5B Sheard (raincandy-u/Llama-3-5B-Sheard)

Llama 3 5B Sheard Parameters and Internals

Model Type 
text-generation
Additional Notes 
This model is for testing purposes only; the output may repeat and not stop when the system prompt is not empty!
Supported Languages 
en (Full)
Training Details 
Data Sources:
JeanKaddour/minipile, raincandy-u/SlimOrca-Llama-3-Preference-DPO-Pairs
Methodology:
Sliced by Mergekit, continue-pretrained on minipile for 1 epoch and ~100k samples, followed by ORPO training on Llama-3-70b generated DPO pairs.
LLM NameLlama 3 5B Sheard
Repository ๐Ÿค—https://huggingface.co/raincandy-u/Llama-3-5B-Sheard 
Model Size5b
Required VRAM11.7 GB
Updated2025-02-12
Maintainerraincandy-u
Model Typellama
Model Files  11.7 GB
Supported Languagesen
Model ArchitectureLlamaForCausalLM
Licenseother
Context Length8192
Model Max Length8192
Transformers Version4.39.3
Tokenizer ClassPreTrainedTokenizerFast
Padding Token<|end_of_text|>
Vocabulary Size128256
Torch Data Typebfloat16

Best Alternatives to Llama 3 5B Sheard

Best Alternatives
Context / RAM
Downloads
Likes
Ko Llama 3.1 5B Instruct128K / 23.4 GB50
Llama 3.1 5B Instruct8K / 10.9 GB3257
Triangulum 5B8K / 10.9 GB698
Triangulum 5B It8K / 10.9 GB248
Mermaid Llama 3 5B Pruned8K / 10.9 GB71
Ko Llama 230M 0317 5B2K / 0.6 GB1440
Linux As A Model 5M0.5K / 0 GB1611
HelpingAI2.5 5B128K / 10.3 GB9872
HelpingAI2.5 5B128K / 10.3 GB612
Airoboros C34B 3.1.2 AWQ16K / 18.3 GB131
Note: green Score (e.g. "73.2") means that the model is better than raincandy-u/Llama-3-5B-Sheard.

Rank the Llama 3 5B Sheard Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 42980 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227