Llama 3 13B Instruct Ft by elinas


Autotrain compatible   Base model:elinas/llama-3-13b-...   Base model:finetune:elinas/lla...   Conversational   Dataset:chat-error/pure-dove-s...   Endpoints compatible   Instruct   Llama   Merge   Mergekit   Region:us   Safetensors   Sharded   Tensorflow

Llama 3 13B Instruct Ft Benchmarks

Llama 3 13B Instruct Ft (elinas/Llama-3-13B-Instruct-ft)

Llama 3 13B Instruct Ft Parameters and Internals

Model Type 
text generation
Additional Notes 
This model was an experiment with a mid-sized model, intended to test finetuning on a small dataset. The next steps involve further testing and potentially larger datasets. Sample packing and padding were disabled to reduce VRAM usage. Performance was tested with RoPE at context lengths up to 32k.
Training Details 
Data Sources:
Chat-Error/Pure-dove-sharegpt
Data Volume:
8192 context length
Methodology:
QLoRA finetuning
Context Length:
8192
Training Time:
4h 12m 13s
Hardware Used:
3x3090 GPUs
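
The notes above mention testing with RoPE up to 32k against a training context length of 8192. As a rough sketch, assuming linear RoPE scaling and reading "32k" as 32768 tokens (the source states neither), the scaling factor is the ratio of the two lengths. The helper name `rope_scaling_config` is hypothetical; the returned dictionary follows the `type`/`factor` shape that Llama configs in Transformers releases around 4.40 accept for `rope_scaling`, which should be checked against the installed version.

```python
# Context length listed for this model in the training details.
NATIVE_CONTEXT = 8192


def rope_scaling_config(target_context, native_context=NATIVE_CONTEXT,
                        scaling_type="linear"):
    """Build a hypothetical rope_scaling dictionary for a longer context.

    Assumption: linear scaling, where the factor is target / native.
    The source does not say which scaling type was actually used.
    """
    if target_context <= native_context:
        raise ValueError("target_context must exceed native_context")
    factor = target_context / native_context
    return {"type": scaling_type, "factor": factor}


# Assumption: "32k" is read as 32 * 1024 = 32768 tokens.
config = rope_scaling_config(32 * 1024)
print(config)  # {'type': 'linear', 'factor': 4.0}
```

Under these assumptions, stretching 8192 tokens to 32768 corresponds to a factor of 4.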
LLM Name: Llama 3 13B Instruct Ft
Repository: https://huggingface.co/elinas/Llama-3-13B-Instruct-ft
Base Model(s): Llama 3 13B Instruct (elinas/Llama-3-13B-Instruct)
Model Size: 13b
Required VRAM: 26.1 GB
Updated: 2024-12-22
Maintainer: elinas
Model Type: llama
Instruction-Based: Yes
Model Files: 5.0 GB (1 of 6), 5.0 GB (2 of 6), 4.9 GB (3 of 6), 5.0 GB (4 of 6), 4.9 GB (5 of 6), 1.3 GB (6 of 6)
Model Architecture: LlamaForCausalLM
License: llama3
Context Length: 8192
Model Max Length: 8192
Transformers Version: 4.40.0.dev0
Tokenizer Class: PreTrainedTokenizerFast
Padding Token: <|end_of_text|>
Vocabulary Size: 128256
Torch Data Type: bfloat16
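
The listed figures can be cross-checked with simple arithmetic: the six shard sizes should add up to the stated 26.1 GB, and at 2 bytes per bfloat16 parameter that size implies roughly 13 billion parameters. The sketch below assumes 1 GB = 10^9 bytes and that the files consist almost entirely of weights; the function names are illustrative only.

```python
# Shard sizes in GB, copied from the Model Files entry.
SHARD_SIZES_GB = [5.0, 5.0, 4.9, 5.0, 4.9, 1.3]

# bfloat16 stores each parameter in 2 bytes.
BYTES_PER_PARAM_BF16 = 2


def total_size_gb(shards):
    """Sum the shard sizes to get the full checkpoint size in GB."""
    return sum(shards)


def estimated_params_billions(size_gb, bytes_per_param=BYTES_PER_PARAM_BF16):
    """Estimate the parameter count in billions from the checkpoint size.

    Assumes 1 GB = 1e9 bytes and ignores any non-weight data in the files.
    """
    return size_gb / bytes_per_param


total = total_size_gb(SHARD_SIZES_GB)
print(f"Total checkpoint size: {total:.1f} GB")
print(f"Estimated parameters: {estimated_params_billions(total):.2f}B")
```

The total matches the Required VRAM entry, which suggests that figure covers the bfloat16 weights alone and leaves out activation and KV-cache memory at inference time.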

Best Alternatives to Llama 3 13B Instruct Ft

Best Alternatives
Context / RAM
Downloads
Likes
CodeLlama 13B MORepair   16K / 26 GB   26502
NexusRaven V2 13B   16K / 26 GB   3919465
CodeLlama 13B Instruct Hf   16K / 26 GB   16223144
CodeLlama 13B Instruct Hf   16K / 26 GB   99318
TableLLM 13B   16K / 26 GB   23525
... Llama 2 13B Instruct Text2sql   16K / 26 GB   732727
NexusRaven 13B   16K / 26 GB   158103
Panda Coder 13B   16K / 26 GB   8613
Gen Sim   16K / 0.3 GB   172
Llama 2 13B SoftwareReq   8K / 16.1 GB   90

Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241217