Llama3 8B Slerp Med 262K by shanchen


Tags: Merged Model, Autotrain compatible, Base model:gradientai/llama-3-..., Base model:johnsnowlabs/jsl-me..., Conversational, Endpoints compatible, Gradientai/llama-3-8b-instruct..., Instruct, Johnsnowlabs/jsl-medllama-3-8b..., License:llama3, Llama, Region:us, Safetensors, Sharded, Tensorflow, Zh

Llama3 8B Slerp Med 262K (shanchen/llama3-8B-slerp-med-262k)

Best Alternatives to Llama3 8B Slerp Med 262K

Best Alternatives                    HF Rank   Context / VRAM
...Antino 3 ANITA 8B Inst DPO ITA    75.12     8K / 16.1 GB     659719
...ama 3 SauerkrautLM 8B Instruct    73.74     8K / 16.1 GB     2941347
Llama 3 8B Instruct V0.8             73.17     8K / 16 GB       28651
Barcenas Llama3 8B ORPO              72.5      8K / 16.1 GB     86096
Llama 3 Stella 8B                    72.17     8K / 16 GB       13271
Llama3 8B Spaetzle V13               71.26     8K / 16 GB       16430
Llama 3 8B Instruct V0.4             70.3      8K / 16 GB       34352
Llama 3 Stinky V2 8B                 70.27     8K / 16 GB       15884
ChimeraLlama 3 8B V3                 70.06     8K / 16 GB       253114
Meta Llama 3 8B Instruct DPO         69.84     8K / 16.1 GB     13724
Note: a green score (e.g. "73.2") means that the model scores higher than shanchen/llama3-8B-slerp-med-262k.

Llama3 8B Slerp Med 262K Parameters and Internals

LLM Name: Llama3 8B Slerp Med 262K
Repository: shanchen/llama3-8B-slerp-med-262k (Hugging Face)
Base Model(s): Llama 3 8B Instruct 262K (gradientai/Llama-3-8B-Instruct-262k), JSL MedLlama 3 8B V1.0 (johnsnowlabs/JSL-MedLlama-3-8B-v1.0)
Merged Model: Yes
Model Size: 8B
Required VRAM: 16 GB
Model Type: llama
Model Files: 9.9 GB (1 of 2), 6.1 GB (2 of 2)
Supported Languages: zh
Model Architecture: LlamaForCausalLM
Context Length: 262144
Model Max Length: 262144
Transformers Version: 4.40.1
Tokenizer Class: PreTrainedTokenizerFast
Vocabulary Size: 128256
Initializer Range: 0.02
Torch Data Type: bfloat16
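
The figures above are consistent with each other, and a short calculation shows why. The sketch below is only an illustration: it assumes a nominal 8 billion parameters (the listing says "8b", not an exact count) and 2 bytes per bfloat16 weight, and the names `weight_memory_gb` and `BYTES_PER_PARAM` are made up for this example. It covers weights only; activations and the KV cache at long context lengths need additional memory.

```python
# Values copied from the listing; PARAMS is a nominal figure, not an exact count.
PARAMS = 8e9                      # "Model Size: 8b" (assumed to mean ~8 billion)
SHARD_SIZES_GB = [9.9, 6.1]       # "Model Files": 1-of-2 and 2-of-2
CONTEXT_LENGTH = 262144           # "Context Length"

# Bytes used to store one parameter for a few common dtypes.
BYTES_PER_PARAM = {"bfloat16": 2, "float16": 2, "float32": 4}


def weight_memory_gb(num_params: float, dtype: str) -> float:
    """Approximate size of the weights alone, in decimal gigabytes."""
    return num_params * BYTES_PER_PARAM[dtype] / 1e9


weights_gb = weight_memory_gb(PARAMS, "bfloat16")
shards_gb = sum(SHARD_SIZES_GB)
context_k = CONTEXT_LENGTH // 1024  # context length in units of 1024 tokens

print(f"Estimated bfloat16 weights: {weights_gb:.1f} GB")
print(f"Sum of listed shard files:  {shards_gb:.1f} GB")
print(f"Context length: {CONTEXT_LENGTH} tokens = {context_k} x 1024")
```

Under these assumptions the weight estimate and the sum of the two shard files both land at the listed 16 GB, and the context length works out to 256 x 1024 tokens.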

Original data from HuggingFace, OpenCompass and various public git repos.
Release v2024042801