Llama3 OpenBioLLM 8B by aaditya

 ยป  All LLMs  ยป  aaditya  ยป  Llama3 OpenBioLLM 8B   URL Share it on

  Arxiv:2212.13138   Arxiv:2303.13375   Arxiv:2305.09617   Arxiv:2305.18290   Arxiv:2402.07023   Autotrain compatible Base model:finetune:meta-llama... Base model:meta-llama/meta-lla...   Chatml   Distillation   Dpo   En   Endpoints compatible   Finetuned   Gpt4   Instruct   Llama   Llama-3   Mixtral   Pytorch   Region:us   Rlhf   Sharded

Llama3 OpenBioLLM 8B Benchmarks

Llama3 OpenBioLLM 8B (aaditya/Llama3-OpenBioLLM-8B)

Llama3 OpenBioLLM 8B Parameters and Internals

Model Type 
biomedical, large language model
Use Cases 
Areas:
Research, Biomedical community
Applications:
Clinical note summarization, Medical question answering, Clinical entity recognition, Biomarkers extraction, Classification, De-Identification
Primary Use Cases:
Research assistance in healthcare and biomedical fields
Limitations:
Not evaluated in randomized controlled trials, Not for direct patient care or clinical decision support
Considerations:
Limited to research and exploratory applications by qualified individuals.
Additional Notes 
Model intended for research and non-clinical use. Extensive testing required for clinical settings.
Supported Languages 
en (English)
Training Details 
Data Sources:
Custom Medical Instruct dataset, DPO dataset
Methodology:
Fine-tuned using DPO dataset and diverse medical instruction dataset
Hardware Used:
GPU: H100 80GB SXM5
Input Output 
Input Format:
Chat template from Llama-3 instruct version
Accepted Modalities:
text
Performance Tips:
Use temperature = 0 to reduce verbosity.
Release Notes 
Notes:
A powerful tool for the biomedical domain, advancing open-source models in healthcare and life sciences.
LLM NameLlama3 OpenBioLLM 8B
Repository ๐Ÿค—https://huggingface.co/aaditya/Llama3-OpenBioLLM-8B 
Base Model(s)  Meta Llama 3 8B   meta-llama/Meta-Llama-3-8B
Model Size8b
Required VRAM16.1 GB
Updated2025-06-01
Maintaineraaditya
Model Typellama
Model Files  5.0 GB: 1-of-4   5.0 GB: 2-of-4   4.9 GB: 3-of-4   1.2 GB: 4-of-4
Supported Languagesen
Model ArchitectureLlamaForCausalLM
Licensellama3
Context Length8192
Model Max Length8192
Transformers Version4.40.0.dev0
Tokenizer ClassPreTrainedTokenizerFast
Padding Token<|end_of_text|>
Vocabulary Size128256
Torch Data Typebfloat16

Quantized Models of the Llama3 OpenBioLLM 8B

Model
Likes
Downloads
VRAM
...OpenBioLLM 8B Bnb 4bit Smashed0256 GB
...OpenBioLLM 8B HQQ 2bit Smashed0164 GB
...OpenBioLLM 8B AWQ 4bit Smashed0155 GB

Best Alternatives to Llama3 OpenBioLLM 8B

Best Alternatives
Context / RAM
Downloads
Likes
...otron 8B UltraLong 4M Instruct4192K / 32.1 GB3482107
UltraLong Thinking4192K / 16.1 GB3122
...a 3.1 8B UltraLong 4M Instruct4192K / 32.1 GB17624
...a 3.1 8B UltraLong 2M Instruct2096K / 32.1 GB8759
...otron 8B UltraLong 2M Instruct2096K / 32.1 GB41815
Zero Llama 3.1 8B Beta61048K / 16.1 GB7301
...otron 8B UltraLong 1M Instruct1048K / 32.1 GB179443
...a 3.1 8B UltraLong 1M Instruct1048K / 32.1 GB138729
...xis Bookwriter Llama3.1 8B Sft1048K / 16.1 GB534
...dger Nu Llama 3.1 8B UltraLong1048K / 16.2 GB573
Note: green Score (e.g. "73.2") means that the model is better than aaditya/Llama3-OpenBioLLM-8B.

Rank the Llama3 OpenBioLLM 8B Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 47770 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227