Meditron 7B by epfl-llm


  Arxiv:2311.16079   Autotrain compatible Base model:finetune:meta-llama... Base model:meta-llama/llama-2-...   Dataset:epfl-llm/guidelines   En   Endpoints compatible   Llama   Pytorch   Region:us   Safetensors   Sharded   Tensorflow
Model Card on HF: https://huggingface.co/epfl-llm/meditron-7b

Meditron 7B Parameters and Internals

Model Type 
Causal decoder-only transformer language model
Use Cases 
Areas:
Medical exam question answering, Supporting differential diagnosis, Disease information query, General health information query
Limitations:
Not recommended for production use, Unsuitable for professional purposes related to health and medicine
Considerations:
Use in production environments requires rigorous evaluation and alignment processes.
Additional Notes 
Significant research is still required to fully explore potential bias, fairness, and safety issues.
Supported Languages 
English (mainly)
Training Details 
Data Sources:
Clinical Guidelines, Medical Paper Abstracts, Medical Papers, Replay Data
Data Volume:
48.1B tokens
Methodology:
Continued pretraining
Context Length:
2048
Training Time:
September 2023
Hardware Used:
1 node of 8x NVIDIA A100 (80GB) SXM GPUs
Model Architecture:
Llama 2, Hidden dimension: 4096, Number of attention heads: 32, Number of layers: 32
Input Output 
Input Format:
Text-only data
Accepted Modalities:
Text
Output Format:
Text
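The architecture figures above are enough for a rough parameter count. The sketch below is a back-of-the-envelope estimate, not an official figure: the MLP width of 11008 and the untied output head are assumptions taken from the standard Llama 2 7B configuration and are not stated on this page. The vocabulary size of 32017 comes from the metadata listed further down.

```python
# Figures listed on this page.
HIDDEN_SIZE = 4096
NUM_HEADS = 32
NUM_LAYERS = 32
VOCAB_SIZE = 32017

# Assumption: MLP width of the standard Llama 2 7B config (not stated on this page).
INTERMEDIATE_SIZE = 11008


def count_parameters(hidden, layers, vocab, intermediate):
    """Estimate the parameter count of a Llama-style decoder (no biases)."""
    embeddings = vocab * hidden           # input token embeddings
    lm_head = vocab * hidden              # output projection, assumed untied
    attention = 4 * hidden * hidden       # q, k, v and o projections
    mlp = 3 * hidden * intermediate       # gate, up and down projections
    norms = 2 * hidden                    # two RMSNorm weight vectors per layer
    per_layer = attention + mlp + norms
    final_norm = hidden
    return embeddings + lm_head + layers * per_layer + final_norm


head_dim = HIDDEN_SIZE // NUM_HEADS
total_params = count_parameters(HIDDEN_SIZE, NUM_LAYERS, VOCAB_SIZE, INTERMEDIATE_SIZE)
bf16_gb = total_params * 2 / 1e9          # bfloat16 stores 2 bytes per parameter

print(f"head dimension: {head_dim}")
print(f"parameters: {total_params:,}")
print(f"bfloat16 weights: {bf16_gb:.2f} GB")
```

Under these assumptions the estimate is about 6.74 billion parameters, or roughly 13.5 GB of bfloat16 weights, which is in the same range as the 13.4 GB of required VRAM listed on this page.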
LLM Name: Meditron 7B
Repository: https://huggingface.co/epfl-llm/meditron-7b
Base Model(s): Llama 2 7B (meta-llama/Llama-2-7b)
Model Size: 7b
Required VRAM: 13.4 GB
Updated: 2025-02-22
Maintainer: epfl-llm
Model Type: llama
Model Files: 1.9 GB (1-of-8), 1.9 GB (2-of-8), 1.8 GB (3-of-8), 1.9 GB (4-of-8), 1.9 GB (5-of-8), 1.8 GB (6-of-8), 1.9 GB (7-of-8), 0.3 GB (8-of-8), 1.9 GB (1-of-8), 1.9 GB (2-of-8), 1.8 GB (3-of-8), 1.9 GB (4-of-8), 1.9 GB (5-of-8), 1.8 GB (6-of-8), 1.9 GB (7-of-8), 0.3 GB (8-of-8)
Supported Languages: en
Gated Model: Yes
Model Architecture: LlamaForCausalLM
License: proprietary
Context Length: 2048
Model Max Length: 2048
Transformers Version: 4.35.2
Tokenizer Class: LlamaTokenizer
Padding Token: <PAD>
Vocabulary Size: 32017
Torch Data Type: bfloat16
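The metadata above (repository ID, bfloat16 weights, 2048-token maximum length) suggests how the model might be loaded with Hugging Face Transformers. The following is a minimal sketch, not the maintainers' recommended recipe. The helper names `prompt_budget`, `truncate_prompt`, and `load_model` are hypothetical, and keeping the most recent tokens when a prompt is too long is just one possible truncation policy. Because the model is gated, `load_model` assumes access has already been granted and a Hugging Face token is configured locally.

```python
MODEL_ID = "epfl-llm/meditron-7b"   # repository listed on this page
MAX_LENGTH = 2048                   # context length listed on this page


def prompt_budget(max_new_tokens, max_length=MAX_LENGTH):
    """Return how many prompt tokens fit alongside max_new_tokens."""
    if not 0 < max_new_tokens < max_length:
        raise ValueError("max_new_tokens must be between 1 and max_length - 1")
    return max_length - max_new_tokens


def truncate_prompt(token_ids, max_new_tokens, max_length=MAX_LENGTH):
    """Keep only the most recent prompt tokens that fit in the context window.

    Note: left-truncation drops a leading BOS token if the prompt is too long.
    """
    budget = prompt_budget(max_new_tokens, max_length)
    return list(token_ids[-budget:])


def load_model(model_id=MODEL_ID):
    """Load tokenizer and model in bfloat16 (needs torch, transformers, hub access)."""
    # Imports are deferred so the helpers above work without these packages.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype=torch.bfloat16)
    return tokenizer, model
```

With `max_new_tokens=256`, for example, `prompt_budget` leaves 1792 tokens for the prompt. Calling `load_model()` downloads roughly 13 GB of weights, so it is left uncalled here.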

Quantized Models of the Meditron 7B

Model
Likes
Downloads
VRAM
...editron 7B Lora Finetuned 4bit0223 GB
Meditron 7B AWQ290463 GB
Meditron 7B GGUF216942 GB
Meditron 7B GPTQ3403 GB
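The quantized variants above appear to need far less memory than the 13.4 GB listed for the bfloat16 weights. A rough, weights-only calculation shows why. This is a sketch under stated assumptions: the parameter count of 6.74 billion is the nominal Llama 2 7B figure rather than a number from this page, and real quantized files also carry scales, zero-points, and some unquantized layers, so actual sizes will differ.

```python
# Assumption: nominal Llama 2 7B parameter count (not stated on this page).
NOMINAL_PARAMS = 6.74e9


def weight_memory_gb(num_params, bits_per_weight):
    """Weights-only memory in decimal GB; ignores activations and the KV cache."""
    return num_params * bits_per_weight / 8 / 1e9


for bits in (16, 8, 4):
    print(f"{bits:>2}-bit weights: ~{weight_memory_gb(NOMINAL_PARAMS, bits):.2f} GB")
```

Halving the bit width halves the weight memory, so a 4-bit variant needs about a quarter of the 16-bit footprint before quantization overhead is added.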

Best Alternatives to Meditron 7B

Best Alternatives
Context / RAM
Downloads
Likes
2 Very Sci Fi1024K / 16.1 GB3170
...1M 1000000ctx AEZAKMI 3 1 17021024K / 13.5 GB231
... Qwen2.5llamaify 7B V23.1 200K195K / 15.2 GB39433
LlamaStock 8B128K / 16.1 GB111
SuperNeuralDreadDevil 8B128K / 16.1 GB541
Yarn Llama 2 7B 128K128K / 13.5 GB642239
LLaMA 7B PoSE YaRN 128K128K / 13.5 GB73
LLaMA 7B PoSE Linear 96K96K / 27 GB92
LLaMA 7B PoSE YaRN 96K96K / 13.5 GB111
Chat Llama2 7B 80K80K / 13.8 GB80

Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227