MMedLM2 1 8B by Henrychur

 ยป  All LLMs  ยป  Henrychur  ยป  MMedLM2 1 8B   URL Share it on

  Arxiv:2402.13963   Custom code   Dataset:henrychur/mmedc   En   Es   Feature-extraction   Fr   Internlm2   Ja   Medical   Region:us   Ru   Safetensors   Sharded   Tensorflow   Zh
Model Card on HF ๐Ÿค—: https://huggingface.co/Henrychur/MMedLM2-1_8B 

MMedLM2 1 8B Benchmarks

nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").
MMedLM2 1 8B (Henrychur/MMedLM2-1_8B)

MMedLM2 1 8B Parameters and Internals

Model Type 
Multilingual, Medical, Auto-regressive
Use Cases 
Areas:
Medical domain research and applications
Applications:
Multilingual medical language processing
Primary Use Cases:
Multilingual medical text analysis
Additional Notes 
Foundation model not instruction fine-tuned yet.
Supported Languages 
languages_supported_and_proficiency_levels (English, Chinese, Japanese, French, Russian, Spanish)
Training Details 
Data Sources:
MMedC
Data Volume:
25.5B tokens
Methodology:
Auto-regressive pretraining
Context Length:
2048
Hardware Used:
GPU
Model Architecture:
Foundation model based on InternLM with further training on MMedC
Input Output 
Input Format:
Tokenized input with predefined tokenizer
Accepted Modalities:
Text
Output Format:
Textual output
Performance Tips:
Ensure using suggested tokenizer and torch float16
Release Notes 
Version:
2-1.8B
Date:
2024-02-21
Notes:
Released preprint paper and model
LLM NameMMedLM2 1 8B
Repository ๐Ÿค—https://huggingface.co/Henrychur/MMedLM2-1_8B 
Model Size8b
Required VRAM7.6 GB
Updated2025-02-05
MaintainerHenrychur
Model Typeinternlm2
Model Files  5.0 GB: 1-of-2   2.6 GB: 2-of-2
Supported Languagesen zh ja fr ru es
Model ArchitectureInternLM2ForCausalLM
Licensecc-by-4.0
Context Length32768
Model Max Length32768
Transformers Version4.38.0
Is Biased0
Tokenizer ClassInternLM2Tokenizer
Padding Token</s>
Vocabulary Size92544
Torch Data Typefloat32

Best Alternatives to MMedLM2 1 8B

Best Alternatives
Context / RAM
Downloads
Likes
Internlm2 5 1 8b Chat32K / 3.8 GB763725
Internlm2 5 1 8b32K / 3.8 GB105723
Internlm2 Chat 1 8b32K / 3.8 GB605930
Internlm2 1 8b32K / 3.8 GB866030
...ternlm2 Chat 1 8b Ultracabrita32K / 3.8 GB5470
Internlm2 Chat 1 8b ExPO32K / 3.8 GB1221
Internlm2 Chat 1 8b Sft32K / 3.8 GB1909
Internlm2 Math Plus 1 8b8K / 3.8 GB39810
Note: green Score (e.g. "73.2") means that the model is better than Henrychur/MMedLM2-1_8B.

Rank the MMedLM2 1 8B Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 42577 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227