DCLM 7B by apple

 ยป  All LLMs  ยป  apple  ยป  DCLM 7B   URL Share it on

  Arxiv:2406.11794   Endpoints compatible   Openlm   Region:us   Safetensors   Sharded   Tensorflow
Model Card on HF ๐Ÿค—: https://huggingface.co/apple/DCLM-7B 

DCLM 7B Benchmarks

nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").
DCLM 7B (apple/DCLM-7B)

DCLM 7B Parameters and Internals

Model Type 
Decoder-only Transformer language model
Use Cases 
Limitations:
Exhibits biases present in its training data, Performance on tasks not in the evaluation suite may vary, Limited to training data cutoff date
Additional Notes 
The model has not undergone specific alignment or safety fine-tuning.
Supported Languages 
English (Primarily)
Training Details 
Data Sources:
DCLM-BASELINE, StarCoder, ProofPile2
Data Volume:
4.1T tokens
Context Length:
2048
Hardware Used:
H100 GPUs
Model Architecture:
Decoder-only Transformer
Input Output 
Input Format:
Tokenizer inputs
Accepted Modalities:
text
Output Format:
Generated text
LLM NameDCLM 7B
Repository ๐Ÿค—https://huggingface.co/apple/DCLM-7B 
Model Size7b
Required VRAM27.7 GB
Updated2025-02-22
Maintainerapple
Model Typeopenlm
Model Files  4.9 GB: 1-of-6   4.9 GB: 2-of-6   4.9 GB: 3-of-6   4.9 GB: 4-of-6   4.9 GB: 5-of-6   3.2 GB: 6-of-6
Model ArchitectureOpenLMModel
Licenseapple-ascl
Transformers Version4.38.2
Tokenizer ClassGPTNeoXTokenizer
Vocabulary Size50432
Torch Data Typefloat32

Rank the DCLM 7B Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 43470 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227