Open Llama 3B V2 by openlm-research

 ยป  All LLMs  ยป  openlm-research  ยป  Open Llama 3B V2   URL Share it on

  Autotrain compatible   Dataset:bigcode/starcoderdata Dataset:tiiuae/falcon-refinedw... Dataset:togethercomputer/redpa...   Endpoints compatible   Llama   Pytorch   Region:us

Open Llama 3b V2 Benchmarks

Open Llama 3B V2 (openlm-research/open_llama_3b_v2)

Open Llama 3B V2 Parameters and Internals

Model Type 
large language model
Additional Notes 
Please note that it is advised to avoid using the Hugging Face fast tokenizer for now, as it sometimes gives incorrect tokenizations. This can be avoided by using 'use_fast=False'.
Training Details 
Data Sources:
tiiuae/falcon-refinedweb, bigcode/starcoderdata, togethercomputer/RedPajama-Data-1T
Data Volume:
1 trillion tokens
Methodology:
Pre-trained with open datasets rather than the original LLaMA dataset, using the EasyLM framework.
Hardware Used:
cloud TPU-v4s
LLM NameOpen Llama 3b V2
Repository ๐Ÿค—https://huggingface.co/openlm-research/open_llama_3b_v2 
Model Size3b
Required VRAM6.8 GB
Updated2025-02-05
Maintaineropenlm-research
Model Typellama
Model Files  6.8 GB
Model ArchitectureLlamaForCausalLM
Licenseapache-2.0
Context Length2048
Model Max Length2048
Transformers Version4.31.0.dev0
Tokenizer ClassLlamaTokenizer
Beginning of Sentence Token<s>
End of Sentence Token</s>
Unk Token<unk>
Vocabulary Size32000
Torch Data Typefloat16

Quantized Models of the Open Llama 3B V2

Model
Likes
Downloads
VRAM
...3b V2 Python Instruct 0.1 4bit052 GB

Best Alternatives to Open Llama 3B V2

Best Alternatives
Context / RAM
Downloads
Likes
Llama 3.2 3B Instruct128K / 6.5 GB1500707951
Llama 3.2 3B128K / 6.5 GB337713484
Hermes 3 Llama 3.2 3B128K / 6.5 GB13305135
Llama 3.2 3B Bespoke Thought128K / 6.4 GB303
Dolphin3.0 Llama3.2 3B128K / 6.5 GB1262831
Calme 3.1 Llamaloi 3B128K / 10.6 GB29181
Llama 3.2 3B Instruct128K / 6.4 GB38007047
Orca Mini V9 5 3B Instruct128K / 6.5 GB2486
Llama 3.2 3B Instruct128K / 6.5 GB5791072
Llasa 3B128K / 13.6 GB52715
Note: green Score (e.g. "73.2") means that the model is better than openlm-research/open_llama_3b_v2.

Rank the Open Llama 3B V2 Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 42577 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227