Model Type | Causal decoder-only transformer language model |
|
Use Cases |
Areas: | Research, Commercial applications |
|
Applications: | |
Primary Use Cases: | Chat assistant applications |
|
Limitations: | Model outputs may be unpredictable, inaccurate, biased, or objectionable. |
|
Considerations: | Perform application-specific safety testing before deployment. |
|
|
Additional Notes | Embeddings are padded to a multiple of 128 for sharded-inference compatibility. |
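The padding note above can be sketched as a simple round-up of the vocabulary dimension; the function name and the example sizes here are illustrative, not taken from this model card:

```python
# Sketch: round the embedding-matrix row count up to a multiple of 128
# so the vocabulary dimension shards evenly across tensor-parallel devices.
def padded_vocab_size(vocab_size: int, pad_to: int = 128) -> int:
    """Round vocab_size up to the nearest multiple of pad_to."""
    return ((vocab_size + pad_to - 1) // pad_to) * pad_to

# Example: a vocabulary of 32,007 tokens pads up to 32,128 (251 * 128).
print(padded_vocab_size(32007))
```

Rows added by padding correspond to no real token, so they are simply never indexed at inference time.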
|
Supported Languages | en (full), de (limited), es (limited), fr (limited), it (limited), pt (limited), pl (limited), nl (limited), ro (limited), cs (limited), sv (limited) |
|
Training Details |
Data Sources: | rombodawg/LosslessMegaCodeTrainingV2_1m_Evol_Uncensored, OpenAssistant/oasst1, shahules786/orca-best, argilla/databricks-dolly-15k-curated-multilingual |
|
Methodology: | Fine-tuned in two stages: first on synthetic instructions and coding tasks, then on top-ranked human demonstrations. |
|
Context Length: | |
Hardware Used: | Compute provided by EPFL's Machine Learning and Optimization Laboratory and Natural Language Processing Lab |
|
Model Architecture: | Causal decoder-only transformer architecture |
|
|
Responsible AI Considerations |
Fairness: | Testing has been conducted mainly in English; outputs may be unpredictable in other languages or scenarios. |
|
Transparency: | Documented training processes and datasets. |
|
Accountability: | Open-Assistant development team is accountable for model outputs. |
|
Mitigation Strategies: | Developers should perform safety testing and tuning specific to their applications. |
|
|
Input Output |
Input Format: | Prompt dialogue template following OpenAI's chatml format. |
|
Accepted Modalities: | Text |
Output Format: | Text |
Performance Tips: | Use the official Llama2 system message for improved inference. |
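The chatml dialogue template named above can be sketched as follows; the role names follow OpenAI's chatml convention, while the helper name and the system-message text are placeholders, not the official Llama2 system message:

```python
# Sketch of a chatml prompt builder (helper name and messages are illustrative).
def to_chatml(messages):
    """Render a list of {role, content} dicts as a chatml prompt string."""
    parts = [f"<|im_start|>{m['role']}\n{m['content']}<|im_end|>" for m in messages]
    # A trailing assistant header cues the model to generate its reply.
    parts.append("<|im_start|>assistant\n")
    return "\n".join(parts)

prompt = to_chatml([
    {"role": "system", "content": "You are a helpful assistant."},  # placeholder system message
    {"role": "user", "content": "Hello!"},
])
print(prompt)
```

Each turn is wrapped in `<|im_start|>role ... <|im_end|>` markers, and generation stops when the model emits the next `<|im_end|>` token.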
|
|