Falcon 40B Instruct 8bit By ichitaka: Benchmarks, Features and Detailed Analysis. Insights on Falcon 40B Instruct 8bit.

Arxiv:1911.02150 Arxiv:2005.14165 Arxiv:2104.09864 Arxiv:2205.14135 8-bit 8bit Autotrain compatible Custom code Dataset:tiiuae/falcon-refinedw... En Instruct Pytorch Quantized Refinedweb Region:us Sharded

Falcon 40B Instruct 8bit Benchmarks

LLME Score: 0.11747

^nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").

What is the LLM Explorer Rank (Score)

Falcon 40B Instruct 8bit Parameters and Internals

Model Type

Causal decoder-only

Use Cases

Areas:

Chatbots, Instruction-following systems

Primary Use Cases:

Ready-to-use chat/instruct model based on Falcon-40B

Limitations:

Mostly trained on English data, may not generalize to other languages

Considerations:

Ensure appropriate guardrails are in place for production systems.

Additional Notes

This model is a 40B parameter instruction model designed for use with GPUs using bitsandbytes for quantization.

Supported Languages

English (primary), French (secondary)

Training Details

Data Sources:

Baize, RefinedWeb

Data Volume:

150M tokens

Methodology:

Finetuned on a mixture of data

Context Length:

2048

Hardware Used:

64 A100 40GB GPUs

Model Architecture:

Causal decoder-only model with rotary positional embeddings and FlashAttention, multiquery attention, optimized MLP layer with single layer norm, parallel attention/MLP.

Safety Evaluation

Risk Categories:

Bias - related to language; stereotypes from online data

Ethical Considerations:

Users should implement guardrails and assess risks for production use.

Responsible Ai Considerations

Fairness:	Model primarily trained on English data, which may introduce bias and stereotypes.
Mitigation Strategies:	Developers should consider precautions for production deployment.

Input Output

Accepted Modalities:

text

Performance Tips:

Recommended for use on systems with adequate GPU resources.

LLM Name	Falcon 40B Instruct 8bit
Repository 🤗	https://huggingface.co/ichitaka/falcon-40b-instruct-8bit
Base Model(s)	Medfalcon 40B Lora nmitchko/medfalcon-40b-lora
Model Size	40b
Required VRAM	41.8 GB
Updated	2024-11-12
Maintainer	ichitaka
Model Type	RefinedWeb
Instruction-Based	Yes
Model Files	10.0 GB: 1-of-5 9.8 GB: 2-of-5 9.9 GB: 3-of-5 9.8 GB: 4-of-5 2.3 GB: 5-of-5
Supported Languages	en
Quantization Type	8bit
Model Architecture	RWForCausalLM
License	apache-2.0
Model Max Length	2048
Transformers Version	4.30.0.dev0
Is Biased	0
Tokenizer Class	PreTrainedTokenizerFast
Vocabulary Size	65024
Torch Data Type	float16

Falcon 40B Instruct 8bit (ichitaka/falcon-40b-instruct-8bit)

Best Alternatives to Falcon 40B Instruct 8bit

Best Alternatives	Context / RAM	Downloads	Likes
Falcon 40B Instruct GPTQ	0K / 22.5 GB	106	198
...alcon 40B Instruct W4 G128 AWQ	0K / 22.3 GB	26	2
Falcon 40B Instruct GPTQ	0K / 22.5 GB	14	1
H2ogpt Oig Oasst1 Falcon 40B	0K / 82.5 GB	27	6
...truct GPTQ Inference Endpoints	0K / 22.5 GB	19	2

Note: green Score (e.g. "73.2") means that the model is better than ichitaka/falcon-40b-instruct-8bit.

Rank the Falcon 40B Instruct 8bit Capabilities

🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟

Instruction Following and Task Automation
Factuality and Completeness of Knowledge
Censorship and Alignment
Data Analysis and Insight Generation
Text Generation
Text Summarization and Feature Extraction
Code Generation
Multi-Language Support and Translation

What open-source LLMs or SLMs are you in search of? 37901 in total.

Email us: info@extractum.io. Our Privacy Policy | Terms and Conditions | Suggest an improvement.

Our Social Media →

Original data from HuggingFace, OpenCompass and various public git repos.

Release v20241110

Support LLM Explorer

Falcon 40B Instruct 8bit by ichitaka

» All LLMs » ichitaka » Falcon 40B Instruct 8bit URL Share it on

Falcon 40B Instruct 8bit Benchmarks

Falcon 40B Instruct 8bit Parameters and Internals

Best Alternatives to Falcon 40B Instruct 8bit

Rank the Falcon 40B Instruct 8bit Capabilities

What open-source LLMs or SLMs are you in search of? 37901 in total.