Bloomz 560M Sft Chat by cmarkea

  Arxiv:2001.09977   Arxiv:2012.15613   Autotrain compatible   Bloom Dataset:ehartford/wizard vicun...   Dataset:laion/oig   Dataset:shahules786/orca-chat Dataset:timdettmers/openassist...   En   Endpoints compatible   Fr   Pytorch   Region:us   Safetensors

Bloomz 560M Sft Chat Benchmarks

Bloomz 560M Sft Chat Parameters and Internals

Model Type 
text generation
Use Cases 
Areas:
research, commercial applications
Primary Use Cases:
chatbot applications
Limitations:
Performance is not guaranteed in languages other than French and English; multilingual capabilities may be reduced by the conversion of the weights from float16 to bfloat16.
Considerations:
It's been tuned in a chatbot context and performs well on zero-shot tasks in English and French.
Additional Notes 
Performance degradation anticipated in non-supported languages.
Supported Languages 
French (High proficiency), English (High proficiency)
Training Details 
Data Sources:
ehartford/wizard_vicuna_70k_unfiltered, shahules786/orca-chat, timdettmers/openassistant-guanaco, laion/OIG
Data Volume:
0.9 billion tokens
Methodology:
fine-tuned for chatbot applications from BigScience BLOOMZ-560m
Training Time:
41 hours
Hardware Used:
1 x A100 40GB
Model Architecture:
Transposition from float16 to bfloat16
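The precision trade-off behind the float16 to bfloat16 transposition can be illustrated with a minimal sketch. It uses only the Python standard library and emulates both formats by rounding bit patterns; the helper names `to_float16` and `to_bfloat16` are hypothetical and are not part of the model's tooling. The sketch assumes finite inputs within float16 range and is meant only to show that bfloat16 keeps fewer mantissa bits (7 versus 10), so some float16 values cannot be represented exactly after conversion.

```python
import struct


def to_float16(x: float) -> float:
    """Round x to IEEE half precision (5 exponent bits, 10 mantissa bits)."""
    return struct.unpack("<e", struct.pack("<e", x))[0]


def to_bfloat16(x: float) -> float:
    """Round x to bfloat16 (8 exponent bits, 7 mantissa bits).

    bfloat16 is the top 16 bits of a float32, so we round the float32
    bit pattern to nearest (ties to even) and zero the low 16 bits.
    Assumes a finite input; NaN and infinity are not handled.
    """
    bits = struct.unpack("<I", struct.pack("<f", x))[0]
    bits += 0x7FFF + ((bits >> 16) & 1)  # round to nearest, ties to even
    bits &= 0xFFFF0000                   # drop the low 16 mantissa bits
    return struct.unpack("<f", struct.pack("<I", bits))[0]


# A value that float16 stores exactly but bfloat16 cannot.
weight = 1.0 + 2 ** -8
as_fp16 = to_float16(weight)
as_bf16 = to_bfloat16(as_fp16)
print(f"float16: {as_fp16!r}  bfloat16: {as_bf16!r}")
```

In exchange for the coarser mantissa, bfloat16 keeps the 8-bit exponent of float32, which gives it a much wider dynamic range than float16.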
Input Output 
Input Format:
Individual's prompt preceded by the EOS token (</s>).
Accepted Modalities:
text
Output Format:
Generated responses begin with BOS token (<s>).
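The input and output format above can be sketched in Python as follows. This is an illustration under stated assumptions, not the maintainer's reference code: the helpers `build_prompt` and `generate` are hypothetical names, and appending `<s>` to the prompt to cue the response is an assumption drawn from the statement that generated responses begin with the BOS token. The `generate` function relies on the Hugging Face `transformers` `pipeline` API and downloads the model when first called.

```python
EOS_TOKEN = "</s>"  # precedes the user's prompt
BOS_TOKEN = "<s>"   # marks the start of the generated response


def build_prompt(user_message: str) -> str:
    """Wrap a single user message in the chat markers described above."""
    return f"{EOS_TOKEN}{user_message}{BOS_TOKEN}"


def generate(user_message: str, max_new_tokens: int = 128) -> str:
    """Run the model on one user message (requires `transformers` and a download)."""
    from transformers import pipeline  # imported lazily so build_prompt has no dependencies

    generator = pipeline("text-generation", model="cmarkea/bloomz-560m-sft-chat")
    outputs = generator(build_prompt(user_message), max_new_tokens=max_new_tokens)
    return outputs[0]["generated_text"]


print(build_prompt("C'est quoi le deep learning ?"))
```

Calling `generate("C'est quoi le deep learning ?")` would return the prompt followed by the model's continuation; the value of `max_new_tokens` here is an arbitrary choice for illustration.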
Release Notes 
Version:
2023
Date:
October 12, 2023
Notes:
Release of bloomz-560m-sft-chat model, optimized for chat and instruction-based tasks in French and English.
LLM Name: Bloomz 560M Sft Chat
Repository: https://huggingface.co/cmarkea/bloomz-560m-sft-chat
Model Size: 560m
Required VRAM: 1.1 GB
Updated: 2024-12-04
Maintainer: cmarkea
Model Type: bloom
Model Files: 1.1 GB, 1.1 GB
Supported Languages: fr, en
Model Architecture: BloomForCausalLM
License: bigscience-bloom-rail-1.0
Transformers Version: 4.31.0
Tokenizer Class: BloomTokenizer
Padding Token: <pad>
Vocabulary Size: 250880
Torch Data Type: bfloat16
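The listed VRAM figure is consistent with a back-of-the-envelope estimate from the model size and data type: about 560 million parameters at 2 bytes each in bfloat16. A minimal sketch of that arithmetic follows; the function name `weight_memory_gb` is hypothetical, the parameter count is the nominal "560m" from the listing rather than an exact count, and the estimate covers weights only (activations and the generation cache are not included).

```python
# Bytes needed to store one parameter in each data type.
BYTES_PER_PARAM = {"float32": 4, "float16": 2, "bfloat16": 2}


def weight_memory_gb(n_params: int, dtype: str) -> float:
    """Estimate the memory taken by model weights alone, in gigabytes (1e9 bytes)."""
    return n_params * BYTES_PER_PARAM[dtype] / 1e9


nominal_params = 560_000_000  # nominal size taken from "Model Size: 560m"
estimate = weight_memory_gb(nominal_params, "bfloat16")
print(f"{estimate:.2f} GB")
```

The same function shows why loading the weights in float32 would need roughly twice as much memory.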

Best Alternatives to Bloomz 560M Sft Chat

Model | Context / RAM | Downloads and Likes
Train Test Bloom560 | 0K / 2.2 GB | 80
Bloomz 560M | 0K / 1.1 GB | 14897077109
Promt Generator | 0K / 2.2 GB | 117017
Train Test | 0K / 2.2 GB | 300
Product Description Fr | 0K / 2.2 GB | 100
Guitester | 0K / 2.2 GB | 70
ModeloAJustadoBloom1 | 0K / 2.2 GB | 60
Bloom 560M RLHF V2 | 0K / 1.1 GB | 14283
Bloom 560M RLHF | 0K / 1.1 GB | 14281
Bloom VMLU A Text | 0K / 0 GB | 80


Original data from HuggingFace, OpenCompass and various public git repos.