Boreas 10.7B Chat Sft by yhavinga

 ยป  All LLMs  ยป  yhavinga  ยป  Boreas 10.7B Chat Sft   URL Share it on

  Arxiv:1910.09700   Autotrain compatible   Conversational   Endpoints compatible   Mistral   Region:us   Safetensors   Sharded   Tensorflow

Boreas 10.7B Chat Sft Benchmarks

nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").
Boreas 10.7B Chat Sft (yhavinga/Boreas-10.7B-chat-sft)

Boreas 10.7B Chat Sft Parameters and Internals

Use Cases 
Areas:
More Information Needed
Applications:
More Information Needed
Primary Use Cases:
More Information Needed
Limitations:
More Information Needed
Considerations:
Users (both direct and downstream) should be made aware of the risks, biases, and limitations of the model.
Additional Notes 
The model card lacks detailed information on many aspects including hardware type, license, and environmental impact.
Supported Languages 
Dutch (NLP), English (NLP)
Training Details 
Data Sources:
Mistral +10B Nederlandse/Engelse tokens -> Boreas-7B, Pretrainen met 20B Nederlandse/Engelse tokens -> Boreas-10.7B-step2, Finetunen met 10B Nederlandse/Engelse chat tokens
Data Volume:
20B tokens for pretraining and 10B for fine-tuning
Methodology:
Finetuning Boreas-10.7B-step2 on a mix of Dutch and English chats and normal text
Responsible Ai Considerations 
Fairness:
Model card indicates awareness of risks, biases, and limitations.
LLM NameBoreas 10.7B Chat Sft
Repository ๐Ÿค—https://huggingface.co/yhavinga/Boreas-10.7B-chat-sft 
Model Size7b
Required VRAM21.4 GB
Updated2025-01-30
Maintaineryhavinga
Model Typemistral
Model Files  4.9 GB: 1-of-5   5.0 GB: 2-of-5   4.9 GB: 3-of-5   4.9 GB: 4-of-5   1.7 GB: 5-of-5
Model ArchitectureMistralForCausalLM
Context Length32768
Model Max Length32768
Transformers Version4.39.3
Tokenizer ClassLlamaTokenizer
Padding Token</s>
Vocabulary Size32000
Torch Data Typebfloat16

Best Alternatives to Boreas 10.7B Chat Sft

Best Alternatives
Context / RAM
Downloads
Likes
...Nemo Instruct 2407 Abliterated1000K / 24.5 GB462011
MegaBeam Mistral 7B 512K512K / 14.4 GB568150
SpydazWeb AI HumanAI RP512K / 14.4 GB121
SpydazWeb AI HumanAI 002512K / 14.4 GB181
...daz Web AI ChatML 512K Project512K / 14.5 GB120
MegaBeam Mistral 7B 300K282K / 14.4 GB563316
Hebrew Mistral 7B 200K256K / 30 GB1461915
Astral 256K 7B V2250K / 14.4 GB70
Astral 256K 7B250K / 14.4 GB50
Test001128K / 14.5 GB90
Note: green Score (e.g. "73.2") means that the model is better than yhavinga/Boreas-10.7B-chat-sft.

Rank the Boreas 10.7B Chat Sft Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 43470 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227