Tau 1.8B by M4-ai

 ยป  All LLMs  ยป  M4-ai  ยป  Tau 1.8B   URL Share it on

  Autotrain compatible   Conversational Dataset:locutusque/ultratextbo...   En   Endpoints compatible   Model-index   Qwen2   Region:us   Safetensors   Zh
Model Card on HF ๐Ÿค—: https://huggingface.co/M4-ai/tau-1.8B 

Tau 1.8B Benchmarks

Tau 1.8B (M4-ai/tau-1.8B)

Tau 1.8B Parameters and Internals

Model Type 
Language Model
Use Cases 
Areas:
Research, Educational technology
Applications:
Machine learning, Mathematics, Coding, Educational question answering, Text summarization, Content generation for educational purposes, Code understanding and generation, Mathematical problem solving
Limitations:
It is essential to note that the model may still exhibit biases or inaccuracies present in the training data.
Training Details 
Data Sources:
UltraTextbooks-2.0
Methodology:
Further pre-training of Qwen1.5-1.8B on UltraTextbooks-2.0.
Responsible Ai Considerations 
Fairness:
Users should be aware of these potential limitations and use the model responsibly. The model should not be used for tasks that could cause harm or discriminate against individuals or groups.
Mitigation Strategies:
Users are encouraged to critically evaluate the model's outputs and report any issues to facilitate continuous improvement.
LLM NameTau 1.8B
Repository ๐Ÿค—https://huggingface.co/M4-ai/tau-1.8B 
Model Size1.8b
Required VRAM3.7 GB
Updated2025-02-05
MaintainerM4-ai
Model Typeqwen2
Model Files  3.7 GB
Supported Languagesen zh
Model ArchitectureQwen2ForCausalLM
Licenseother
Context Length32768
Model Max Length32768
Transformers Version4.38.2
Tokenizer ClassQwen2Tokenizer
Padding Token<|endoftext|>
Vocabulary Size151936
Torch Data Typebfloat16
Errorsreplace

Best Alternatives to Tau 1.8B

Best Alternatives
Context / RAM
Downloads
Likes
Qwen1.5 1.8B32K / 3.7 GB14603846
Qwen1.5 1.8B Chat32K / 3.7 GB1121448
MiniPLM Qwen 200M32K / 0.8 GB2690
MiniPLM Qwen 500M32K / 1.9 GB1805
Orca 2.0 Tau 1.8B32K / 3.7 GB5499
Qwen1.5 1.8B Seed Sft32K / 3.7 GB1160
Neural Chat Mini V2.2 1.8B32K / 3.7 GB1485
Qwen1.5 Wukong 1.8B32K / 3.7 GB1404
SmartQwen1.5 1.8B Orpo V132K / 3.7 GB70
Hercules Mini 1.8B32K / 3.7 GB1647
Note: green Score (e.g. "73.2") means that the model is better than M4-ai/tau-1.8B.

Rank the Tau 1.8B Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 42577 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227