Machroom 3B Model Stock by DreadPoor

 ยป  All LLMs  ยป  DreadPoor  ยป  Machroom 3B Model Stock   URL Share it on

  Merged Model   Arxiv:2403.19522   Autotrain compatible Base model:jeiku/alpaca 128 st... Base model:jeiku/bluemoon clea... Base model:jeiku/everything v3... Base model:jeiku/gnosis 256 st... Base model:jeiku/limarp stable... Base model:jeiku/no robots alp... Base model:jeiku/pippa 128 sta... Base model:jeiku/rpgpt stablel... Base model:jeiku/theory of min... Base model:jeiku/theory of min... Base model:jeiku/toxic dpo sta...   Conversational   En   Endpoints compatible   Model-index   Region:us   Safetensors   Sharded   Stablelm   Tensorflow

Machroom 3B Model Stock Benchmarks

nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").

Machroom 3B Model Stock Parameters and Internals

LLM NameMachroom 3B Model Stock
Repository ๐Ÿค—https://huggingface.co/DreadPoor/Machroom-3B-model_stock 
Base Model(s)  MarshmaToon 3B Model Stock   jeiku/Everything_v3_128_StableLM   MarshmaToon 3B Model Stock   Toxic DPO StableLM   MarshmaToon 3B Model Stock   jeiku/LimaRP_StableLM   MarshmaToon 3B Model Stock   MarshmaToon 3B Model Stock   Theory Of Mind 128 StableLM   MarshmaToon 3B Model Stock   Bluemoon Cleaned StableLM   MarshmaToon 3B Model Stock   jeiku/PIPPA_128_StableLM   MarshmaToon 3B Model Stock   jeiku/Gnosis_256_StableLM   MarshmaToon 3B Model Stock   Theory Of Mind RP 128 StableLM   MarshmaToon 3B Model Stock   jeiku/RPGPT_StableLM   MarshmaToon 3B Model Stock   jeiku/Alpaca_128_StableLM   MarshmaToon 3B Model Stock   No Robots Alpaca StableLM   DreadPoor/MarshmaToon-3B-model_stock   jeiku/Everything_v3_128_StableLM   DreadPoor/MarshmaToon-3B-model_stock   jeiku/Toxic_DPO_StableLM   DreadPoor/MarshmaToon-3B-model_stock   jeiku/LimaRP_StableLM   DreadPoor/MarshmaToon-3B-model_stock   DreadPoor/MarshmaToon-3B-model_stock   jeiku/Theory_of_Mind_128_StableLM   DreadPoor/MarshmaToon-3B-model_stock   jeiku/Bluemoon_cleaned_StableLM   DreadPoor/MarshmaToon-3B-model_stock   jeiku/PIPPA_128_StableLM   DreadPoor/MarshmaToon-3B-model_stock   jeiku/Gnosis_256_StableLM   DreadPoor/MarshmaToon-3B-model_stock   jeiku/Theory_of_Mind_RP_128_StableLM   DreadPoor/MarshmaToon-3B-model_stock   jeiku/RPGPT_StableLM   DreadPoor/MarshmaToon-3B-model_stock   jeiku/Alpaca_128_StableLM   DreadPoor/MarshmaToon-3B-model_stock   jeiku/No_Robots_Alpaca_StableLM
Merged ModelYes
Model Size3b
Required VRAM5.6 GB
Updated2024-08-12
MaintainerDreadPoor
Model Typestablelm
Model Files  5.0 GB: 1-of-2   0.6 GB: 2-of-2
Supported Languagesen
Model ArchitectureStableLmForCausalLM
Licenseapache-2.0
Context Length4096
Model Max Length4096
Transformers Version4.41.0
Tokenizer ClassGPTNeoXTokenizer
Padding Token<|endoftext|>
Vocabulary Size50304
Torch Data Typefloat16
Machroom 3B Model Stock (DreadPoor/Machroom-3B-model_stock)

Best Alternatives to Machroom 3B Model Stock

Best Alternatives
Context / RAM
Downloads
Likes
...t 3B Mix Spider Bird 200 Steps16K / 5.6 GB60
... Instruct 3B Spider 3500 Steps16K / 11.2 GB50
Stablelm 3B 4e1t4K / 5.6 GB109527306
Stablelm Zephyr 3B4K / 5.6 GB7984244
ReMask 3B4K / 11.2 GB7914
Stablelm 4e1t 2B V0.14K / 4 GB50
Rocket 3B4K / 5.6 GB166881
Zephyr Sumbot All Songs4K / 5.6 GB91
Canvers Slm Ov V14K / 2.8 GB60
Ft Stablelm Zephyr 3B4K / 2.5 GB50
Note: green Score (e.g. "73.2") means that the model is better than DreadPoor/Machroom-3B-model_stock.

Rank the Machroom 3B Model Stock Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 36026 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v2024072803