Jetmoe 8B Sft by jetmoe

 ยป  All LLMs  ยป  jetmoe  ยป  Jetmoe 8B Sft   URL Share it on

  Arxiv:2404.07413   Alignment-handbook   Autotrain compatible   Base model:jetmoe/jetmoe-8b   Conversational Dataset:huggingfaceh4/airoboro...   Dataset:huggingfaceh4/capybara Dataset:huggingfaceh4/code-fee... Dataset:huggingfaceh4/orca-mat... Dataset:huggingfaceh4/systemch... Dataset:huggingfaceh4/ultracha...   Endpoints compatible   Generated from trainer   Jetmoe   License:apache-2.0   Region:us   Safetensors   Sharded   Tensorflow

Rank the Jetmoe 8B Sft Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  
Jetmoe 8B Sft (jetmoe/jetmoe-8b-sft)

Best Alternatives to Jetmoe 8B Sft

Best Alternatives
HF Rank
Context/RAM
Downloads
Likes
Jetmoe 8B0K / 17 GB2114245
Jetmoe 8B Chat0K / 17 GB429

Jetmoe 8B Sft Parameters and Internals

LLM NameJetmoe 8B Sft
RepositoryOpen on ๐Ÿค— 
Base Model(s)  Jetmoe 8B   jetmoe/jetmoe-8b
Model Size8b
Required VRAM17 GB
Updated2024-07-01
Maintainerjetmoe
Model Typejetmoe
Model Files  4.9 GB: 1-of-4   4.9 GB: 2-of-4   4.9 GB: 3-of-4   2.3 GB: 4-of-4
Model ArchitectureJetMoEForCausalLM
Licenseapache-2.0
Model Max Length4096
Tokenizer ClassLlamaTokenizer
Padding Token</s>
Vocabulary Size32000
Activation Functionsilu
Layer Norm Epsilon1.0E-5

What open-source LLMs or SLMs are you in search of? 34238 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v2024042801