Swallow Hermes St V1 by napopoa32

Tags: arXiv:2310.04799, AutoTrain compatible, Endpoints compatible, ja, Mistral, region:us, Safetensors, Sharded, Swallow, TensorFlow

Swallow Hermes St V1 Parameters and Internals

Model Type: text generation
Use Cases:
  Areas: entertainment, story generation
  Applications: bedtime story generation
  Primary Use Cases: generating entertaining stories, creating bedtime stories
  Limitations: not optimized for factual content; may not produce the intended emotional tone
  Considerations: the model aims to provide entertaining and engaging stories to read before sleep
Additional Notes: This model is intended for creating engaging narratives, but may produce an unexpected tone.
Supported Languages: ja (Japanese)
Training Details:
  Methodology: based on the Chat Vector and EvoLLM techniques, with evolutionary strategy optimization
  Model Architecture: derived from Swallow-MS-7b-v0.1, with added vectors from Hermes-2-Pro-Mistral-7B and Mistral-7B-Instruct-v0.2-Neural-Story
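
The methodology above is only named, so here is a minimal sketch of how a chat-vector merge of this kind is typically built with Hugging Face Transformers. The repo IDs, the single donor, and the 0.8 merge ratio are illustrative assumptions; in an EvoLLM-style setup the ratios would be searched with an evolutionary strategy rather than fixed by hand.

```python
# Minimal sketch of a chat-vector merge (not the author's exact recipe).
# Repo IDs and the merge ratio are illustrative assumptions.
# Note: loading three fp16 7B checkpoints needs roughly 45 GB of CPU RAM.
import torch
from transformers import AutoModelForCausalLM

base = AutoModelForCausalLM.from_pretrained(
    "mistralai/Mistral-7B-v0.1", torch_dtype=torch.float16)              # common Mistral ancestor
target = AutoModelForCausalLM.from_pretrained(
    "tokyotech-llm/Swallow-MS-7b-v0.1", torch_dtype=torch.float16)       # Japanese continued-pretraining model
donor = AutoModelForCausalLM.from_pretrained(
    "NousResearch/Hermes-2-Pro-Mistral-7B", torch_dtype=torch.float16)   # instruction-tuned donor

ratio = 0.8  # hypothetical weight; an EvoLLM-style run would search this with an evolutionary strategy

target_sd = target.state_dict()
base_sd, donor_sd = base.state_dict(), donor.state_dict()
for name, param in target_sd.items():
    # Skip embeddings and the LM head: Swallow extends the Mistral vocabulary
    # (42,800 vs 32,000 tokens), so those tensors differ in shape and cannot be added.
    if "embed_tokens" in name or "lm_head" in name:
        continue
    chat_vector = donor_sd[name] - base_sd[name]  # "chat vector" = tuned weights minus base weights
    target_sd[name] = param + ratio * chat_vector
    # The Mistral-7B-Instruct-v0.2-Neural-Story vector would be added to the
    # same parameters in the same way, with its own ratio.

target.load_state_dict(target_sd)
target.save_pretrained("swallow-hermes-merge-sketch")
```
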
Input / Output:
  Input Format: prompts with a narrative or dialogue description
  Accepted Modalities: text
  Output Format: generated narrative response
LLM Name: Swallow Hermes St V1
Repository: https://huggingface.co/napopoa32/swallow-hermes-st-v1
Model Size: 7.3B parameters
Required VRAM: 14.6 GB
Updated: 2025-02-22
Maintainer: napopoa32
Model Type: mistral
Model Files: 4.9 GB (shard 1 of 3), 4.9 GB (shard 2 of 3), 4.8 GB (shard 3 of 3)
Supported Languages: ja
Model Architecture: MistralForCausalLM
License: apache-2.0
Context Length: 4096
Model Max Length: 4096
Transformers Version: 4.40.0.dev0
Tokenizer Class: LlamaTokenizer
Vocabulary Size: 42800
Torch Data Type: float16
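
Based on the specs above (MistralForCausalLM, float16 weights, 4096-token context), loading and prompting the model with Hugging Face Transformers should look roughly like the sketch below. The Japanese bedtime-story prompt and the sampling settings are illustrative assumptions, not values from the model card.

```python
# Rough usage sketch based on the specs above; prompt and sampling settings are illustrative.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

repo = "napopoa32/swallow-hermes-st-v1"
tokenizer = AutoTokenizer.from_pretrained(repo)  # LlamaTokenizer, 42,800-token vocabulary
model = AutoModelForCausalLM.from_pretrained(
    repo,
    torch_dtype=torch.float16,  # matches the published float16 weights (~14.6 GB VRAM)
    device_map="auto",          # requires the accelerate package
)

# Ask for a short Japanese bedtime story (the model card's primary use case).
prompt = "眠る前に読む、小さな猫が星を探しに行く短いお話を書いてください。\n"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(
    **inputs,
    max_new_tokens=512,  # stays well inside the 4096-token context
    do_sample=True,
    temperature=0.8,
    top_p=0.95,
)
# Print only the newly generated continuation, without the prompt.
print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True))
```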

Best Alternatives to Swallow Hermes St V1

Best Alternatives | Context / RAM | Downloads | Likes
Dictalm2.0 Instruct | 32K / 14.5 GB | 14558 | 19
Dictalm2.0 | 32K / 14.5 GB | 18017 | 11
Dictalm2 It Qa Fine Tune | 32K / 14.5 GB | 8347 | 3
Dictalm2.0 Instruct Fine Tuned | 32K / 14.5 GB | 5552 | 0
... Fine Tuned Alpaca Gpt4 Hebrew | 32K / 14.5 GB | 4190 | 0
Mis | 32K / 14.6 GB | 6 | 0
Misjava Api 060924 V3 Merged | 32K / 14.6 GB | 6 | 0
Quietstar 8 Ahead | 32K / 14.5 GB | 178 | 90

Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227