Llama 3.1 Storm 8B by akjindal53244

 ยป  All LLMs  ยป  akjindal53244  ยป  Llama 3.1 Storm 8B   URL Share it on

  Arxiv:1803.05457   Arxiv:2109.07958   Arxiv:2210.09261   Arxiv:2310.16049   Arxiv:2311.07911   Arxiv:2311.12022   Arxiv:2406.01574   Arxiv:2406.06623   Autotrain compatible   Axolotl   Conversational   De   Doi:10.57967/hf/2902   En   Endpoints compatible   Es   Finetuning   Fr   Function calling   Hi   Instruction following   It   Llama   Llama-3.1   Mergekit   Model-index   Pt   Reasoning   Region:us   Safetensors   Sharded   Tensorflow   Th

Llama 3.1 Storm 8B Benchmarks

nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").
Llama 3.1 Storm 8B (akjindal53244/Llama-3.1-Storm-8B)

Llama 3.1 Storm 8B Parameters and Internals

Model Type 
text-generation, conversational, instruction following, reasoning, function calling
Use Cases 
Areas:
research, commercial applications
Applications:
instruction-following, knowledge-driven QA, reasoning, truthful answer generation, function calling
Primary Use Cases:
Conversational AI, Function Calling
Additional Notes 
Model merging was done using SLERP method with Llama-Spark model.
Supported Languages 
en (Proficient), de (Proficient), fr (Proficient), it (Proficient), pt (Proficient), hi (Proficient), es (Proficient), th (Proficient)
Training Details 
Data Sources:
open-source data
Data Volume:
~2.8 million examples
Methodology:
Self-Curation and Spectrum-based targeted fine-tuning
Model Architecture:
Llama
Input Output 
Input Format:
Transformed user queries using chat-template
Accepted Modalities:
text
Output Format:
Generated text responses
Performance Tips:
Use bfloat16 model type for optimal performance.
LLM NameLlama 3.1 Storm 8B
Repository ๐Ÿค—https://huggingface.co/akjindal53244/Llama-3.1-Storm-8B 
Model Size8b
Required VRAM16.1 GB
Updated2025-03-24
Maintainerakjindal53244
Model Typellama
Model Files  5.0 GB: 1-of-4   5.0 GB: 2-of-4   4.9 GB: 3-of-4   1.2 GB: 4-of-4
Supported Languagesen de fr it pt hi es th
Model ArchitectureLlamaForCausalLM
Licensellama3.1
Context Length131072
Model Max Length131072
Transformers Version4.44.0
Tokenizer ClassPreTrainedTokenizerFast
Padding Token<|end_of_text|>
Vocabulary Size128256
Torch Data Typebfloat16

Quantized Models of the Llama 3.1 Storm 8B

Model
Likes
Downloads
VRAM
Llama 3.1 Storm 8B GGUF4028074 GB

Best Alternatives to Llama 3.1 Storm 8B

Best Alternatives
Context / RAM
Downloads
Likes
...a 3 8B Instruct Gradient 1048K1024K / 16.1 GB5272682
A61024K / 16.1 GB3860
A81024K / 16.1 GB2840
A41024K / 16.1 GB3630
A21024K / 16.1 GB3600
A181024K / 16.1 GB2720
A101024K / 16.1 GB3030
A121024K / 16.1 GB2560
A11024K / 16.1 GB3080
C311024K / 16.1 GB1830

Rank the Llama 3.1 Storm 8B Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 45494 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227