Llama3 1.3B Gptneox Init by emozilla


  Arxiv:1910.09700   Autotrain compatible   Endpoints compatible   Llama   Region:us   Safetensors


Llama3 1.3B Gptneox Init Parameters and Internals

LLM Name: Llama3 1.3B Gptneox Init
Repository: https://huggingface.co/emozilla/llama3-1.3b-gptneox-init
Model Size: 1.3b
Required VRAM: 2.6 GB
Updated: 2025-02-22
Maintainer: emozilla
Model Type: llama
Model Files: 2.6 GB
Model Architecture: LlamaForCausalLM
Context Length: 2048
Model Max Length: 2048
Transformers Version: 4.40.1
Vocabulary Size: 128256
Torch Data Type: bfloat16
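
The Required VRAM and Model Files figures above are consistent with a simple weights-only estimate: parameter count times bytes per parameter. The sketch below illustrates that arithmetic, assuming "1.3b" means roughly 1.3 billion parameters and that GB means 10^9 bytes. The function name and the dtype table are hypothetical helpers written for this illustration, not part of any library, and the estimate ignores activations, KV cache, and framework overhead.

```python
# Bytes used to store one parameter for a few common dtypes.
# bfloat16 and float16 are 16-bit formats (2 bytes); float32 is 4 bytes.
BYTES_PER_PARAM = {
    "float32": 4,
    "bfloat16": 2,
    "float16": 2,
}


def estimate_weight_memory_gb(num_params: float, dtype: str = "bfloat16") -> float:
    """Rough weights-only memory estimate in GB (1 GB = 1e9 bytes).

    Hypothetical helper: covers model weights only, not activations,
    KV cache, or runtime overhead.
    """
    return num_params * BYTES_PER_PARAM[dtype] / 1e9


if __name__ == "__main__":
    # Assumption: "1.3b" in the listing means about 1.3e9 parameters.
    print(f"{estimate_weight_memory_gb(1.3e9, 'bfloat16'):.1f} GB")
```

Under the same assumptions, the float32 estimate comes to twice the bfloat16 figure, which may explain why one 1.3B entry in the alternatives list shows 5.4 GB rather than 2.7 GB.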

Best Alternatives to Llama3 1.3B Gptneox Init

Best Alternatives | Context / RAM | Downloads, Likes
Llama 1.3B 32K | 32K / 2.6 GB | 1752
Llama 1.3B 16K | 16K / 2.6 GB | 1250
Deepseek Coder 1.3B Instruct | 16K / 2.7 GB | 64461113
...c Deepseek Coder 1.3B Instruct | 16K / 5.4 GB | 740
Llm4decompile 1.3B V2 | 16K / 2.7 GB | 6748
CursorCore DS 1.3B SR | 16K / 2.7 GB | 1630
CursorCore DS 1.3B LC | 16K / 2.7 GB | 1620
CursorCore DS 1.3B | 16K / 2.7 GB | 1610
Speechless Coder Ds 1.3B | 16K / 2.7 GB | 18680
Deepseek Coder 1.3B Base | 16K / 2.7 GB | 6689383

Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227