Amber by LLM360

Tags: Arxiv:2312.06550, Autotrain compatible, En, Endpoints compatible, Llama, Region:us, Safetensors, Sharded, Tensorflow
Model Card on HF 🤗: https://huggingface.co/LLM360/Amber

Amber Parameters and Internals

Model Type 
Language model, text generation
Use Cases 
Areas:
Research, Commercial applications
Limitations:
Not a SOTA model
Considerations:
Amber is released to make LLM training knowledge accessible to all.
Additional Notes 
360 checkpoints available. To download other checkpoints, change the branch from 'main' to the checkpoint you want.
Supported Languages 
English (NLP)
Training Details 
Data Sources:
Arxiv, Book, C4, Refined-Web, StarCoder, StackExchange, Wikipedia
Data Volume:
1259.13 Billion tokens
Methodology:
Same architecture as LLaMA
Context Length:
2048
Model Architecture:
LLaMA architecture
Input Output 
Accepted Modalities:
text
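
The note about switching branches to reach other checkpoints can be sketched in Python. This is a minimal sketch, not the maintainers' documented procedure: the branch naming pattern `ckpt_NNN` is an assumption (check the repository's branch list for the real names), and `checkpoint_revision` and `load_amber` are hypothetical helper names. The `revision` argument of `from_pretrained` is the standard Transformers way to select a branch.

```python
REPO_ID = "LLM360/Amber"  # repository named in the listing


def checkpoint_revision(index=None):
    """Return the branch name to pass as `revision`.

    ASSUMPTION: checkpoint branches are named like "ckpt_005".
    Verify against the repository's branch list before relying on it.
    """
    if index is None:
        return "main"  # default branch, as stated in the listing
    if index < 0:
        raise ValueError("checkpoint index must be non-negative")
    return f"ckpt_{index:03d}"


def load_amber(index=None):
    """Download the tokenizer and weights for one checkpoint (about 13.5 GB)."""
    # Imported lazily so checkpoint_revision() works without transformers installed.
    from transformers import LlamaForCausalLM, LlamaTokenizer

    revision = checkpoint_revision(index)
    tokenizer = LlamaTokenizer.from_pretrained(REPO_ID, revision=revision)
    model = LlamaForCausalLM.from_pretrained(REPO_ID, revision=revision)
    return tokenizer, model


print(checkpoint_revision())   # main
print(checkpoint_revision(5))  # ckpt_005
```

Calling `load_amber()` with no argument fetches the `main` branch; passing an index fetches the corresponding intermediate checkpoint, provided the assumed branch name exists.
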
LLM Name: Amber
Repository 🤗: https://huggingface.co/LLM360/Amber
Model Size: 6.7b
Required VRAM: 13.5 GB
Updated: 2025-02-22
Maintainer: LLM360
Model Type: llama
Model Files: 4.9 GB (1-of-3), 5.0 GB (2-of-3), 3.6 GB (3-of-3)
Supported Languages: en
Model Architecture: LlamaForCausalLM
License: apache-2.0
Context Length: 2048
Model Max Length: 2048
Transformers Version: 4.35.2
Tokenizer Class: LlamaTokenizer
Vocabulary Size: 32000
Torch Data Type: bfloat16
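
The Required VRAM figure can be cross-checked against the other listed values. The sketch below sums the three model file sizes and separately estimates weight memory from the model size and data type. It assumes 1 GB = 10^9 bytes and counts weights only (no activations or KV cache); the function names are illustrative.

```python
# Values copied from the listing above.
SHARD_SIZES_GB = [4.9, 5.0, 3.6]  # Model Files: 1-of-3, 2-of-3, 3-of-3
PARAMS_BILLIONS = 6.7             # Model Size
BYTES_PER_PARAM = 2               # bfloat16 stores each weight in 16 bits


def shard_total_gb(shards):
    """Total size of the sharded weight files, rounded to 0.1 GB."""
    return round(sum(shards), 1)


def weight_memory_gb(params_billions, bytes_per_param):
    """Weights-only memory estimate, assuming 1 GB = 1e9 bytes."""
    return round(params_billions * bytes_per_param, 1)


print(shard_total_gb(SHARD_SIZES_GB))                      # 13.5
print(weight_memory_gb(PARAMS_BILLIONS, BYTES_PER_PARAM))  # 13.4
```

The shard total matches the listed 13.5 GB exactly, and the parameter-based estimate lands within 0.1 GB of it, which is expected because 6.7b is itself a rounded parameter count.
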

Quantized Models of Amber

Model         Likes, Downloads   VRAM
Amber AWQ     180                3 GB
Amber GGUF    143                2 GB
Amber GPTQ    120                3 GB

Best Alternatives to Amber

Best Alternatives                          Context / RAM     Downloads, Likes
Gptnoise37                                 128K / 13.5 GB    670
Phi 4 RRStock                              16K / 7.7 GB      660
...s Coder6.7b Reflct Adamw Iter2          16K / 13.5 GB     4800
...s Coder6.7b Reflct Adamw Iter1          16K / 13.5 GB     4750
...ir8 Ds Coder6.7b Rmsprop Iter4          16K / 13.5 GB     1830
...s Coder6.7b Reflct Adamw Iter3          16K / 13.5 GB     4300
...ir8 Ds Coder6.7b Rmsprop Iter2          16K / 13.5 GB     1830
Ds Coder6.7b Adamw Iter5                   16K / 13.5 GB     4050
...ir8 Ds Coder6.7b Rmsprop Iter3          16K / 13.5 GB     1650
...s Coder6.7b Reflct Adamw Iter4          16K / 13.5 GB     2730

Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227