Granite 7B Base by ibm-granite


  Autotrain compatible   Endpoints compatible   Llama   Region:us   Safetensors   Sharded   Tensorflow


Granite 7B Base Parameters and Internals

Model Type 
Pre-trained LLM
Additional Notes 
In a commitment to data transparency and fostering open innovation, the data sources, sampling proportions, and URLs for access are provided.
Supported Languages 
English (primary)
Training Details 
Data Sources:
Common Crawl, Github_Clean, Wikipedia and Wikimedia, USPTO, PubMed Central, arXiv, StackExchange, PG19, Webhose
Data Volume:
2T tokens
Context Length:
4K tokens (4096 in the model configuration)
Model Architecture:
The model architecture is a replica of Meta's Llama2-7B base variant with multi-head attention (MHA), trained with a 1M batch size on 2T tokens.
Responsible AI Considerations 
Mitigation Strategies:
Without adequate safeguards and RLHF, there is a risk that these models could be misused to generate disinformation or harmful content.
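The training figures above imply a rough step count. The sketch below works through that arithmetic under one loud assumption: that the "1M batch size" is measured in tokens rather than sequences, which the page does not state. The names `training_steps` and `sequences_per_batch` are illustrative helpers, not part of any library.

```python
# Back-of-the-envelope training arithmetic for the figures quoted above.
# ASSUMPTION: "1M batch size" means 1,000,000 tokens per batch (unit not stated on the page).

TOTAL_TOKENS = 2 * 10**12   # 2T training tokens (from the page)
BATCH_TOKENS = 10**6        # 1M tokens per batch (assumed unit)
CONTEXT_LENGTH = 4096       # context length from the model configuration


def training_steps(total_tokens: int, batch_tokens: int) -> int:
    """Number of full batches needed to consume the whole token budget."""
    return total_tokens // batch_tokens


def sequences_per_batch(batch_tokens: int, context_length: int) -> int:
    """How many full-length sequences fit into one batch of tokens."""
    return batch_tokens // context_length


steps = training_steps(TOTAL_TOKENS, BATCH_TOKENS)
seqs = sequences_per_batch(BATCH_TOKENS, CONTEXT_LENGTH)
print(f"approx. optimizer steps: {steps:,}")
print(f"approx. full-length sequences per batch: {seqs}")
```

If the batch size were instead counted in sequences, the step count would be far smaller, so these numbers should be read only as an order-of-magnitude estimate.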
LLM Name: Granite 7B Base
Repository: https://huggingface.co/ibm-granite/granite-7b-base
Model Size: 7B
Required VRAM: 27.1 GB
Updated: 2025-01-15
Maintainer: ibm-granite
Model Type: llama
Model Files: 4.8 GB (1-of-6), 4.9 GB (2-of-6), 4.9 GB (3-of-6), 4.9 GB (4-of-6), 4.9 GB (5-of-6), 2.7 GB (6-of-6)
Model Architecture: LlamaForCausalLM
License: apache-2.0
Context Length: 4096
Model Max Length: 4096
Transformers Version: 4.36.2
Tokenizer Class: LlamaTokenizer
Vocabulary Size: 32000
Torch Data Type: float32
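The listed VRAM figure can be cross-checked against the shard sizes and the float32 data type. The following is a minimal sketch, assuming 1 GB = 10^9 bytes and counting weights only (activations and the attention cache are ignored). The helper names are hypothetical, and the half-precision figure is an estimate rather than a value reported on this page.

```python
# Cross-check of the weight memory footprint from the listed shard sizes.
# ASSUMPTIONS: 1 GB = 1e9 bytes; weights only, no activations or KV cache.

SHARD_SIZES_GB = [4.8, 4.9, 4.9, 4.9, 4.9, 2.7]   # the six model files listed above
BYTES_PER_PARAM = {"float32": 4, "float16": 2, "bfloat16": 2}


def total_size_gb(shards: list[float]) -> float:
    """Sum the shard sizes, rounded to one decimal like the listing."""
    return round(sum(shards), 1)


def implied_params_billions(size_gb: float, dtype: str) -> float:
    """Parameter count (in billions) implied by a checkpoint size and dtype."""
    return size_gb / BYTES_PER_PARAM[dtype]


def size_in_dtype_gb(size_gb: float, src: str, dst: str) -> float:
    """Estimated checkpoint size if the weights were cast from src to dst."""
    return size_gb * BYTES_PER_PARAM[dst] / BYTES_PER_PARAM[src]


total = total_size_gb(SHARD_SIZES_GB)
params_b = implied_params_billions(total, "float32")
half = size_in_dtype_gb(total, "float32", "float16")
print(f"total checkpoint size: {total} GB")
print(f"implied parameters: {params_b:.2f} B")
print(f"estimated float16 size: {half:.2f} GB")
```

Under these assumptions the shard total matches the 27.1 GB listed as required VRAM, and casting to a 16-bit type would roughly halve the weight footprint.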

Best Alternatives to Granite 7B Base

Model | Context / RAM | Downloads, Likes
...1M 1000000ctx AEZAKMI 3 1 1702 | 1024K / 13.5 GB | 671
... Qwen2.5llamaify 7B V23.1 200K | 195K / 15.2 GB | 30821
LlamaStock 8B | 128K / 16.1 GB | 211
SuperNeuralDreadDevil 8B | 128K / 16.1 GB | 181
Yarn Llama 2 7B 128K | 128K / 13.5 GB | 356239
LLaMA 7B PoSE YaRN 128K | 128K / 13.5 GB | 83
LLaMA 7B PoSE Linear 96K | 96K / 27 GB | 92
LLaMA 7B PoSE YaRN 96K | 96K / 13.5 GB | 111
Chat Llama2 7B 80K | 80K / 13.8 GB | 90
Llama2 7B 80K | 80K / 13.8 GB | 110

Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227