T Llama by capleaf


Tags: autotrain-compatible, en, endpoints-compatible, llama, model-index, pytorch, region:us, safetensors, sharded, tensorflow, vi
Model Card on HF 🤗: https://huggingface.co/capleaf/T-Llama


T Llama Parameters and Internals

Model Type: Llama2-7B, decoder-only
Additional Notes: The model is intended as proof that a single person can achieve state-of-the-art results by fine-tuning their own model.
Supported Languages: English (full), Vietnamese (full)

Training Details
Data Sources: BactrianX, OpenOrca_translated, WizardLM_70k_translated, TigerLabMathInstruct_translated_vi, GradeSchoolMathInstruct_translated, vilm_lima-vi, MTEngVietnamese, databricks_dolly15k_translated, AlpacaCleaned_translated, databricks_dolly15k, OpenOrca, GradeSchoolMathInstruct, AlpacaCleaned, WebglmQA
Data Volume: 120 GB (Vietnamese data)
Methodology: Fine-tuning with bilingual (Vietnamese-English) support; a sketch follows below
Training Time: ~47.5 days (approximate)
Hardware Used: GPU: NVIDIA Tesla P100 16 GB; system RAM: 32 GB
Model Architecture: Decoder-only architecture based on Llama2-7B
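
The card does not publish the actual training code, and the exact recipe (full fine-tune vs. parameter-efficient) is not stated. The sketch below shows one plausible way to fine-tune a Llama-2-7B base on bilingual instruction data with LoRA on a single 16 GB GPU; the dataset path, hyperparameters, and the choice of LoRA are illustrative assumptions, not the authors' method. Note also that T-Llama's 45,452-token vocabulary implies the tokenizer was extended for Vietnamese, a step this sketch omits.

```python
# Minimal LoRA fine-tuning sketch. The model card does not publish the authors'
# training code; dataset path, hyperparameters, and LoRA itself are assumptions.
import torch
from datasets import load_dataset
from peft import LoraConfig, get_peft_model
from transformers import (AutoModelForCausalLM, AutoTokenizer,
                          DataCollatorForLanguageModeling, Trainer, TrainingArguments)

base = "meta-llama/Llama-2-7b-hf"                        # base model per the card
tokenizer = AutoTokenizer.from_pretrained(base)
tokenizer.pad_token = tokenizer.eos_token                # Llama 2 ships without a pad token

model = AutoModelForCausalLM.from_pretrained(base, torch_dtype=torch.float16)
model = get_peft_model(model, LoraConfig(r=16, lora_alpha=32, lora_dropout=0.05,
                                         task_type="CAUSAL_LM"))

# Hypothetical bilingual instruction data: one prompt+response string per JSON line ("text" field).
dataset = load_dataset("json", data_files="bilingual_instructions.jsonl", split="train")
dataset = dataset.map(lambda ex: tokenizer(ex["text"], truncation=True, max_length=2048),
                      remove_columns=dataset.column_names)

trainer = Trainer(
    model=model,
    train_dataset=dataset,
    data_collator=DataCollatorForLanguageModeling(tokenizer, mlm=False),  # causal LM labels
    args=TrainingArguments(
        output_dir="t-llama-lora",
        per_device_train_batch_size=1,                   # sized for a single 16 GB GPU
        gradient_accumulation_steps=16,
        num_train_epochs=1,
        fp16=True,
        logging_steps=50,
    ),
)
trainer.train()
```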
LLM Name: T Llama
Repository 🤗: https://huggingface.co/capleaf/T-Llama
Model Size: 6.8b
Required VRAM: 13.8 GB
Updated: 2025-02-22
Maintainer: capleaf
Model Type: llama
Model Files: 16 shards (1-of-16 through 16-of-16; shard 2: 0.8 GB, shard 16: 0.4 GB, all other shards 0.9 GB each)
Supported Languages: vi, en
Model Architecture: LlamaForCausalLM
License: llama2
Context Length: 2048
Model Max Length: 2048
Transformers Version: 4.33.1
Tokenizer Class: LlamaTokenizer
Padding Token: [PAD]
Vocabulary Size: 45452
Torch Data Type: bfloat16
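
The checkpoint is distributed as 16 sharded files (~13.8 GB total in bfloat16), which `from_pretrained` downloads and reassembles automatically. Below is a minimal inference sketch, assuming a recent transformers install and a GPU with roughly 16 GB of memory; the Vietnamese prompt and generation settings are illustrative, and the card does not specify a required prompt template. If memory is tight, the model can also be loaded with 4-bit quantization via transformers' `BitsAndBytesConfig`, at some cost in output quality.

```python
# Minimal inference sketch for capleaf/T-Llama (illustrative, not an official example).
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

repo = "capleaf/T-Llama"
tokenizer = AutoTokenizer.from_pretrained(repo)          # LlamaTokenizer per the card
model = AutoModelForCausalLM.from_pretrained(
    repo,
    torch_dtype=torch.bfloat16,                          # matches the published checkpoint dtype
    device_map="auto",                                   # ~13.8 GB of weights; requires `accelerate`
)

prompt = "Xin chào! Bạn có thể giới thiệu ngắn gọn về Việt Nam không?"  # example Vietnamese prompt
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=256)   # context window is 2048 tokens
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```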

Best Alternatives to T Llama

Best Alternatives    Context / RAM    Downloads    Likes
PersianMind V1.0     2K / 13.7 GB     752          54
Note: a green score (e.g. "73.2") means that the listed alternative outperforms capleaf/T-Llama.


Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227