Lo101 3B AnD by altomek

Tags: Autotrain compatible, Base model:alpindale/llama-3.2..., Base model:intervitensinc/llam..., Base model:merge:alpindale/lla..., Base model:merge:intervitensin..., Conversational, En, Endpoints compatible, Finetuned, Instruct, Llama, Llama-3, Merge, Region:us, Safetensors, Sharded, Tensorflow
Model Card on HF: https://huggingface.co/altomek/Lo101-3B-AnD

Lo101 3B AnD Parameters and Internals

Model Type: text generation
Additional Notes: Not as expressive as Llama Instruct. Writes simpler responses in chat. Somewhat broken due to tokenizer issues.
Supported Languages: English (en); proficiency not specified
Training Details
Data Sources: jeiku
Methodology: first role-play (RP) directed finetune and merge
LLM Name: Lo101 3B AnD
Repository: https://huggingface.co/altomek/Lo101-3B-AnD
Base Model(s): IntervitensInc/Llama-3.2-3B-chatml, alpindale/Llama-3.2-3B-Instruct
Model Size: 3b
Required VRAM: 6.5 GB
Updated: 2024-12-06
Maintainer: altomek
Model Type: llama
Instruction-Based: Yes
Model Files: 2.0 GB (1-of-4), 2.0 GB (2-of-4), 2.0 GB (3-of-4), 0.5 GB (4-of-4)
Supported Languages: en
Model Architecture: LlamaForCausalLM
License: llama3.2
Context Length: 16386
Model Max Length: 16386
Transformers Version: 4.45.2
Tokenizer Class: PreTrainedTokenizerFast
Padding Token: <|finetune_right_pad_id|>
Vocabulary Size: 128256
Torch Data Type: float16
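
The figures above can be cross-checked with a little arithmetic: the four shard sizes add up to the listed 6.5 GB, and at 2 bytes per float16 weight that implies roughly 3.25 billion parameters. The sketch below does that sum and also shows one plausible way to load the model with the Hugging Face `transformers` library. The helper names (`total_weights_gb`, `approx_params_billions`, `load_model`) are hypothetical, 1 GB is assumed to mean 1e9 bytes, and the loading path is untested against this particular repository, which the notes above describe as having tokenizer issues.

```python
# Values copied from the listing above.
MODEL_ID = "altomek/Lo101-3B-AnD"
SHARD_SIZES_GB = [2.0, 2.0, 2.0, 0.5]  # Model Files: 1-of-4 .. 4-of-4
BYTES_PER_PARAM_FP16 = 2               # Torch Data Type: float16


def total_weights_gb(shards=SHARD_SIZES_GB):
    """Sum of the shard sizes, in GB."""
    return sum(shards)


def approx_params_billions(weights_gb, bytes_per_param=BYTES_PER_PARAM_FP16):
    """Rough parameter count in billions (assumes 1 GB = 1e9 bytes)."""
    return weights_gb / bytes_per_param


def load_model(model_id=MODEL_ID):
    """Load tokenizer and model in float16 (hypothetical helper).

    Downloads the weight shards on first call.
    """
    # Imported lazily so the arithmetic helpers work without these packages.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype=torch.float16)
    return tokenizer, model


if __name__ == "__main__":
    gb = total_weights_gb()
    print(f"weights: {gb} GB, ~{approx_params_billions(gb):.2f}B parameters at float16")
```

The weight size is only a lower bound on memory use: activations and the key-value cache for long prompts need additional room on top of it.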

Best Alternatives to Lo101 3B AnD

Best Alternatives | Context / RAM | Downloads and Likes
Llama 3.2 3B Instruct | 128K / 6.5 GB | 1684000722
Llama 3.2 3B Instruct | 128K / 6.4 GB | 6904332
SummLlama3.2 3B | 128K / 6.5 GB | 309231
Llama Doctor 3.2 3B Instruct | 128K / 6.5 GB | 3127
Llama 3.2 3B Instruct Frog | 128K / 6.5 GB | 15867
Mergekit Ties Xflmond | 128K / 7.2 GB | 900
Mergekit Ties Qgcitfu | 128K / 7.2 GB | 630
...lama 3.2 Rabbit Ko 3B Instruct | 128K / 6.5 GB | 12768
Mergekit Ties Poovzrh | 128K / 7.2 GB | 810
Mergekit Ties Pghuyfi | 128K / 7.2 GB | 790

Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241124