Lo101 3B AnD by altomek

 ยป  All LLMs  ยป  altomek  ยป  Lo101 3B AnD   URL Share it on

  Autotrain compatible Base model:alpindale/llama-3.2... Base model:intervitensinc/llam... Base model:merge:alpindale/lla... Base model:merge:intervitensin...   Conversational   En   Endpoints compatible   Finetuned   Instruct   Llama   Llama-3   Merge   Region:us   Safetensors   Sharded   Tensorflow
Model Card on HF ๐Ÿค—: https://huggingface.co/altomek/Lo101-3B-AnD 

Lo101 3B AnD Benchmarks

nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").
Lo101 3B AnD (altomek/Lo101-3B-AnD)

Lo101 3B AnD Parameters and Internals

Model Type 
text generation
Additional Notes 
Not as expressive like Llama Instruct. Writes simpler responses in chat. Somewhat broken due to tokenizer issues.
Supported Languages 
language (en), proficiency ()
Training Details 
Data Sources:
jeiku
Methodology:
first RP directed finetune and merge
LLM NameLo101 3B AnD
Repository ๐Ÿค—https://huggingface.co/altomek/Lo101-3B-AnD 
Base Model(s)  IntervitensInc/Llama-3.2-3B-chatml   alpindale/Llama-3.2-3B-Instruct   IntervitensInc/Llama-3.2-3B-chatml   alpindale/Llama-3.2-3B-Instruct
Model Size3b
Required VRAM6.5 GB
Updated2025-05-15
Maintaineraltomek
Model Typellama
Instruction-BasedYes
Model Files  2.0 GB: 1-of-4   2.0 GB: 2-of-4   2.0 GB: 3-of-4   0.5 GB: 4-of-4
Supported Languagesen
Model ArchitectureLlamaForCausalLM
Licensellama3.2
Context Length16386
Model Max Length16386
Transformers Version4.45.2
Tokenizer ClassPreTrainedTokenizerFast
Padding Token<|finetune_right_pad_id|>
Vocabulary Size128256
Torch Data Typefloat16

Best Alternatives to Lo101 3B AnD

Best Alternatives
Context / RAM
Downloads
Likes
Llama 3.2 3B Instruct128K / 6.5 GB19156581441
DeepSeek R1 Distill Llama 3B128K / 6.5 GB32011
Llama 3.2 3B RP Toxic Fuse128K / 6.4 GB132
Orpheus 3B 0.1 Pretrained128K / 6.6 GB68330
Zeitgeist 3B V1128K / 6.5 GB1055
ReasoningCore 3B T1 1128K / 6.5 GB611
Llama 3.2 3B Instruct128K / 6.5 GB12098362
... 3.2 3B Math Instruct RE1 ORPO128K / 6.5 GB480
Llama 3.2 3B ToxicKod128K / 6.4 GB82
FuseChat Llama 3.2 3B Instruct128K / 6.5 GB987
Note: green Score (e.g. "73.2") means that the model is better than altomek/Lo101-3B-AnD.

Rank the Lo101 3B AnD Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 47368 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227