Lo101 3B AnD by altomek


Tags: Autotrain compatible, Base model:alpindale/llama-3.2..., Base model:intervitensinc/llam..., Base model:merge:alpindale/lla..., Base model:merge:intervitensin..., Conversational, En, Endpoints compatible, Finetuned, Instruct, Llama, Llama-3, Merge, Region:us, Safetensors, Sharded, Tensorflow
Model Card on HF 🤗: https://huggingface.co/altomek/Lo101-3B-AnD


Lo101 3B AnD Parameters and Internals

Model Type 
text generation
Additional Notes 
Not as expressive as Llama Instruct. Writes simpler responses in chat. Somewhat broken due to tokenizer issues.
Supported Languages 
English (en)
Training Details 
Data Sources:
jeiku
Methodology:
First RP-directed finetune and merge.
LLM Name: Lo101 3B AnD
Repository 🤗: https://huggingface.co/altomek/Lo101-3B-AnD
Base Model(s): IntervitensInc/Llama-3.2-3B-chatml, alpindale/Llama-3.2-3B-Instruct
Model Size: 3b
Required VRAM: 6.5 GB
Updated: 2024-12-26
Maintainer: altomek
Model Type: llama
Instruction-Based: Yes
Model Files: 2.0 GB (1-of-4), 2.0 GB (2-of-4), 2.0 GB (3-of-4), 0.5 GB (4-of-4)
Supported Languages: en
Model Architecture: LlamaForCausalLM
License: llama3.2
Context Length: 16386
Model Max Length: 16386
Transformers Version: 4.45.2
Tokenizer Class: PreTrainedTokenizerFast
Padding Token: <|finetune_right_pad_id|>
Vocabulary Size: 128256
Torch Data Type: float16
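
The figures above can be cross-checked against each other, and the same values are what one would pass when loading the model. The sketch below is illustrative only: the helper names (`total_weights_gb`, `approx_param_count`, `load_model`) are made up for this example, the parameter estimate assumes 1 GB = 1e9 bytes and counts weights only, and the loader assumes the `torch` and `transformers` packages are installed and that the repository loads through the standard `AutoTokenizer` / `AutoModelForCausalLM` classes. Given the tokenizer issues noted above, generated text may need checking.

```python
"""Sketch: sanity-check the listed sizes, then (optionally) load the model."""

MODEL_ID = "altomek/Lo101-3B-AnD"       # repository name from the listing
SHARD_SIZES_GB = [2.0, 2.0, 2.0, 0.5]   # "Model Files" row
BYTES_PER_PARAM = 2                     # float16, per the "Torch Data Type" row


def total_weights_gb(shards=SHARD_SIZES_GB):
    """Sum of the sharded weight files, in GB."""
    return sum(shards)


def approx_param_count(weights_gb, bytes_per_param=BYTES_PER_PARAM):
    """Rough parameter count, assuming 1 GB = 1e9 bytes and weights only."""
    return weights_gb * 1e9 / bytes_per_param


def load_model(model_id=MODEL_ID):
    """Load tokenizer and model in float16 (downloads the shards on first use)."""
    # Imported lazily so the arithmetic helpers work without these packages.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype=torch.float16)
    return tokenizer, model


if __name__ == "__main__":
    gb = total_weights_gb()
    print(f"weights: {gb} GB, roughly {approx_param_count(gb) / 1e9:.2f}B parameters")
```

The four shards add up to the 6.5 GB quoted as required VRAM, which suggests that figure covers the float16 weights alone; activations and the key-value cache for long contexts would come on top of it.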

Best Alternatives to Lo101 3B AnD

Best Alternatives | Context / RAM | Downloads, Likes
Llama 3.2 3B Instruct | 128K / 6.5 GB | 2146659821
Codepy Deepthink 3B | 128K / 6.5 GB | 07
Llama 3.2 3B Instruct | 128K / 6.4 GB | 6021838
Llama 3.2 3B Instruct | 128K / 6.5 GB | 894692
...lama 3.2 Rabbit Ko 3B Instruct | 128K / 6.5 GB | 9618
FinMatcha 3B Instruct | 128K / 6.5 GB | 3260
Llama Chat Summary 3.2 3B | 128K / 6.5 GB | 1447
SummLlama3.2 3B | 128K / 6.5 GB | 448635
Llama Song Stream 3B Instruct | 128K / 6.5 GB | 5710
...3.2 Rabbit Ko 3B Instruct 2412 | 128K / 6.5 GB | 290

Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227