A4.1 by MrRobotoAI

 ยป  All LLMs  ยป  MrRobotoAI  ยป  A4.1   URL Share it on

  Arxiv:2203.05482   Autotrain compatible Base model:arushimgupta/lora s... Base model:aryanagrawal1/llama... Base model:athirdpath/bigmistr... Base model:automorphic/lora 20... Base model:azazelle/go-bruins-... Base model:azazelle/l3-daybrea... Base model:azazelle/llama-3-8b... Base model:azazelle/llama-3-lo... Base model:azazelle/llama-3-su... Base model:azazelle/llama3-8b-...   Base model:azazelle/nimue-8b Base model:azazelle/smarts lla... Base model:basileplus/llama3-8... Base model:blackroot/llama-3-l... Base model:blackroot/llama3-rp... Base model:chat-error/claude-k... Base model:dreadpoor/everythin... Base model:epiculous/mika-7b-l... Base model:erbacher/zephyr-con... Base model:erbacher/zephyr-rag... Base model:erbacher/zephyr-rag... Base model:grimjim/llama-3-ins... Base model:hannahbillo/dpo-lla... Base model:hf-100/llama-3.1-sp... Base model:ian00000/llama-3-8b... Base model:merge:arushimgupta/... Base model:merge:aryanagrawal1... Base model:merge:athirdpath/bi... Base model:merge:automorphic/l... Base model:merge:azazelle/go-b... Base model:merge:azazelle/l3-d... Base model:merge:azazelle/llam... Base model:merge:azazelle/llam... Base model:merge:azazelle/llam... Base model:merge:azazelle/llam... Base model:merge:azazelle/nimu... Base model:merge:azazelle/smar... Base model:merge:basileplus/ll... Base model:merge:blackroot/lla... Base model:merge:blackroot/lla... Base model:merge:chat-error/cl... Base model:merge:dreadpoor/eve... Base model:merge:epiculous/mik... Base model:merge:erbacher/zeph... Base model:merge:erbacher/zeph... Base model:merge:erbacher/zeph... Base model:merge:grimjim/llama... Base model:merge:hannahbillo/d... Base model:merge:hf-100/llama-... Base model:merge:ian00000/llam...   Base model:merge:mrrobotoai/a4   Base model:mrrobotoai/a4   Conversational   Endpoints compatible   Instruct   Llama   Lora   Merge   Mergekit   Region:us   Safetensors   Sharded   Tensorflow
Model Card on HF ๐Ÿค—: https://huggingface.co/MrRobotoAI/A4.1 

A4.1 Benchmarks

nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").
A4.1 (MrRobotoAI/A4.1)

A4.1 Parameters and Internals

LLM NameA4.1
Repository ๐Ÿค—https://huggingface.co/MrRobotoAI/A4.1 
Base Model(s)  A4   Azazelle/Llama-3-8B-Abomination-LORA   A4   Azazelle/Llama-3-LongStory-LORA   A4   Go Bruins V3 Lora   A4   Everything COT 8B R128 LoRA   A4   Azazelle/Llama-3-Sunfall-8b-lora   A4   DPO Llama3 8B Grammar Rules   A4   ...t Sft Rewriting Fs Promptbench   A4   Azazelle/Smarts_Llama3   A4   automorphic/LORA_20231221_042843_philosophy   A4   Llama 3 LongStory LORA   A4   Azazelle/llama3-8b-hikikomori-v0.4   A4   Zephyr Rag Agent   A4   Llama3 RP Lora   A4   hf-100/Llama-3.1-Spellbound-StoryWriter-0.1-lora   A4   BigMistral 11B GLUE LORA   A4   Nimue 8B   A4   L3 Daybreak 8B Lora   A4   Claude Kimiko   A4   Llama3 8B Schopenhauer   A4   Mika 7B LoRA   A4   arushimgupta/lora_save_7   A4   ... Instruct Abliteration LoRA 8B   A4   ...a 3 8B Offensive CoT Finetuned   A4   Zephyr Rag Agent Webgpt   A4   erbacher/zephyr-convsearch-7b-v2   MrRobotoAI/A4   Azazelle/Llama-3-8B-Abomination-LORA   MrRobotoAI/A4   Azazelle/Llama-3-LongStory-LORA   MrRobotoAI/A4   Azazelle/go-bruins-v3-lora   MrRobotoAI/A4   DreadPoor/Everything-COT-8B-r128-LoRA   MrRobotoAI/A4   Azazelle/Llama-3-Sunfall-8b-lora   MrRobotoAI/A4   hannahbillo/dpo-llama3-8b-grammar-rules   MrRobotoAI/A4   aryanagrawal1/llama-3-8b-instruct-sft-rewriting-fs-promptbench   MrRobotoAI/A4   Azazelle/Smarts_Llama3   MrRobotoAI/A4   automorphic/LORA_20231221_042843_philosophy   MrRobotoAI/A4   Blackroot/Llama-3-LongStory-LORA   MrRobotoAI/A4   Azazelle/llama3-8b-hikikomori-v0.4   MrRobotoAI/A4   erbacher/zephyr-rag-agent   MrRobotoAI/A4   Blackroot/Llama3-RP-Lora   MrRobotoAI/A4   hf-100/Llama-3.1-Spellbound-StoryWriter-0.1-lora   MrRobotoAI/A4   athirdpath/BigMistral-11b-GLUE_LORA   MrRobotoAI/A4   Azazelle/Nimue-8B   MrRobotoAI/A4   Azazelle/L3-Daybreak-8b-lora   MrRobotoAI/A4   Chat-Error/Claude-Kimiko   MrRobotoAI/A4   basilePlus/llama3-8b-schopenhauer   MrRobotoAI/A4   Epiculous/Mika-7B-LoRA   MrRobotoAI/A4   arushimgupta/lora_save_7   MrRobotoAI/A4   grimjim/Llama-3-Instruct-abliteration-LoRA-8B   MrRobotoAI/A4   ian00000/Llama-3-8B_offensive_CoT_finetuned   MrRobotoAI/A4   erbacher/zephyr-rag-agent-webgpt   MrRobotoAI/A4   erbacher/zephyr-convsearch-7b-v2
Model Size8b
Required VRAM16.1 GB
Updated2025-04-02
MaintainerMrRobotoAI
Model Typellama
Instruction-BasedYes
Model Files  5.0 GB: 1-of-4   5.0 GB: 2-of-4   4.9 GB: 3-of-4   1.2 GB: 4-of-4
Model ArchitectureLlamaForCausalLM
Context Length1048576
Model Max Length1048576
Transformers Version4.49.0
Tokenizer ClassPreTrainedTokenizer
Vocabulary Size128256
LoRA ModelYes
Torch Data Typefloat16

Best Alternatives to A4.1

Best Alternatives
Context / RAM
Downloads
Likes
...a 3 8B Instruct Gradient 1048K1024K / 16.1 GB16571680
A4.21024K / 16.1 GB1340
A3.21024K / 16.1 GB1330
A2.21024K / 16.1 GB1330
A1.21024K / 16.1 GB1330
A6.21024K / 16.1 GB1330
A5.21024K / 16.1 GB1330
A5.11024K / 16.1 GB1310
A3.11024K / 16.1 GB1310
A2.11024K / 16.1 GB1310
Note: green Score (e.g. "73.2") means that the model is better than MrRobotoAI/A4.1.

Rank the A4.1 Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 45881 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227