Ao Karasu 72B AWQ 4bit By lightblue: Benchmarks, Features and Detailed Analysis. Insights on Ao Karasu 72B AWQ 4bit.

Merged Model 4-bit 4bit Autotrain compatible Awq Conversational Endpoints compatible Quantized Qwen2 Region:us Safetensors Sharded Tensorflow

Model Card on HF 🤗: https://huggingface.co/lightblue/ao-karasu-72B-AWQ-4bit

Ao Karasu 72B AWQ 4bit Benchmarks

LLME Score: 0.17093

^nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").

What is the LLM Explorer Rank (Score)

Ao Karasu 72B AWQ 4bit (lightblue/ao-karasu-72B-AWQ-4bit)

Ao Karasu 72B AWQ 4bit Parameters and Internals

Model Type

text generation, chatbot

Use Cases

Areas:

AI assistance

Applications:

chat-based applications

Primary Use Cases:

Conversation with users in various domains

Additional Notes

The model was trained with a focus on Japanese language proficiency.

Supported Languages

Japanese (proficient)

Training Details

Data Sources:

Wikipedia-based QA, technical blogs, Japanese QA site answers, LLM generated prompts and responses, news articles

Data Volume:

1.1 billion characters

Training Time:

~1 day

Hardware Used:

A100 (80GB) GPU

Input Output

Input Format:

Prompt format using chat template.

Accepted Modalities:

text

Output Format:

Text

LLM Name	Ao Karasu 72B AWQ 4bit
Repository 🤗	https://huggingface.co/lightblue/ao-karasu-72B-AWQ-4bit
Merged Model	Yes
Model Size	72b
Required VRAM	41.3 GB
Updated	2025-02-22
Maintainer	lightblue
Model Type	qwen2
Model Files	5.0 GB: 1-of-9 5.0 GB: 2-of-9 5.0 GB: 3-of-9 5.0 GB: 4-of-9 5.0 GB: 5-of-9 5.0 GB: 6-of-9 5.0 GB: 7-of-9 3.8 GB: 8-of-9 2.5 GB: 9-of-9
AWQ Quantization	Yes
Quantization Type	awq\|4bit
Model Architecture	Qwen2ForCausalLM
Context Length	32768
Model Max Length	32768
Transformers Version	4.39.0.dev0
Tokenizer Class	Qwen2Tokenizer
Padding Token	<\|endoftext\|>
Vocabulary Size	152064
Torch Data Type	float16
Errors	replace

Best Alternatives to Ao Karasu 72B AWQ 4bit

Best Alternatives	Context / RAM	Downloads	Likes
Tess V2.5.2 Qwen2 72B AWQ	128K / 41.6 GB	14	0
Qwen2.5 72B Instruct AWQ	32K / 41.6 GB	740721	57
Qwen2 72B Instruct AWQ	32K / 41.6 GB	82951	38
Athene V2 Chat AWQ	32K / 41.6 GB	73	4
Qwen2 72B Instruct AWQ GEMM	32K / 41.6 GB	8	0
Qwen1.5 72B Chat AWQ	32K / 41.3 GB	543	23
Qwen1.5 72B AWQ	32K / 41.3 GB	11	1
Typhoon V1.5 72B Instruct AWQ	32K / 41.3 GB	19	3
Liberated Qwen1.5 72B AWQ	32K / 41.3 GB	19	1
Qwen2 72B Bnb 4bit	128K / 41.2 GB	173	4

Note: green Score (e.g. "73.2") means that the model is better than lightblue/ao-karasu-72B-AWQ-4bit.

Rank the Ao Karasu 72B AWQ 4bit Capabilities

🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟

Instruction Following and Task Automation
Factuality and Completeness of Knowledge
Censorship and Alignment
Data Analysis and Insight Generation
Text Generation
Text Summarization and Feature Extraction
Code Generation
Multi-Language Support and Translation

What open-source LLMs or SLMs are you in search of? 43470 in total.

Email us: info@extractum.io. Our Privacy Policy | Terms and Conditions | Suggest an improvement.

Our Social Media →

Original data from HuggingFace, OpenCompass and various public git repos.

Release v20241227

Support LLM Explorer

Ao Karasu 72B AWQ 4bit by lightblue

» All LLMs » lightblue » Ao Karasu 72B AWQ 4bit URL Share it on

Ao Karasu 72B AWQ 4bit Benchmarks

Ao Karasu 72B AWQ 4bit Parameters and Internals

Best Alternatives to Ao Karasu 72B AWQ 4bit

Rank the Ao Karasu 72B AWQ 4bit Capabilities

What open-source LLMs or SLMs are you in search of? 43470 in total.