Gemma ChemWiz 16bit By dbands: Benchmarks, Features and Detailed Analysis. Insights on Gemma ChemWiz 16bit.

16bit Dataset:ai-mo/numinamath-cot Dataset:ai4chem/chemdata700k Dataset:andersonbcdefg/chemist... Dataset:medalpaca/medical mead... Gemma Quantized Region:us Safetensors Sharded Tensorflow

Model Card on HF 🤗: https://huggingface.co/dbands/Gemma_ChemWiz_16bit

Gemma ChemWiz 16bit Benchmarks

LLME Score: 0.20866

^nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").

What is the LLM Explorer Rank (Score)

Gemma ChemWiz 16bit (dbands/Gemma_ChemWiz_16bit)

Gemma ChemWiz 16bit Parameters and Internals

Model Type

text-generation-inference, transformers

Additional Notes

This gemma model was trained 2x faster with Unsloth and Huggingface's TRL library.

Supported Languages

en (high)

Training Details

Data Sources:

AI-MO/NuminaMath-CoT, AI4Chem/ChemData700K, medalpaca/medical_meadow_mediqa, andersonbcdefg/chemistry

Methodology:

Fine-tuned on chemical memory and logical reasoning using datasets.

Release Notes

Version:

2024-08-15

Date:

2024-08-15

Notes:

This is now the base model. The model with python RDKit training is created as Gemma_ChemWiz_rdkit_16bit.

Version:

2024-08-15

Date:

2024-08-15

Notes:

Splitting model today. This model will be the base ChemWiz Model.

Version:

2024-08-13

Date:

2024-08-13

Notes:

Second round of AI4Chem/ChemData700K, results of chemical smiles are very low.

Version:

2024-08-12

Date:

2024-08-12

Notes:

medalpaca/medical_meadow_mediqa dataset was used, model converged in less than one epoch.

Version:

2024-08-12

Date:

2024-08-12

Notes:

Fine-tuning on chemical memory rather than chemistry reasoning.

Version:

2024-08-09

Date:

2024-08-09

Notes:

Fine-tuned for logical reasoning, model still experimental.

LLM Name	Gemma ChemWiz 16bit
Repository 🤗	https://huggingface.co/dbands/Gemma_ChemWiz_16bit
Model Size	8.5b
Required VRAM	17.1 GB
Updated	2025-02-05
Maintainer	dbands
Model Type	gemma
Model Files	5.0 GB: 1-of-4 5.0 GB: 2-of-4 5.0 GB: 3-of-4 2.1 GB: 4-of-4
Quantization Type	16bit
Model Architecture	GemmaForCausalLM
Context Length	8192
Model Max Length	8192
Transformers Version	4.44.0
Tokenizer Class	GemmaTokenizer
Padding Token	<pad>
Vocabulary Size	256000
Torch Data Type	bfloat16

Quantized Models of the Gemma ChemWiz 16bit

Model	Likes	Downloads	VRAM
Gemma ChemWiz Rdkit 16bit	0	6	17 GB

Best Alternatives to Gemma ChemWiz 16bit

Best Alternatives	Context / RAM	Downloads	Likes
Gemma ChemWiz Rdkit 16bit	8K / 17.1 GB	6	0
Openchat 3.5 0106 Gemma	8K / 17.1 GB	7912	57
FiLLM3 NER	8K / 17.1 GB	24	0
FiLLM Experimental	8K / 17.1 GB	20	0
... Health Counseling V0.1 Merged	8K / 9.4 GB	7	0

Note: green Score (e.g. "73.2") means that the model is better than dbands/Gemma_ChemWiz_16bit.

Rank the Gemma ChemWiz 16bit Capabilities

🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟

Instruction Following and Task Automation
Factuality and Completeness of Knowledge
Censorship and Alignment
Data Analysis and Insight Generation
Text Generation
Text Summarization and Feature Extraction
Code Generation
Multi-Language Support and Translation

What open-source LLMs or SLMs are you in search of? 42577 in total.

Email us: info@extractum.io. Our Privacy Policy | Terms and Conditions | Suggest an improvement.

Our Social Media →

Original data from HuggingFace, OpenCompass and various public git repos.

Release v20241227

Support LLM Explorer

Gemma ChemWiz 16bit by dbands

» All LLMs » dbands » Gemma ChemWiz 16bit URL Share it on

Gemma ChemWiz 16bit Benchmarks

Gemma ChemWiz 16bit Parameters and Internals

Quantized Models of the Gemma ChemWiz 16bit

Best Alternatives to Gemma ChemWiz 16bit

Rank the Gemma ChemWiz 16bit Capabilities

What open-source LLMs or SLMs are you in search of? 42577 in total.