DiarizationLM 13B Fisher V1 by google


Tags: Arxiv:2401.03506, 4bit, Autotrain compatible, Endpoints compatible, Gguf, Llama, Q4, Quantized, Region:us, Safetensors, Sharded, Tensorflow


DiarizationLM 13B Fisher V1 Parameters and Internals

Model Type 
Speaker Diarization, Causal Language Model
Use Cases 
Areas:
Research applications
Additional Notes 
This model is outdated. Please use google/DiarizationLM-8b-Fisher-v2 instead.
Training Details 
Data Sources:
Fisher corpus
Data Volume:
48,142 prompt-completion pairs
Methodology:
Finetuning a foundation model using LoRA adapter
Context Length:
4096
Training Time:
More than 3 days
Hardware Used:
Google Cloud VM instance with one NVIDIA A100 GPU (80GB)
LLM Name: DiarizationLM 13B Fisher V1
Repository: https://huggingface.co/google/DiarizationLM-13b-Fisher-v1
Model Size: 13b
Required VRAM: 26 GB
Updated: 2024-11-16
Maintainer: google
Model Type: llama
Model Files: 5.0 GB: 1-of-6, 5.0 GB: 2-of-6, 5.0 GB: 3-of-6, 4.9 GB: 4-of-6, 4.9 GB: 5-of-6, 1.2 GB: 6-of-6, 7.9 GB, 13.8 GB
GGUF Quantization: Yes
Quantization Type: gguf, q4, 4bit, q4_k
Model Architecture: LlamaForCausalLM
License: llama2
Context Length: 4096
Model Max Length: 4096
Transformers Version: 4.41.2
Tokenizer Class: LlamaTokenizer
Padding Token: <unk>
Vocabulary Size: 32000
Torch Data Type: bfloat16
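
The 26 GB VRAM figure is consistent with storing 13 billion parameters in bfloat16 at 2 bytes each, and with the total size of the six sharded files listed above. The following is a minimal sketch of that arithmetic, not the listing site's actual method. It assumes "13b" means exactly 13e9 parameters and uses decimal gigabytes; the helper name weight_memory_gb is hypothetical. The estimate covers weights only and ignores activations, KV cache, and framework overhead.

```python
# Bytes needed to store one parameter in common floating-point dtypes.
BYTES_PER_PARAM = {"bfloat16": 2, "float16": 2, "float32": 4}


def weight_memory_gb(num_params: float, dtype: str) -> float:
    """Approximate weight size in decimal gigabytes (1 GB = 1e9 bytes)."""
    return num_params * BYTES_PER_PARAM[dtype] / 1e9


# Assumption: "13b" is treated as exactly 13e9 parameters.
estimate_gb = weight_memory_gb(13e9, "bfloat16")

# Sizes of the six sharded files from the Model Files entry, in GB.
shard_sizes_gb = [5.0, 5.0, 5.0, 4.9, 4.9, 1.2]
shard_total_gb = round(sum(shard_sizes_gb), 1)

print(f"bfloat16 weights: {estimate_gb:.1f} GB")
print(f"sum of shards:    {shard_total_gb:.1f} GB")
```

Both numbers come out at about 26 GB. The two unsharded files in the same entry (7.9 GB and 13.8 GB) are left out of the sum.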

Best Alternatives to DiarizationLM 13B Fisher V1

Best Alternatives | Context / RAM | Downloads, Likes
Llm Compiler 13B Ftd GGUF | 16K / 4.8 GB | 2600
Llm Compiler 13B GGUF | 16K / 4.8 GB | 870
Llm Compiler 13B Ftd GGUF | 16K / 4.8 GB | 710
Llm Compiler 13B GGUF | 16K / 4.8 GB | 270
CodeLlama 13B Instruct GGUF | 16K / 5.4 GB | 1812
Mythomax L2 13B Q4 K M GGUF | 4K / 8.1 GB | 111731
Luminia 13B V3 | 4K / 26 GB | 1015
HyperLlama2Test | 4K / 26 GB | 80
...V2 13B L2 BetaTest Q4 K M GGUF | 4K / 7.9 GB | 160
AppleSauce L2 13B | 4K / 26.7 GB | 14311


Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241110