fixes https://github.com/QuentinFuxa/WhisperLiveKit/issues/269

2026-03-07 22:33:36 +00:00 · 2025-11-09 20:08:18 +01:00
parent a732e0903e
commit 7108d2ddc5
3 changed files with 38 additions and 22 deletions
--- a/docs/models_compatible_formats.md
+++ b/docs/models_compatible_formats.md
@@ -3,12 +3,14 @@
 The `--model-path` parameter accepts:

 ## File Path
-- **`.pt` format only** (required for AlignAtt policy decoder)
+- **`.pt` / `.bin` / `.safetensors` formats**: the file must be loadable with PyTorch or safetensors.

 ## Directory Path (recommended)
 Must contain:
-- **`.pt` file** (required for decoder)
+- **`.pt` / `.bin` / `.safetensors` file** (required for decoder)

 May optionally contain:
 - **`.bin` file** - faster-whisper model for encoder (requires faster-whisper)
-- **`weights.npz`** or **`weights.safetensors`** - for encoder (requires whisper-mlx)
+- **`weights.npz`** or **`weights.safetensors`** - for encoder (requires whisper-mlx)
+
+To improve speed and reduce hallucinations, you can run `scripts/determine_alignment_heads.py` to determine the alignment heads for your model, then pass them to WLK with `--custom-alignment-heads`. Otherwise, the alignment heads default to all heads in the last half of the decoder layers.
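
Note on the fallback described in the added paragraph: the sketch below illustrates one reading of "all heads in the last half of the decoder layers". It is not taken from the WLK code. The function name `default_alignment_heads` and the `(layer, head)` pair representation are hypothetical, and the format that `--custom-alignment-heads` expects is not shown here.

```python
def default_alignment_heads(n_layers: int, n_heads: int) -> list[tuple[int, int]]:
    """Hypothetical helper: list every (layer, head) pair in the last half
    of the decoder layers, which is how the doc describes the fallback."""
    first_layer = n_layers // 2  # assumption: integer division picks the split point
    return [
        (layer, head)
        for layer in range(first_layer, n_layers)
        for head in range(n_heads)
    ]


if __name__ == "__main__":
    # Toy decoder with 4 layers of 2 heads each: layers 2 and 3 are selected.
    print(default_alignment_heads(4, 2))
```

For a decoder with many layers this fallback selects a large number of heads, which is why a smaller, model-specific set may be faster.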