mirror of
https://github.com/QuentinFuxa/WhisperLiveKit.git
synced 2026-03-07 14:23:18 +00:00
982 B
982 B
Model Path Formats
The --model-path parameter accepts:
File Path
.pt/.bin/.safetensorformats Should be openable by pytorch/safetensor.
Directory Path (recommended)
Must contain:
.pt/.bin/.safetensorfile (required for decoder)
May optionally contain:
.binfile - faster-whisper model for encoder (requires faster-whisper)weights.npzorweights.safetensors- for encoder (requires whisper-mlx)
Hugging Face Repo ID
- Provide the repo ID (e.g.
openai/whisper-large-v3) and WhisperLiveKit will download and cache the snapshot automatically. For gated repos, authenticate viahuggingface-cli loginfirst.
To improve speed/reduce allucinations, you may want to use scripts/determine_alignment_heads.py to determine the alignment heads to use for your model, and use the --custom-alignment-heads to pass them to WLK. If not, alignement heads are set to be all the heads of the last half layer of decoder.