mirror of
https://github.com/QuentinFuxa/WhisperLiveKit.git
synced 2026-03-07 22:33:36 +00:00
20 lines
982 B
Markdown
20 lines
982 B
Markdown
# Model Path Formats
|
|
|
|
The `--model-path` parameter accepts:
|
|
|
|
## File Path
|
|
- **`.pt` / `.bin` / `.safetensor` formats** Should be openable by pytorch/safetensor.
|
|
|
|
## Directory Path (recommended)
|
|
Must contain:
|
|
- **`.pt` / `.bin` / `.safetensor` file** (required for decoder)
|
|
|
|
May optionally contain:
|
|
- **`.bin` file** - faster-whisper model for encoder (requires faster-whisper)
|
|
- **`weights.npz`** or **`weights.safetensors`** - for encoder (requires whisper-mlx)
|
|
|
|
## Hugging Face Repo ID
|
|
- Provide the repo ID (e.g. `openai/whisper-large-v3`) and WhisperLiveKit will download and cache the snapshot automatically. For gated repos, authenticate via `huggingface-cli login` first.
|
|
|
|
To improve speed/reduce allucinations, you may want to use `scripts/determine_alignment_heads.py` to determine the alignment heads to use for your model, and use the `--custom-alignment-heads` to pass them to WLK. If not, alignement heads are set to be all the heads of the last half layer of decoder.
|