WhisperLiveKit/__init__.py at a4da246ea5f8b7960c86c309e26ceaaab05b9d56 - WhisperLiveKit - Gitea: Git with a cup of tea

LLM/WhisperLiveKit

mirror of https://github.com/QuentinFuxa/WhisperLiveKit.git synced 2026-03-07 14:23:18 +00:00

Files

Quentin Fuxa a4da246ea5 feat: add voxtral-mlx native backend for Apple Silicon

Pure-MLX implementation of Voxtral Mini 4B Realtime for low-latency
speech transcription on Apple Silicon. Avoids the transformers/torch
overhead and runs at 0.18-0.32x real-time factor.

- voxtral_mlx/model.py: MLX model with spectrogram, encoder, decoder
- voxtral_mlx/loader.py: model loading with 6-bit quantized weights
- voxtral_mlx/spectrogram.py: mel spectrogram computation in MLX
- voxtral_mlx_asr.py: VoxtralASR adapter for the AudioProcessor pipeline

2026-02-22 23:28:10 +01:00

7 lines

188 B

Python

Raw Blame History

 """Pure-MLX Voxtral Realtime backend for WhisperLiveKit."""
 from .loader import load_voxtral_model
 from .model import VoxtralMLXModel
 __all__ = ["load_voxtral_model", "VoxtralMLXModel"]