WhisperLiveKit/benchmark_chart.png at main

mirror of https://github.com/QuentinFuxa/WhisperLiveKit.git synced 2026-03-06 22:04:06 +00:00


Quentin Fuxa c76b2ef2c6 docs: rewrite benchmark with base/small comparison, proper French results

- Re-ran all whisper benchmarks with --lan fr for the French file
  (previously ran with --lan en which made the results meaningless)
- Added small model results alongside base for all backends
- Added model size comparison table (base vs small tradeoffs)
- Added benchmark chart (30s English, WER + RTF by backend)
- Added caveats section about dataset size and RTF variance
- Key findings: SimulStreaming saturates at 5.3% WER on base already,
  small model mainly helps LocalAgreement and French timestamps
- mlx-whisper LA base is unstable on French (hallucination loops)

2026-02-23 10:16:34 +01:00

69 KiB

2085x770px


