Commit Graph

  • 8bc0937c46 Update README section on powered research main Quentin Fuxa 2026-03-06 18:46:07 +01:00
  • 929cf7a26b add link to AlignAtt interactive playground Quentin Fuxa 2026-03-06 18:43:25 +01:00
  • abfaf06203 Merge branch 'main' of https://github.com/QuentinFuxa/WhisperLiveKit Quentin Fuxa 2026-03-04 18:17:23 +01:00
  • d1fe932241 Apply DRY method v0 - to try to catch and resolve infinite loops such as in #338 Quentin Fuxa 2026-03-03 22:52:00 +01:00
  • c112ceffb6 Merge pull request #342 from mnicnc404/fix/whisper-tokenizer-index-error Quentin Fuxa 2026-03-02 20:36:58 +01:00
  • 4917406e06 Merge pull request #341 from AymurAI/feat/uv-deps-resolution Quentin Fuxa 2026-03-02 20:34:49 +01:00
  • b63f54e838 fix(whisper/tokenizer): prevent IndexError from crashing multilingual streams Chingning Chen 2026-03-02 15:31:43 +08:00
  • c56a53fbf4 deps(mlx-groups): add optional dependencies for Apple Silicon MLX backends jedzill4 2026-03-01 20:05:52 -03:00
  • 66e58624b9 disable MLXAlignAtt which fails on special characters Quentin Fuxa 2026-03-01 11:52:00 +01:00
  • 9366e067f9 deps(pyproject): add torch and torchaudio to main dependencies jedzill4 2026-02-27 19:19:18 -03:00
  • 866c25670c deps(docker): change CUDA base image to runtime version jedzill4 2026-02-27 19:16:29 -03:00
  • 2553ef283e deps(docker): fix dependency group for cu129 image jedzill4 2026-02-25 21:49:08 -03:00
  • 73e7fafc48 feat(tests): python matrix support test jedzill4 2026-02-25 21:35:41 -03:00
  • bbcebcb1fe deps(sortformer): adjust nemo-toolkit version constraints jedzill4 2026-02-25 21:33:00 -03:00
  • 4bb58dc7aa deps(diart): improve diart dependency tree. rename gpu-cu129 dependency group to cu129 jedzill4 2026-02-25 20:27:26 -03:00
  • 27ca028479 ci(github): add GitHub Actions workflows for Docker image publishing and support matrix jedzill4 2026-02-25 14:27:51 -03:00
  • d24805cc18 🚀 chore (docker): update docker images improving caching and using uv as python package manager jedzill4 2026-02-25 14:22:43 -03:00
  • 994ce21365 📌 chore(deps): pin dependences to python 3.11 to 3.13 due dependency resolution matrix jedzill4 2026-02-25 14:21:19 -03:00
  • 132823dc09 deps: improve deps dependency resolution (wip) jedzill4 2026-02-24 20:15:53 -03:00
  • d6d8c2635f chore: use uv as python project manager to improve dependency resolution jedzill4 2026-02-23 22:16:32 -03:00
  • 8fedeb9fed Merge pull request #340 from QuentinFuxa/voxtral_tests v0.2.19 Quentin Fuxa 2026-02-23 10:37:40 +01:00
  • b1fc23807a docs: add benchmark collaboration call, voxtral in powered-by section voxtral_tests Quentin Fuxa 2026-02-23 10:37:22 +01:00
  • 10c4e5f730 docs: add speed vs accuracy scatter plot to benchmark and README Quentin Fuxa 2026-02-23 10:27:53 +01:00
  • c76b2ef2c6 docs: rewrite benchmark with base/small comparison, proper French results Quentin Fuxa 2026-02-23 10:16:34 +01:00
  • 4b2377c243 fix: correct false auto-detect claim, median bug, RTF inflation Quentin Fuxa 2026-02-22 23:38:04 +01:00
  • a4da246ea5 feat: add voxtral-mlx native backend for Apple Silicon Quentin Fuxa 2026-02-22 23:28:10 +01:00
  • 9b2c3ee844 docs: update README with voxtral backend, benchmarks, testing sections Quentin Fuxa 2026-02-22 23:27:57 +01:00
  • 83d0fa3fac feat: benchmark suite with WER, timestamp accuracy, cross-backend comparison Quentin Fuxa 2026-02-22 23:27:50 +01:00
  • 5a12c627b4 feat: add 99-test unit test suite with zero model dependencies Quentin Fuxa 2026-02-22 23:27:40 +01:00
  • f5eee67b11 fix: silence double-counting bug, add metrics module and runtime instrumentation Quentin Fuxa 2026-02-22 23:27:12 +01:00
  • 4a6868e3e1 correct processor attributes mixtral Quentin Fuxa 2026-02-22 21:13:21 +01:00
  • 3c15246fc0 mixstral hf v0 Quentin Fuxa 2026-02-20 20:46:37 +01:00
  • d337248fda feat: add healthcheck to Dockerfiles (#228) Quentin Fuxa 2026-02-19 22:18:00 +01:00
  • b8d9d7d289 fix: handle numpy object_ dtype from ctranslate2 encoder (#337) Quentin Fuxa 2026-02-19 22:18:00 +01:00
  • 4c7706e2cf fix: use vac_chunk_size for audio processing interval when VAC is enabled (#334) Quentin Fuxa 2026-02-19 22:18:00 +01:00
  • 7ea507ed8e Add Voxtral MLX streaming backend feature/voxtral Quentin Fuxa 2026-02-17 09:20:28 +01:00
  • d9a4c8dcb2 Refactor transcription and diarization handling with token-by-token validation. Introduce segment buffers for ephemeral content and update API to return structured segment data. Enhance silence handling and improve web interface for text transcripts. api_live Quentin Fuxa 2025-11-30 16:39:27 +01:00
  • 4fb735a784 new token treatment only iar Quentin Fuxa 2025-11-30 15:16:36 +01:00
  • d2f998cb7e val Quentin Fuxa 2025-11-30 14:37:37 +01:00
  • 7b18917f2b LoRA archi Quentin Fuxa 2025-11-30 12:30:18 +01:00
  • 9d4ae33249 WIP. Trying ten VAD #280 VAD-evolutions Quentin Fuxa 2025-11-23 11:20:00 +01:00
  • a246ba9bfe v0 new_api Quentin Fuxa 2025-11-09 22:02:15 +01:00
  • aa44a92a67 add embedded web interface HTML (single-file version with inline CSS/JS/SVG) translation-improvements Quentin Fuxa 2025-08-29 21:58:51 +02:00
  • 01d791470b add test files Quentin Fuxa 2025-08-29 17:45:32 +02:00
  • e704b0b0db Refactor imports and update __all__ to include internal argument parsing functions windows_audio_loopback Quentin Fuxa 2025-05-05 09:38:46 +02:00
  • 2dd974ade0 Add support for PyAudioWPatch audio input on Windows Quentin Fuxa 2025-05-05 09:30:18 +02:00
  • 7f3a3df620 simulstreaming mlx & torch dedup of common base Quentin Fuxa 2025-02-15 23:52:00 +01:00
  • e7e82f7c19 bump to 0.2.18 0.2.18 Quentin Fuxa 2026-02-11 22:10:00 +01:00
  • 8c799fa4d1 fix simulstreaming vram leak: cap cross-attn accumulation + token budget Quentin Fuxa 2026-02-11 22:10:00 +01:00
  • 8923337380 fix --direct-english-translation not setting task=translate for localagreement backends Quentin Fuxa 2026-02-11 22:10:00 +01:00
  • aded1649ae fix model_cache_dir + direct_english_translation task in simulstreaming Quentin Fuxa 2026-02-11 22:10:00 +01:00
  • 3b535e857a fix NoneType concatenation in add_translation Quentin Fuxa 2026-02-11 22:10:00 +01:00
  • d649250b9a fix Segment classmethod call + isinstance type narrowing Quentin Fuxa 2026-02-11 22:10:00 +01:00
  • 7735478286 add insert_audio_chunk to DiartDiarization Quentin Fuxa 2026-02-11 22:10:00 +01:00
  • b9e72d2b9a add probability field to ASRToken Quentin Fuxa 2026-02-11 22:10:00 +01:00
  • e5b01033af add json normalizers for english language in build Quentin Fuxa 2026-01-16 10:47:46 +01:00
  • 6ae545bcb1 bump to 0.2.17.post1 0.2.17.post1 Quentin Fuxa 2026-01-16 10:43:52 +01:00
  • 04980d3f5e Merge branch 'main' of https://github.com/QuentinFuxa/WhisperLiveKit Quentin Fuxa 2026-01-16 10:38:29 +01:00
  • 79a705c969 fixes #323 Quentin Fuxa 2026-01-16 10:38:07 +01:00
  • 34e4abd455 Merge pull request #322 from eschmidbauer/fix/thread-safety-issues Quentin Fuxa 2026-01-09 19:23:35 +01:00
  • d59ddbaeae Fix critical thread safety issues Emmanuel Schmidbauer 2026-01-09 11:23:19 -05:00
  • 4dd66e7766 Merge pull request #317 from jantonj/fix-bug-diarization-lag Quentin Fuxa 2025-12-19 17:43:07 +01:00
  • 3db5d81a20 update diarization lag after stream analysed Anton Jacobson 2025-12-18 14:13:28 +01:00
  • b67ddea494 bump to 0.2.17 0.2.17 Quentin Fuxa 2025-12-08 23:52:00 +01:00
  • 3192553e20 fixes #307 Quentin Fuxa 2025-12-08 23:52:00 +01:00
  • f379a243fe Merge pull request #274 from blakkd/patch-1 Quentin Fuxa 2025-12-09 10:10:32 +01:00
  • ec09898a9f fixes #301 Quentin Fuxa 2025-12-06 10:19:50 +01:00
  • befbae56c7 minor path change blakkd 2025-11-16 23:47:58 +01:00
  • 719e8b1a20 adapt online for mlx detection Quentin Fuxa 2025-11-25 23:52:00 +01:00
  • f1b47178d8 adapt online for mlx detection Quentin Fuxa 2025-11-25 23:52:00 +01:00
  • 59db08e961 loader for full mlx Quentin Fuxa 2025-11-25 23:52:00 +01:00
  • 6fc20b9562 new dec class Quentin Fuxa 2025-11-21 23:52:00 +01:00
  • fac8659161 uses native mlx function for attention Quentin Fuxa 2025-11-21 23:52:00 +01:00
  • 4d9332ce7d fixes #299 Quentin Fuxa 2025-12-05 17:54:14 +01:00
  • 62444ce746 session parameter required in OnnxWrapper Quentin Fuxa 2025-12-05 15:37:18 +01:00
  • 2431a6bf91 isolated VAD states per user: .onnx: share a stateless model. .jit: require duplicating the model. Quentin Fuxa 2025-12-05 15:27:14 +01:00
  • d1263e7228 Merge pull request #308 from gzz2000/main Quentin Fuxa 2025-12-05 11:34:05 +01:00
  • 30ddd522a4 Fix local agreement backend, removing excess parameter, fixes https://github.com/QuentinFuxa/WhisperLiveKit/issues/295 Zizheng Guo 2025-12-04 16:45:23 +08:00
  • 635bace09e update archi Quentin Fuxa 2025-11-30 18:39:10 +01:00
  • f1113e3eb0 update with LoRA Quentin Fuxa 2025-11-29 18:33:30 +01:00
  • cc5f819ce7 hf weights Quentin Fuxa 2025-11-29 17:50:46 +01:00
  • 82cd24bb75 LoRa path v0 - functional Quentin Fuxa 2025-11-29 17:21:10 +01:00
  • d45c397c6a simulstreaming: limit n tokens to prevent hallucinations Quentin Fuxa 2025-11-28 21:41:19 +01:00
  • 45bf3f57d7 troubleshooting doc for aarch64 systems Quentin Fuxa 2025-11-28 21:40:43 +01:00
  • 1d88ba9d69 Fixes #294. improve model path backend detection and file extraction Quentin Fuxa 2025-11-27 23:14:00 +01:00
  • c0965c6c31 Lines to Segments. Merging dataclasses Quentin Fuxa 2025-11-27 21:54:58 +01:00
  • 34ddd2ac02 update doc Quentin Fuxa 2025-11-25 23:20:00 +01:00
  • 345d781e97 update doc Quentin Fuxa 2025-11-25 23:20:00 +01:00
  • 28cf831701 indicate for context token limits for --max-context-tokens. bump to 0.2.16.dev0 Quentin Fuxa 2025-11-25 23:45:15 +01:00
  • 60c62f8f84 troubleshooting #271 #276 #284 #286 0.2.16 Quentin Fuxa 2025-11-25 23:31:46 +01:00
  • 7faa21f95f alignatt: enable model sharing by removing hooks and centralizing session state. Solves #282 Co-authored-by: Emmanuel Schmidbauer <eschmidbauer@gmail.com> Quentin Fuxa 2025-11-25 23:07:42 +01:00
  • 4e9f951551 correct silences handling when language not auto Quentin Fuxa 2025-10-23 11:20:00 +02:00
  • 870141298c isort Quentin Fuxa 2025-11-23 11:20:00 +01:00
  • 872faa422a correct silences handling when language not auto Quentin Fuxa 2025-10-23 11:20:00 +02:00
  • fc9cb66813 disabling vac is not advised Quentin Fuxa 2025-11-23 11:20:00 +01:00
  • a175d1a327 fixes silence detected but never reported by silero Quentin Fuxa 2025-11-23 11:20:00 +01:00
  • 6206fff118 0.2.15 0.2.15 Quentin Fuxa 2025-11-21 23:52:00 +01:00
  • b5067249c0 stt/diar/nllw alignment: internal rework 5 Quentin Fuxa 2025-11-20 23:52:00 +01:00
  • f4f9831d39 stt/diar/nllw alignment: internal rework 5 Quentin Fuxa 2025-11-20 23:52:00 +01:00
  • 254faaf64c stt/diar/nllw alignment: internal rework 5 Quentin Fuxa 2025-11-20 23:52:00 +01:00