Quentin Fuxa
bbba1d9bb7
add nllb-backend and translation perf test in dev_notes
2025-09-16 20:45:01 +02:00
Quentin Fuxa
99dc96c644
fixes #224
2025-09-16 18:34:35 +02:00
Quentin Fuxa
cd160caaa1
asyncio.to_thread for transcription and translation
2025-09-15 15:23:22 +02:00
Quentin Fuxa
5aa312e437
simulstreaming warmup is done in whisperlivekit.simul_whisper.backend.load_model, not in warmup_online
2025-09-13 20:19:19 +01:00
notV3NOM
ebaf36a8be
Fix warmup file behavior
2025-09-13 20:44:24 +05:30
Quentin Fuxa
a4e9f3cab7
support for raw PCM input option by @YeonjunNotFR
2025-09-11 21:32:11 +02:00
Quentin Fuxa
b06866877a
add --disable-punctuation-split option
2025-09-11 21:03:00 +02:00
Quentin Fuxa
967cdfebc8
fix Translation imports
2025-09-11 21:03:00 +02:00
Quentin Fuxa
3c11c60126
fix by @treeaaa
2025-09-11 21:03:00 +02:00
Quentin Fuxa
2963e8a757
translate when at least 3 new tokens
2025-09-09 21:45:00 +02:00
Quentin Fuxa
cb2d4ea88a
audio processor lines use now Lines objects instead of dict
2025-09-09 21:45:00 +02:00
Quentin Fuxa
add7ea07ee
translator takes all the tokens from the queue
2025-09-09 19:55:39 +02:00
Quentin Fuxa
3358877054
Fix StorageView conversion for CPU/GPU compatibility
2025-09-09 15:44:16 +02:00
Quentin Fuxa
1f7798c7c1
condition on encoder_feature_ctranslate type
2025-09-09 12:16:52 +02:00
Alexander Lindberg
c7b3bb5e58
Fix regression with faster-whisper encoder_feature
2025-09-09 11:18:55 +03:00
Quentin Fuxa
f661f21675
translation asyncio task
2025-09-08 18:34:31 +02:00
Quentin Fuxa
b6164aa59b
translation device determined with torch.device
2025-09-08 11:34:40 +02:00
Quentin Fuxa
4209d7f7c0
Place all tensors on the same device in sortformer diarization
2025-09-08 10:20:57 +02:00
Quentin Fuxa
334b338ab0
use platform to determine system and recommand mlx whisper
2025-09-07 15:49:11 +02:00
Quentin Fuxa
72f33be6f2
translation: use of get_nllb_code
2025-09-07 15:25:14 +02:00
notV3NOM
a178ed5c22
fix simulstreaming preload model count argument in cli
2025-09-06 18:18:09 +05:30
Quentin Fuxa
fad9ee4d21
Merge pull request #198 from notV3NOM/main
...
Fix scrolling UX with sticky header controls
2025-09-05 20:46:36 +02:00
Quentin Fuxa
d1a9913c47
nllb v0
2025-09-05 18:02:42 +02:00
notV3NOM
e4ca2623cb
Fix scrolling UX with sticky header controls
2025-09-05 21:25:13 +05:30
Quentin Fuxa
9c1bf37960
fixes #197
2025-09-05 16:34:13 +02:00
notV3NOM
abd8f2c269
Fix exponentially growing simulstreaming silence timer
2025-09-04 21:49:07 +05:30
Quentin Fuxa
f3ad4e39e4
torch.Tensor to torch.as_tensor
2025-09-04 16:39:11 +02:00
Quentin Fuxa
e0a5cbf0e7
v0.1.0 chrome extension
2025-09-04 16:36:28 +02:00
Quentin Fuxa
953697cd86
torch.Tensor to torch.as_tensor
2025-09-04 15:25:39 +02:00
Quentin Fuxa
3bd2122eb4
0.2.8 : only the decoder of whisper is loaded in memory when a different encoder is used
2025-09-02 21:12:25 +02:00
Quentin Fuxa
d5008ed828
mlx/fasterWhisper encoders are loaded once and shared in simulstreaming
2025-09-01 12:33:19 +02:00
Quentin Fuxa
d467716e26
add microphone picker
2025-08-31 10:12:52 +02:00
Quentin Fuxa
199e21b3ef
faster-whisper as an optional encoder alternative for simulstreaming
2025-08-30 23:50:16 +02:00
Quentin Fuxa
1d926f2e67
mlx-whisper used as simulstreaming encoder: improve speed for macos systems
2025-08-30 22:19:11 +02:00
Quentin Fuxa
4a71a391b8
get_web_interface_html to get_inline_ui_html for embedded web interface HTML
2025-08-30 13:44:06 +02:00
Quentin Fuxa
1ba171a58d
add embedded web interface HTML (single-file version with inline CSS/JS/SVG)
...
### Added
- `get_inline_ui_html()`: generates a self-contained version of the web interface, with CSS, JS, and SVG assets inlined directly into the HTML. useful for environments where serving static files is inconvenient or when a single-call UI delivery is preferred.
(cherry picked from commit aa44a92a67 )
2025-08-29 22:00:59 +02:00
Quentin Fuxa
4a5d5e1f3b
raise Exception when language == auto and task == translation
2025-08-29 17:44:46 +02:00
Quentin Fuxa
9895bc83bf
auto detection of language for warmup if not indicated
2025-08-27 20:37:48 +02:00
Quentin Fuxa
ab98c31f16
trim will happen before audio processor
2025-08-27 18:17:11 +02:00
Quentin Fuxa
4ed62e181d
when silences are detected, speaker correction is no more applied
2024-08-24 19:24:00 +02:00
Quentin Fuxa
9a8d3cbd90
improve diarization + silence handling
2024-08-24 19:20:00 +02:00
Quentin Fuxa
b101ce06bd
several users share the same sortformer model instance
2024-08-24 19:18:00 +02:00
Quentin Fuxa
c83fd179a8
improves phase shift correction between transcription and diarization
2024-08-24 19:15:00 +02:00
Quentin Fuxa
5258305745
default diarization backend in now sortformer
2025-08-24 18:32:01 +02:00
Quentin Fuxa
ce781831ee
punctuation is checked in audio-processor's result formatter
2025-08-24 18:32:01 +02:00
Quentin Fuxa
58297daf6d
sortformer diar implementation v0.3
2025-08-24 18:32:01 +02:00
Quentin Fuxa
3393a08f7e
sortformer diar implementation v0.2
2025-08-24 18:32:01 +02:00
Quentin Fuxa
26cc1072dd
new dockerfile for cpu only. update dockerfile from cuda 12.8 to 12.9
2025-08-22 11:04:35 +02:00
Quentin Fuxa
909ac9dd41
speaker -1 are no more sent in websocket - no buffer when their is a silence
2025-08-21 14:09:02 +02:00
Quentin Fuxa
d94a07d417
default model is now base. default backend simulstreaming
2025-08-21 11:55:36 +02:00