Quentin Fuxa
|
b8b07adeef
|
--vac to --no-vac
|
2025-08-17 09:44:26 +02:00 |
|
Quentin Fuxa
|
d0e9e37ef6
|
simulstreaming: cumulative_time_offset to keep timestamps correct when audio > 30s
|
2025-08-17 09:33:47 +02:00 |
|
Quentin Fuxa
|
820f92d8cb
|
audio_max_len 30 -> 20, ffmpeg timeout 5 -> 20
|
2025-08-17 09:32:08 +02:00 |
|
Quentin Fuxa
|
e42523af84
|
VAC activated by default
|
2025-08-17 01:29:34 +02:00 |
|
Quentin Fuxa
|
e2184d5e06
|
better handle silences when VAC + correct offset issue with whisperstreaming backend
|
2025-08-17 01:27:07 +02:00 |
|
Quentin Fuxa
|
7fe0353260
|
vac model is loaded in TranscriptionEngine, and by default
|
2025-08-17 00:34:25 +02:00 |
|
Quentin Fuxa
|
0f2eba507e
|
use with_offset to add no-audio offset to tokens
|
2025-08-17 00:33:24 +02:00 |
|
Quentin Fuxa
|
55e08474f3
|
recycle backend in simulstreaming thanks to new remove hooks function
|
2025-08-16 23:06:16 +02:00 |
|
Quentin Fuxa
|
28bdc52e1d
|
VAC before doing transcription and diarization. V0
|
2025-08-16 23:04:21 +02:00 |
|
Quentin Fuxa
|
e4221fa6c3
|
Merge branch 'main' of https://github.com/QuentinFuxa/whisper_streaming_web
|
2025-08-15 23:04:05 +02:00 |
|
Quentin Fuxa
|
1652db9a2d
|
Use distinct backend models for simulstreaming and add --preloaded_model_count to preload them
|
2025-08-15 23:03:55 +02:00 |
|
Quentin Fuxa
|
601f17653a
|
Update CONTRIBUTING.md
|
2025-08-13 21:59:32 +02:00 |
|
Quentin Fuxa
|
7718190fcd
|
Update CONTRIBUTING.md
|
2025-08-13 21:59:00 +02:00 |
|
Quentin Fuxa
|
349c7dcb9e
|
bump version to 0.2.5
0.2.5
|
2025-08-13 10:04:31 +02:00 |
|
Quentin Fuxa
|
1c42b867cf
|
Merge branch 'main' of https://github.com/QuentinFuxa/whisper_streaming_web
|
2025-08-13 10:04:04 +02:00 |
|
Quentin Fuxa
|
d4771e563e
|
Increase END_SILENCE_DURATION to reduce false positives
|
2025-08-13 10:04:00 +02:00 |
|
Quentin Fuxa
|
b0a5fc0693
|
Merge pull request #155 from davidgumberg/keepawakescrolldown
frontend: Keep screen awake and scroll down when transcribing.
|
2025-08-13 10:02:52 +02:00 |
|
David Gumberg
|
3b96fb8776
|
frontend: Scroll down when appending transcription
|
2025-08-12 17:31:32 -07:00 |
|
David Gumberg
|
7f93c4b978
|
frontend: Don't let screen sleep when transcribing.
|
2025-08-12 17:30:57 -07:00 |
|
Quentin Fuxa
|
15c3df1cba
|
warmup base whisper when using simulstreaming
|
2025-08-12 18:52:52 +02:00 |
|
Quentin Fuxa
|
7fb8e66c01
|
typo
|
2025-08-12 18:36:32 +02:00 |
|
Quentin Fuxa
|
728e1f1290
|
simulstreaming warmup is done for each instance of online, not for the backend
|
2025-08-12 18:35:04 +02:00 |
|
Quentin Fuxa
|
87b9ed6ecd
|
nonspeech_prob from 1 to 0.5
|
2025-08-12 18:34:37 +02:00 |
|
Quentin Fuxa
|
38b4ebe8ba
|
Handle 3 types of silences: Indicated by whisper, between tokens, and at the end of the input. Display them in the frontend
|
2025-08-11 17:56:57 +02:00 |
|
Quentin Fuxa
|
d098af3185
|
each SimulStreamingOnlineProcessor now contains a PaddedAlignAttWhisper instance. SimulStreamingASR only contains the loaded whisper model
|
2025-08-11 08:24:14 +02:00 |
|
Quentin Fuxa
|
4e56130a40
|
frontend supports dark theme
|
2025-08-11 08:22:23 +02:00 |
|
Quentin Fuxa
|
2bbdc70187
|
lags are now updated every 0.1s
|
2025-08-09 23:11:05 +02:00 |
|
Quentin Fuxa
|
b678a55f63
|
remove duplicate file
|
2025-08-09 23:10:34 +02:00 |
|
Quentin Fuxa
|
5491964e81
|
clean SimulStreamingOnlineProcessor initialization + audio processing
|
2025-08-09 20:16:27 +02:00 |
|
Quentin Fuxa
|
b05297a96d
|
clean simulwhisper backend and online
|
2025-08-09 18:02:15 +02:00 |
|
Quentin Fuxa
|
197293e25e
|
refactor(simulstreaming): extract backend + online module into separate files from whisper streaming
|
2025-08-08 18:07:51 +02:00 |
|
Quentin Fuxa
|
ba41c4ab56
|
Remove download_simulstreaming_backend
|
2025-08-08 18:06:40 +02:00 |
|
Quentin Fuxa
|
bda72b8bc0
|
setup.py to pyproject.toml. Remove <2.0.0 condition on numpy dep
|
2025-08-03 16:32:31 +02:00 |
|
Quentin Fuxa
|
bb6b9f4cb1
|
architecture diagram: available backends for whisper streaming & diarization
|
2025-08-03 12:25:36 +02:00 |
|
Quentin Fuxa
|
e40b5a3ea0
|
Update architecture diagram
0.2.4.dev0
|
2025-08-02 13:51:15 +02:00 |
|
Quentin Fuxa
|
4cfed6e98e
|
in MultiHeadAttention and ResidualAttentionBlock include cache_id for compatibility with simulstreaming code
|
2025-08-02 13:16:58 +02:00 |
|
Quentin Fuxa
|
687e3dd5e2
|
update simulstreaming model.py to match the latest version of whisper sources
|
2025-08-02 13:16:10 +02:00 |
|
Quentin Fuxa
|
e4140cd299
|
Update Dockerfile to install build-essential and update PyTorch version
|
2025-08-02 13:08:43 +02:00 |
|
Quentin Fuxa
|
8e056cbdf2
|
Upgrade SimulStreaming Whisper core from version 20230918 to 20250625
|
2025-08-02 13:06:36 +02:00 |
|
Quentin Fuxa
|
9dcfb38967
|
Update README.md
|
2025-08-01 18:02:11 +02:00 |
|
Quentin Fuxa
|
47b9235d70
|
Update README.md
|
2025-08-01 17:55:40 +02:00 |
|
Quentin Fuxa
|
f3cd53a4db
|
Update README.md
|
2025-08-01 16:53:22 +02:00 |
|
Quentin Fuxa
|
dbdb4ea66c
|
Update README.md
|
2025-08-01 16:33:26 +02:00 |
|
Quentin Fuxa
|
00424d7ca3
|
latest version of simulstreaming
|
2025-07-31 16:44:23 +02:00 |
|
Quentin Fuxa
|
4b738d6f63
|
fix duplicate line
|
2025-07-31 16:29:35 +02:00 |
|
Quentin Fuxa
|
8a5e2adb1e
|
simulstreaming: fixes token handling during warm-up phase
|
2025-07-31 16:25:34 +02:00 |
|
Quentin Fuxa
|
f85329e112
|
Merge branch 'main' of https://github.com/QuentinFuxa/whisper_streaming_web
|
2025-07-31 11:42:16 +02:00 |
|
Quentin Fuxa
|
46efbdf1d9
|
solves https://github.com/QuentinFuxa/WhisperLiveKit/issues/151
|
2025-07-31 11:42:06 +02:00 |
|
Quentin Fuxa
|
8885ade003
|
Merge pull request #153 from luisla-rivas/main
Fix README.md to view correctly Deployment Guide info
|
2025-07-31 07:10:35 +02:00 |
|
luisla-rivas
|
2564928d83
|
Fix README.md to view correctly Deployment Guide info
|
2025-07-30 14:11:19 +02:00 |
|