Quentin Fuxa
|
583a2ec2e4
|
highlight Sortformer optional installation
|
2025-08-27 21:02:25 +02:00 |
|
Quentin Fuxa
|
19765e89e9
|
remove triton <3 condition
|
2025-08-27 20:44:39 +02:00 |
|
Quentin Fuxa
|
9895bc83bf
|
auto detection of language for warmup if not indicated
|
2025-08-27 20:37:48 +02:00 |
|
Quentin Fuxa
|
ab98c31f16
|
trim will happen before audio processor
0.2.7
|
2025-08-27 18:17:11 +02:00 |
|
Quentin Fuxa
|
f9c9c4188a
|
optional dependencies removed, ask to direct alternative package installations
|
2025-08-27 18:15:32 +02:00 |
|
Quentin Fuxa
|
c21d2302e7
|
to 0.2.7
|
2024-08-24 19:28:00 +02:00 |
|
Quentin Fuxa
|
4ed62e181d
|
when silences are detected, speaker correction is no more applied
|
2024-08-24 19:24:00 +02:00 |
|
Quentin Fuxa
|
52a755a08c
|
indications on how to choose a model
|
2024-08-24 19:22:00 +02:00 |
|
Quentin Fuxa
|
9a8d3cbd90
|
improve diarization + silence handling
|
2024-08-24 19:20:00 +02:00 |
|
Quentin Fuxa
|
b101ce06bd
|
several users share the same sortformer model instance
|
2024-08-24 19:18:00 +02:00 |
|
Quentin Fuxa
|
c83fd179a8
|
improves phase shift correction between transcription and diarization
|
2024-08-24 19:15:00 +02:00 |
|
Quentin Fuxa
|
5258305745
|
default diarization backend in now sortformer
|
2025-08-24 18:32:01 +02:00 |
|
Quentin Fuxa
|
ce781831ee
|
punctuation is checked in audio-processor's result formatter
|
2025-08-24 18:32:01 +02:00 |
|
Quentin Fuxa
|
58297daf6d
|
sortformer diar implementation v0.3
|
2025-08-24 18:32:01 +02:00 |
|
Quentin Fuxa
|
3393a08f7e
|
sortformer diar implementation v0.2
|
2025-08-24 18:32:01 +02:00 |
|
Quentin Fuxa
|
5b2ddeccdb
|
correct pip installation error in image build
|
2025-08-22 15:37:46 +02:00 |
|
Quentin Fuxa
|
26cc1072dd
|
new dockerfile for cpu only. update dockerfile from cuda 12.8 to 12.9
|
2025-08-22 11:04:35 +02:00 |
|
Quentin Fuxa
|
12973711f6
|
0.2.6
0.2.6
|
2025-08-21 14:34:46 +02:00 |
|
Quentin Fuxa
|
909ac9dd41
|
speaker -1 are no more sent in websocket - no buffer when their is a silence
|
2025-08-21 14:09:02 +02:00 |
|
Quentin Fuxa
|
d94a07d417
|
default model is now base. default backend simulstreaming
|
2025-08-21 11:55:36 +02:00 |
|
Quentin Fuxa
|
b32dd8bfc4
|
Align backend and frontend time handling
|
2025-08-21 10:33:15 +02:00 |
|
Quentin Fuxa
|
9feb0e597b
|
remove VACOnlineASRProcessor backend possibility
|
2025-08-20 20:57:43 +02:00 |
|
Quentin Fuxa
|
9dab84a573
|
update front
|
2025-08-20 20:15:38 +02:00 |
|
Quentin Fuxa
|
d089c7fce0
|
.html to .html + .css + .js
|
2025-08-20 20:00:31 +02:00 |
|
Quentin Fuxa
|
253a080df5
|
diart diarization handles pauses/silences thanks to offset
|
2025-08-19 21:12:55 +02:00 |
|
Quentin Fuxa
|
0c6e4b2aee
|
sortformer diar implementation v0.1
|
2025-08-19 19:48:51 +02:00 |
|
Quentin Fuxa
|
e14bbde77d
|
sortformer diar implementation v0
|
2025-08-19 17:02:55 +02:00 |
|
Quentin Fuxa
|
7496163467
|
rename diart backend
|
2025-08-19 15:02:27 +02:00 |
|
Quentin Fuxa
|
696a94d1ce
|
1rst sortformer backend implementation
|
2025-08-19 15:02:17 +02:00 |
|
Quentin Fuxa
|
2699b0974c
|
Fix simulstreaming imports
|
2025-08-19 14:43:54 +02:00 |
|
Quentin Fuxa
|
90c0250ba4
|
update optional dependencies
|
2025-08-19 09:36:59 +02:00 |
|
Quentin Fuxa
|
eb96153ffd
|
new vac parameters
|
2025-08-17 22:26:28 +02:00 |
|
Quentin Fuxa
|
47e3eb9b5b
|
Update README.md
|
2025-08-17 09:55:03 +02:00 |
|
Quentin Fuxa
|
b8b07adeef
|
--vac to --no-vac
|
2025-08-17 09:44:26 +02:00 |
|
Quentin Fuxa
|
d0e9e37ef6
|
simulstreaming: cumulative_time_offset to keep timestamps correct when audio > 30s
|
2025-08-17 09:33:47 +02:00 |
|
Quentin Fuxa
|
820f92d8cb
|
audio_max_len to 30 -> 20, ffmpeg timeout 5 -> 20
|
2025-08-17 09:32:08 +02:00 |
|
Quentin Fuxa
|
e42523af84
|
VAC activated by default
|
2025-08-17 01:29:34 +02:00 |
|
Quentin Fuxa
|
e2184d5e06
|
better handle silences when VAC + correct offset issue with whisperstreaming backend
|
2025-08-17 01:27:07 +02:00 |
|
Quentin Fuxa
|
7fe0353260
|
vac model is loaded in TranscriptionEngine, and by default
|
2025-08-17 00:34:25 +02:00 |
|
Quentin Fuxa
|
0f2eba507e
|
use with_offset to add no audio offset to tokens
|
2025-08-17 00:33:24 +02:00 |
|
Quentin Fuxa
|
55e08474f3
|
recycle backend in simulstreaming thanks to new remove hooks function
|
2025-08-16 23:06:16 +02:00 |
|
Quentin Fuxa
|
28bdc52e1d
|
VAC before doing transcription and diarization. V0
|
2025-08-16 23:04:21 +02:00 |
|
Quentin Fuxa
|
e4221fa6c3
|
Merge branch 'main' of https://github.com/QuentinFuxa/whisper_streaming_web
|
2025-08-15 23:04:05 +02:00 |
|
Quentin Fuxa
|
1652db9a2d
|
Use distinct backend models for simulstreaming and add --preloaded_model_count to preload them
|
2025-08-15 23:03:55 +02:00 |
|
Quentin Fuxa
|
601f17653a
|
Update CONTRIBUTING.md
|
2025-08-13 21:59:32 +02:00 |
|
Quentin Fuxa
|
7718190fcd
|
Update CONTRIBUTING.md
|
2025-08-13 21:59:00 +02:00 |
|
Quentin Fuxa
|
349c7dcb9e
|
bump version ro 0.2.5
0.2.5
|
2025-08-13 10:04:31 +02:00 |
|
Quentin Fuxa
|
1c42b867cf
|
Merge branch 'main' of https://github.com/QuentinFuxa/whisper_streaming_web
|
2025-08-13 10:04:04 +02:00 |
|
Quentin Fuxa
|
d4771e563e
|
Increase END_SILENCE_DURATION to reduce false positives
|
2025-08-13 10:04:00 +02:00 |
|
Quentin Fuxa
|
b0a5fc0693
|
Merge pull request #155 from davidgumberg/keepawakescrolldown
frontend: Keep screen awake and scroll down when transcribing.
|
2025-08-13 10:02:52 +02:00 |
|