WhisperLiveKit

mirror of https://github.com/QuentinFuxa/WhisperLiveKit.git synced 2026-04-28 09:30:05 +00:00

Author	SHA1	Message	Date
Quentin Fuxa	696a94d1ce	1rst sortformer backend implementation	2025-08-19 15:02:17 +02:00
Quentin Fuxa	2699b0974c	Fix simulstreaming imports	2025-08-19 14:43:54 +02:00
Quentin Fuxa	d0e9e37ef6	simulstreaming: cumulative_time_offset to keep timestamps correct when audio > 30s	2025-08-17 09:33:47 +02:00
Quentin Fuxa	820f92d8cb	audio_max_len to 30 -> 20, ffmpeg timeout 5 -> 20	2025-08-17 09:32:08 +02:00
Quentin Fuxa	e2184d5e06	better handle silences when VAC + correct offset issue with whisperstreaming backend	2025-08-17 01:27:07 +02:00
Quentin Fuxa	7fe0353260	vac model is loaded in TranscriptionEngine, and by default	2025-08-17 00:34:25 +02:00
Quentin Fuxa	0f2eba507e	use with_offset to add no audio offset to tokens	2025-08-17 00:33:24 +02:00
Quentin Fuxa	55e08474f3	recycle backend in simulstreaming thanks to new remove hooks function	2025-08-16 23:06:16 +02:00
Quentin Fuxa	28bdc52e1d	VAC before doing transcription and diarization. V0	2025-08-16 23:04:21 +02:00
Quentin Fuxa	1652db9a2d	Use distinct backend models for simulstreaming and add --preloaded_model_count to preload them	2025-08-15 23:03:55 +02:00
Quentin Fuxa	1c42b867cf	Merge branch 'main' of https://github.com/QuentinFuxa/whisper_streaming_web	2025-08-13 10:04:04 +02:00
Quentin Fuxa	d4771e563e	Increase END_SILENCE_DURATION to reduce false positives	2025-08-13 10:04:00 +02:00
David Gumberg	3b96fb8776	frontend: Scroll down when appending transcription	2025-08-12 17:31:32 -07:00
David Gumberg	7f93c4b978	frontend: Don't let screen sleep when transcribing.	2025-08-12 17:30:57 -07:00
Quentin Fuxa	15c3df1cba	warmup base whisper when using simulstreaming	2025-08-12 18:52:52 +02:00
Quentin Fuxa	728e1f1290	simulstreaming warmup is done for each instance of online, not for the backend	2025-08-12 18:35:04 +02:00
Quentin Fuxa	87b9ed6ecd	nonspeech_prob from 1 to 0.5	2025-08-12 18:34:37 +02:00
Quentin Fuxa	38b4ebe8ba	Handle 3 types of silences: Indicated by whisper, between tokens, and at the end of the input. Display them in the frontend	2025-08-11 17:56:57 +02:00
Quentin Fuxa	d098af3185	each SimulStreamingOnlineProcessor now contains PaddedAlignAttWhisper instance. SimulStreamingASR only contains loaded whisper model	2025-08-11 08:24:14 +02:00
Quentin Fuxa	4e56130a40	frontend supports dark theme	2025-08-11 08:22:23 +02:00
Quentin Fuxa	2bbdc70187	lags are now updated every 0.1s	2025-08-09 23:11:05 +02:00
Quentin Fuxa	b678a55f63	remove duplicate file	2025-08-09 23:10:34 +02:00
Quentin Fuxa	5491964e81	clean SimulStreamingOnlineProcessor initialization + audio processing	2025-08-09 20:16:27 +02:00
Quentin Fuxa	b05297a96d	clean simulwhisper backend and online	2025-08-09 18:02:15 +02:00
Quentin Fuxa	197293e25e	refactor(simulstreaming): extract backend + online module into separate files from whisper streaming	2025-08-08 18:07:51 +02:00
Quentin Fuxa	ba41c4ab56	Remove download_simulstreaming_backend	2025-08-08 18:06:40 +02:00
Quentin Fuxa	4cfed6e98e	in MultiHeadAttention and ResidualAttentionBlock include cache_id for compatibility with simulstreaming code	2025-08-02 13:16:58 +02:00
Quentin Fuxa	687e3dd5e2	update simulstreaming model.py to match the latest version of whisper sources	2025-08-02 13:16:10 +02:00
Quentin Fuxa	8e056cbdf2	Upgrade SimulStreaming Whisper core from version 20230918 to 20250625	2025-08-02 13:06:36 +02:00
Quentin Fuxa	00424d7ca3	latest version of simulstreaming	2025-07-31 16:44:23 +02:00
Quentin Fuxa	4b738d6f63	fix duplicate line	2025-07-31 16:29:35 +02:00
Quentin Fuxa	8a5e2adb1e	simulstreaming: fixes token handling during warm-up phase	2025-07-31 16:25:34 +02:00
Quentin Fuxa	46efbdf1d9	solves https://github.com/QuentinFuxa/WhisperLiveKit/issues/151	2025-07-31 11:42:06 +02:00
Quentin Fuxa	56114d3071	Remove end_attributed_speaker in diarization_online. handled in audio processor	2025-07-16 12:09:43 +02:00
Quentin Fuxa	5b9977c9af	Enhanced use_punctuation_split for diarization. further improvements still needed	2025-07-16 12:06:17 +02:00
Quentin Fuxa	12a544164f	Merge branch 'main' of https://github.com/QuentinFuxa/whisper_streaming_web	2025-07-16 12:05:01 +02:00
Quentin Fuxa	3ad3683ca7	Refactor speaker assignment in DiartDiarization for clarity and punctuation awareness	2025-07-15 14:38:53 +02:00
Quentin Fuxa	1599bd87a0	work on punctuation_split	2025-07-15 12:04:54 +02:00
Quentin Fuxa	90623400a4	Remove automatic downloading of SimulStreaming dependencies on import failure	2025-07-15 12:04:17 +02:00
choomegan	64e44fb24f	fix: logic of adding of pcm_array to diarization_queue	2025-07-15 15:33:41 +08:00
Quentin Fuxa	9ff513093b	simulstreaming uses empty space as separator	2025-07-04 17:03:01 +02:00
Quentin Fuxa	aad2c55d8c	download_simulstreaming_backend.py now downloads files in the correct lib dir	2025-07-03 11:07:28 +02:00
Quentin Fuxa	2f177c4a3b	add __init__.py file to simul_whisper assets directory	2025-07-03 10:41:12 +02:00
Quentin Fuxa	b362eccb23	new command to get simulstreaming backend	2025-07-03 10:24:02 +02:00
Quentin Fuxa	5daaf77258	add download script for SimulStreaming backend	2025-07-03 10:14:45 +02:00
Quentin Fuxa	36cc4412c3	update LICENSE with SimulStreaming dual licensing terms; include in .gitignore additional stuff	2025-07-03 09:21:38 +02:00
Quentin Fuxa	e1d4bf7e94	modify import paths in simul whisper backend so that it works in lib mode	2025-07-01 20:34:47 +02:00
Quentin Fuxa	62bf28949e	compatible with the latest version of simulstreaming	2025-07-01 20:10:45 +02:00
Quentin Fuxa	1e3fab9550	copy non python files from simulstreaming when installing package	2025-07-01 19:14:23 +02:00
Quentin Fuxa	f25de6d8a4	ffmpeg-python is not used anymore - ffmpeg is directly called through create_subprocess_exec	2025-07-01 18:53:35 +02:00

1 2

99 Commits