285 Commits

Author SHA1 Message Date
Quentin Fuxa
e61d1d111f script to lib 0.0.1 2025-03-19 15:10:05 +01:00
Quentin Fuxa
5624c1f6b7 Refactor import statement for AudioProcessor and update cleanup method to be awaited; remove unused formatters and state management files 2025-03-19 11:18:12 +01:00
Quentin Fuxa
7679370cf6 Refactor AudioProcessor methods for improved async handling and WebSocket integration 2025-03-19 10:59:50 +01:00
Quentin Fuxa
5ca65e21b7 Refactor DiartDiarization initialization and streamline WebSocket audio processing 2025-03-19 10:33:22 +01:00
Quentin Fuxa
dc02bcdbdd refacto 0 2025-03-18 18:31:23 +01:00
Quentin Fuxa
4f87ac3ea4 Refactor PCM conversion to a dedicated function; immediate chunk addition to the diarization queue 2025-03-17 11:46:45 +01:00
Quentin Fuxa
eead544977 Update README.md 2025-03-15 17:28:24 +01:00
Quentin Fuxa
f4a57cd810 Merge pull request #85 from SilasK/warm-up
add warmup ASR, with default file being https://github.com/ggerganov/whisper.cpp/raw/master/samples/jfk.wav
2025-03-14 11:43:24 +01:00
Quentin Fuxa
b768b219fe Warmup functionality: add timeout option (for VM not connected to internet); False option to disable warmup 2025-03-14 11:41:18 +01:00
Quentin Fuxa
2fb386f94c Create CONTRIBUTING.md 2025-03-14 11:10:32 +01:00
Silas Kieser
cb5cf39336 fix #84 2025-03-13 15:03:16 +01:00
Quentin Fuxa
3024a9bdb2 Diarization : Uses a rx observer instead of diart attach_hooks method 2025-03-13 12:02:18 +01:00
Quentin Fuxa
7b582f3f9f change default model to tiny, and vad activated by default 2025-03-13 12:01:08 +01:00
Quentin Fuxa
8ae38a48ef Update README.md 2025-03-05 18:18:38 +01:00
Quentin Fuxa
fc3ffada59 recording duration & waveform added 2025-03-05 18:13:37 +01:00
Quentin Fuxa
e3550ef07d use confidence scores returned by whisper to immediately validate tokens 2025-03-03 12:08:56 +01:00
Quentin Fuxa
b502c8c81d update font 2025-03-03 11:48:25 +01:00
Quentin Fuxa
b37d3cafb3 ffmpeg timout from 5 to 15s; diarization lag does not stay = 0 at the beginning 2025-03-03 10:34:32 +01:00
Quentin Fuxa
d304011aac update demo 2025-03-03 10:33:20 +01:00
Quentin Fuxa
597772c6c5 clean html 2025-03-03 10:27:07 +01:00
Quentin Fuxa
a656ccae72 Update README.md 2025-03-03 09:37:14 +01:00
Quentin Fuxa
e910873312 Merge branch 'main' of https://github.com/QuentinFuxa/whisper_streaming_web 2025-03-01 15:53:43 +01:00
Quentin Fuxa
2a869cd509 solve https://github.com/QuentinFuxa/whisper_streaming_web/issues/60#issuecomment-2692191781 2025-03-01 15:53:35 +01:00
Quentin Fuxa
d053bac871 huggingface steps added. Solve https://github.com/QuentinFuxa/whisper_streaming_web/issues/59 & https://github.com/QuentinFuxa/whisper_streaming_web/issues/60 2025-03-01 11:52:09 +01:00
Quentin Fuxa
e486ef8d98 Delete src directory 2025-02-28 19:09:15 +01:00
Quentin Fuxa
0a1fb08371 src/web to web 2025-02-28 18:50:00 +01:00
Quentin Fuxa
ddb8860528 move files 2025-02-28 18:49:45 +01:00
Quentin Fuxa
2e19516b3e split backends and online asr files 2025-02-28 18:49:31 +01:00
Quentin Fuxa
3c7bc6f472 add coming soon 2025-02-28 18:48:48 +01:00
Quentin Fuxa
2d2a4967e6 update import paths 2025-02-28 18:41:12 +01:00
Quentin Fuxa
7e880e039e time objects is now used by DiartDiarization class 2025-02-28 18:40:42 +01:00
Quentin Fuxa
627386a8a4 silero vad 2025-02-28 18:39:46 +01:00
Quentin Fuxa
14af47e84b undiarized text is assigned to last speaker, with buffer information; traceback is used to format_exc errors 2025-02-28 18:11:36 +01:00
Quentin Fuxa
00eb4a0a4f UI updated when diarization is disactivated 2025-02-28 18:10:00 +01:00
Quentin Fuxa
2f87e592e0 Merge branch 'main' of https://github.com/QuentinFuxa/whisper_streaming_web 2025-02-28 15:44:19 +01:00
Quentin Fuxa
56717b094f DiartDiarization now uses SpeakerSegment 2025-02-28 15:44:09 +01:00
Quentin Fuxa
7b1c88589e // execution for diarization and transcription 2025-02-28 15:43:46 +01:00
Quentin Fuxa
72ce8d0e3f update frontend for new variables 2025-02-28 15:40:56 +01:00
Quentin Fuxa
09090aa3f5 Add files via upload 2025-02-28 11:49:32 +01:00
Quentin Fuxa
d3960ffef9 Add files via upload 2025-02-28 11:44:29 +01:00
Quentin Fuxa
247582fb33 SpeakerSegment class for future diart improvements 2025-02-26 21:28:22 +01:00
Quentin Fuxa
091d5d7bf5 new buffer format 2025-02-26 21:27:39 +01:00
Quentin Fuxa
9d5d6d8031 UI improvements 2025-02-26 21:26:24 +01:00
Quentin Fuxa
8aa3c760c7 Add files via upload 2025-02-24 09:27:47 +01:00
Quentin Fuxa
f925ef3786 Merge pull request #57 from QuentinFuxa/diart_integration_improvements
diarization now works at word - not chunk - level!
2025-02-24 00:38:30 +01:00
Quentin Fuxa
2ced4fef20 diarization now works at word - not chunk - level! 2025-02-24 00:35:42 +01:00
Quentin Fuxa
5b9b9328e0 Merge pull request #56 from QuentinFuxa/diart_integration_improvements
Improve sentence tokenization handling - MosesSentenceSplitter now wo…
2025-02-23 23:42:37 +01:00
Quentin Fuxa
d89622b9c2 Improve sentence tokenization handling - MosesSentenceSplitter now works with list input 2025-02-23 23:41:15 +01:00
Quentin Fuxa
d4096e7e11 Merge pull request #55 from QuentinFuxa/diart_integration_improvements
Diart integration improvements : Correct bugs
2025-02-23 23:16:10 +01:00
Quentin Fuxa
296327071d Enhance diarization logic to improve speaker attribution : corrects several bugs 2025-02-23 23:14:10 +01:00