Commit Graph

170 Commits

Author SHA1 Message Date
Quentin Fuxa
fba37eba0a move to src 2025-01-19 21:17:55 +01:00
Quentin Fuxa
5523b51fd7 first speaker is "0" no more None 2025-01-19 19:40:09 +01:00
Quentin Fuxa
9bdb92e923 update demo.png 2025-01-19 19:36:10 +01:00
Quentin Fuxa
b51c8427f4 diart link added 2025-01-19 17:12:55 +01:00
Quentin Fuxa
977436622a add diarization (beta). Disabled by default 2025-01-19 17:12:40 +01:00
Quentin Fuxa
ce56264241 split whisper_online.py into smaller files 2025-01-14 20:52:53 +01:00
Quentin Fuxa
9cbac96c44 del online once webstreaming is finished 2025-01-14 20:20:22 +01:00
Quentin Fuxa
3f30d3de6e Merge branch 'main' of https://github.com/QuentinFuxa/whisper_streaming_web 2025-01-14 20:14:22 +01:00
Quentin Fuxa
f884d1162d warning when transcribe_kargs are used with MLX Whisper 2025-01-14 20:14:16 +01:00
Quentin Fuxa
6ee91c3c93 Merge pull request #15 from in-c0/patch-1
Specify encoding to ensure Python reads file as UTF-8
2025-01-13 20:30:51 +01:00
Ava
f52a5ae3c2 specify encoding to ensure Python reads file as UTF-8
Executing `python whisper_fastapi_online_server.py --host 0.0.0.0 --port 8000` resulted in an error on my setup:

```
whisper_streaming_web\whisper_fastapi_online_server.py, line 47, in <module>
    html = f.read()
           ^^^^^^^^
  File "C:\Python312\Lib\encodings\cp1252.py", line 23, in decode
    return codecs.charmap_decode(input,self.errors,decoding_table)[0]
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
UnicodeDecodeError: 'charmap' codec can't decode byte 0x8f in position 1818: character maps to <undefined>
```

On Windows, Python's `open()` defaults to the locale encoding (often `cp1252`), which may not match the encoding of the file being read.
A file saved as UTF-8 that contains non-ASCII characters can trigger this error when it is read without specifying the correct encoding.
2025-01-13 23:12:38 +11:00
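The fix described in commit f52a5ae3c2 amounts to passing an explicit `encoding` to `open()`. The following is a minimal sketch of that idea, not the actual change to `whisper_fastapi_online_server.py`: the helper `read_html` and the file name `page.html` are hypothetical, and the sample character `Ï` is chosen only because its UTF-8 encoding contains the byte `0x8f`, which `cp1252` cannot decode.

```python
import tempfile
from pathlib import Path


def read_html(path):
    """Read a text file as UTF-8 regardless of the platform's default encoding."""
    # Hypothetical helper: the explicit encoding avoids falling back to the
    # locale encoding (often cp1252 on Windows).
    with open(path, "r", encoding="utf-8") as f:
        return f.read()


with tempfile.TemporaryDirectory() as tmp:
    page = Path(tmp) / "page.html"  # hypothetical file name
    # "Ï" (U+00CF) is stored in UTF-8 as the bytes C3 8F.
    page.write_bytes("<p>Ï</p>".encode("utf-8"))

    # Reading with an explicit UTF-8 encoding succeeds on every platform.
    html = read_html(page)

    # Forcing cp1252 reproduces the failure from the traceback above,
    # because byte 0x8f has no mapping in that code page.
    try:
        page.read_text(encoding="cp1252")
        cp1252_failed = False
    except UnicodeDecodeError:
        cp1252_failed = True
```

Being explicit at each `open()` call keeps the behavior identical across operating systems without depending on environment settings.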
Quentin Fuxa
0ff6067f37 Update README.md 2025-01-04 00:55:12 +01:00
Quentin Fuxa
da6c8d25e4 Update README.md 2025-01-03 14:54:29 +01:00
Quentin Fuxa
aa0ba598f0 no online conflict when multiple users 2025-01-03 14:48:45 +01:00
Quentin Fuxa
b7a2d23a18 if websocket connection fails, frontend does not allow recording 2024-12-31 11:17:41 +01:00
Quentin Fuxa
58e48bb717 Merge pull request #10 from SilasK/main
More flexibility by using custom tokenize_method + black
2024-12-31 10:33:47 +01:00
silask
6a04ddbed2 only print translated text not timestamps 2024-12-30 21:53:33 +01:00
silask
aa4d2599cc fix #7 2024-12-30 21:53:33 +01:00
silask
5fdb08edae black formatting 2024-12-30 21:53:33 +01:00
Quentin Fuxa
4cb3660666 Update README.md 2024-12-30 20:46:36 +01:00
Quentin Fuxa
122368bff3 Append full transcription in websocket processing 2024-12-30 15:21:00 +01:00
Quentin Fuxa
0d833eaea2 Merge branch 'main' of https://github.com/QuentinFuxa/whisper_streaming_web 2024-12-28 18:32:36 +01:00
Quentin Fuxa
c960d1571d Batch unprocessed audio to reduce Whisper streaming calls 2024-12-28 18:32:27 +01:00
Quentin Fuxa
1aa1b9ea99 Update README.md : ffmpeg to ffmpeg-python 2024-12-28 09:15:09 +01:00
Quentin Fuxa
99019f1dd7 Merge branch 'main' of https://github.com/QuentinFuxa/whisper_streaming_web 2024-12-24 19:36:36 +01:00
Quentin Fuxa
1cea20a42d /ws to /asr to distinguish protocol ws:// from endpoint 2024-12-24 19:36:20 +01:00
Quentin Fuxa
50bbd26517 throw errors if websocket connection fails 2024-12-24 19:31:08 +01:00
Quentin Fuxa
cf5d1cf013 Update README.md 2024-12-19 18:57:15 +01:00
Quentin Fuxa
0553b75415 unfork project, indicate files from whisper streaming 2024-12-19 12:01:07 +01:00
Quentin Fuxa
baa01728be Merge branch 'whisper-mlx' 2024-12-19 11:14:48 +01:00
Quentin Fuxa
8dcebd9329 add translate_model_name function 2024-12-19 11:10:02 +01:00
Quentin Fuxa
bfe973a0d2 Merge branch 'whisper-mlx' 2024-12-19 10:48:25 +01:00
Quentin Fuxa
87cab7c280 add whisper mlx backend 2024-12-19 10:47:46 +01:00
Quentin Fuxa
bee27c68e6 better buffer management 2024-12-19 10:19:24 +01:00
Quentin Fuxa
aa4480b138 update frontend 2024-12-19 10:19:11 +01:00
Quentin Fuxa
cc92e97e17 Update README.md 2024-12-19 08:38:30 +01:00
Quentin Fuxa
8c6c0104a3 Update README.md 2024-12-19 00:04:23 +01:00
Quentin Fuxa
494b6e3ca9 Merge branch 'main' of https://github.com/QuentinFuxa/whisper_streaming 2024-12-16 13:40:51 +01:00
Quentin Fuxa
d045137ba8 smaller screenshot 2024-12-16 13:39:17 +01:00
Quentin Fuxa
54a37fbcb6 Merge pull request #1 from QuentinFuxa/fast-api-web-interface-with-buffer
add fastapi server with live webm to pcm conversion and web page show…
2024-12-16 13:32:33 +01:00
Quentin Fuxa
104f7bde03 add fastapi server with live webm to pcm conversion and web page showing both complete transcription and partial transcription 2024-12-16 13:24:27 +01:00
Dominik Macháček
e6648e4f46 fixed silero vad chunk size
issues #141 #121 #142 #136 etc.
2024-11-28 18:13:49 +01:00
Dominik Macháček
863242f107 Merge pull request #139 from dariopellegrino00/main
large-v3-turbo is now supported
2024-11-22 16:24:49 +01:00
Dario Pellegrino
d48895c343 large-v3-turbo compatible 2024-11-22 00:16:22 +01:00
Dominik Macháček
8cfd8d85a3 Merge branch 'main' of github.com:promet99/whisper_streaming into promet99-main 2024-11-15 13:53:57 +01:00
Dominik Macháček
e1b0e146a5 lru_cache didn't work with Python 3.6.9, openai api needs py version 2024-11-15 13:53:01 +01:00
promet99
e3dc524783 fix: update openapi code to match updated return type 2024-11-10 20:02:25 +09:00
Dominik Macháček
2de090023c fixed VADIterator 2024-10-16 11:14:37 +02:00
Dominik Macháček
e25ad4fcd7 fixed
issue #116
2024-10-04 17:39:20 +02:00
Dominik Macháček
63870987c0 FixedSileroVADIterator to support other than 512-sized chunks with v5
issue #116
2024-10-04 17:14:55 +02:00