Commit Graph

47 Commits

Author SHA1 Message Date
Alex Young
97a4ebdf15 Construct an explicit logger rather than using the root logger 2024-04-17 21:58:24 +01:00
Alex Young
2ba48bcbf4 Merge branch 'main' into ayo-logging-fixes 2024-04-17 20:47:55 +01:00
Dominik Macháček
bb93952fd2 Merge branch 'main' into online-from-factory 2024-04-17 15:07:00 +02:00
Dominik Macháček
e0f5d42b13 better documentation, help message and logging prints 2024-04-17 14:51:49 +02:00
Alex Young
2afc97db48 Set the log level inside faster-whisper again (lost in merge) 2024-04-14 20:16:28 +01:00
Alex Young
a7cb7a5469 Merge branch 'main' into ayo-logging-fixes 2024-04-14 20:09:56 +01:00
Alex Young
380c30d48d Further tidying of print output, so by default there's little on the console 2024-04-14 19:14:56 +01:00
Tijs Zwinkels
bccbb15177 Move creation of OnlineASRProcessor inside the factory method
Preventing more code duplication between whisper_online.py and whisper_online_server.py
2024-03-20 16:29:01 +01:00
Tijs Zwinkels
006de3e7b0 Fix imports
Now, the ASR implementations do their own imports. No need to import in the factory
2024-03-20 16:02:24 +01:00
Tijs Zwinkels
8896389ea3 Fix crash when using openai-api with whisper_online_server
+ refactored creation of the ASR into a factory method
2024-03-20 15:29:10 +01:00
koiking213
4405c451ce specify dtype for librosa.load, instead of cast 2024-02-20 23:29:25 +09:00
koiking213
24926c98e0 specify audio dtype 2024-02-20 22:46:04 +09:00
Dominik Macháček
db8b7d2883 removed unused variable 2024-02-20 14:37:18 +01:00
Aleksei Scripnic
80eb0baf5d Removed duplicate variable self.last_chunked_at
I tried to find the difference between self.last_chunked_at and self.buffer_time_offset, and it took me a while to understand that they are exactly the same. I think it's better to get rid of one of the duplicates to make the code more readable.
2024-02-20 14:37:18 +01:00
Dominik Macháček
949304ab05 Merge branch 'opeanai-api2' into opeanai-api 2024-02-19 13:51:26 +01:00
Tijs Zwinkels
9fcd403439 Use automatic language detection by default (instead of English) 2024-02-15 22:24:43 +01:00
Tijs Zwinkels
922ad18ebc Make OpenAI backend work with language autodetect 2024-02-14 17:29:45 +01:00
Tijs Zwinkels
f0a24cd5e1 Make --vad work with --backend openai-api 2024-02-14 17:01:29 +01:00
Tijs Zwinkels
3696fef2b1 Use OpenAI api word-level timestamps 2024-02-14 17:01:29 +01:00
Tijs Zwinkels
531418ad07 Interpolate word timestamps based on word character length 2024-02-14 17:01:29 +01:00
Dominik Macháček
2270014219 fixes 2024-02-14 17:01:29 +01:00
Dominik Macháček
f8b2ae07b8 missing features in openai-api, PR #52 2024-02-14 17:01:29 +01:00
Tijs Zwinkels
f412812082 OpenAI Whisper API backend 2024-02-14 17:01:29 +01:00
Dominik Macháček
cd221a3198 auto language detection #56 2024-02-06 14:29:30 +01:00
Dominik Macháček
d65fd8a649 fixes 2024-01-25 17:53:07 +01:00
Dominik Macháček
50f1b94856 missing features in openai-api, PR #52 2024-01-25 16:50:02 +01:00
Tijs Zwinkels
c30969fe27 OpenAI Whisper API backend 2024-01-25 10:21:33 +01:00
Dominik Macháček
1f2352fa1d README typo and one more simulation option is not shared 2024-01-03 12:52:44 +01:00
Aleksei Scripnic
234ac8f5e8 Samples should be an integer, not seconds
I believe it's just a typo
2024-01-02 14:40:22 +00:00
Dominik Macháček
aa51e39de4 buffer trimming option, sent. segmenter not required anymore
- both for whisper_online + server
- removed argparse code repetition
- README updated
2024-01-02 14:56:30 +01:00
Dominik Macháček
ef08538697 buffer trimming options + most recommendable default
evaluated on ESIC dev2, 27 docs
2024-01-02 12:06:29 +01:00
Dominik Macháček
99aef35958 Merge pull request #36 from luweigen/bug-chunk_completed_sentence
fix bug of completed sentence chunking. tested on faster-whisper in e…
2023-12-19 13:39:37 +01:00
J.P Lee
2b98af7b19 Fix: Omitting the last chunk problem in comp_unaware mode 2023-12-07 17:00:38 +09:00
Wei Lu
a60c64c831 fix bug of completed sentence chunking. tested on faster-whisper in en language 2023-11-28 18:51:36 +02:00
Dominik Macháček
8f32dea5ca logfile reviewed, whisper_timestamped loading module and vad
PR #10, issues #9, #30
2023-11-28 12:16:20 +01:00
Luca
18c1434f77 backend import in child load_model method and expose logfile arg 2023-11-03 11:33:03 +01:00
Luca
f97a253273 Merge branch 'ufal:main' into main 2023-11-03 11:03:54 +01:00
Dominik Macháček
4a51e13199 segmenters for all Whisper languages 2023-09-27 23:29:50 +02:00
Luca
6e6b619257 add option to save log to file 2023-09-06 15:19:12 +02:00
Luca
c0dd2e2db9 import backend from __init__ 2023-09-06 12:39:26 +02:00
Dominik Macháček
2625be10b7 Ukrainian tokenizer support 2023-06-05 17:29:26 +02:00
Dominik Macháček
33369a9555 import WhisperModel 2023-05-25 14:28:18 +02:00
Dominik Macháček
a1ba5e6c3a updates:
- fix errors
- module documented
2023-05-18 17:10:42 +02:00
Dominik Macháček
88dc796b8a model_dir, vad and other updates 2023-05-16 17:39:44 +02:00
Dominik Macháček
b1878ce772 offline option 2023-04-20 16:53:45 +02:00
Dominik Macháček
8116b21b4e faster-whisper support 2023-04-19 15:31:35 +02:00
Dominik Macháček
37fc0f38ca whisper online 2023-04-05 11:55:24 +02:00