Quentin Fuxa
|
d9a4c8dcb2
|
Refactor transcription and diarization handling with token-by-token validation. Introduce segment buffers for ephemeral content and update API to return structured segment data. Enhance silence handling and improve web interface for text transcripts.
|
2025-11-30 16:39:27 +01:00 |
|
Quentin Fuxa
|
4fb735a784
|
new token treatment only iar
|
2025-11-30 15:16:36 +01:00 |
|
Quentin Fuxa
|
d2f998cb7e
|
val
|
2025-11-30 14:37:37 +01:00 |
|
Quentin Fuxa
|
82cd24bb75
|
LoRa path v0 - functional
|
2025-11-29 17:21:10 +01:00 |
|
Quentin Fuxa
|
d45c397c6a
|
simulstreaming: limit n tokens to prevent hallucinations
|
2025-11-28 21:41:19 +01:00 |
|
Quentin Fuxa
|
1d88ba9d69
|
Fixes #294. improve model path backend detection and file extraction
|
2025-11-27 23:14:00 +01:00 |
|
Quentin Fuxa
|
c0965c6c31
|
Lines to Segments. Merging dataclasses
|
2025-11-27 21:54:58 +01:00 |
|
Quentin Fuxa
|
7faa21f95f
|
alignatt: enable model sharing by removing hooks and centralizing session state. Solves #282
Co-authored-by: Emmanuel Schmidbauer <eschmidbauer@gmail.com>
|
2025-11-25 23:07:42 +01:00 |
|
Quentin Fuxa
|
4e9f951551
|
correct silences handling when language not auto
|
2025-11-20 11:20:00 +01:00 |
|
Quentin Fuxa
|
870141298c
|
isort
|
2025-11-23 11:20:00 +01:00 |
|
Quentin Fuxa
|
a175d1a327
|
fixes silence detected but never reported by silero
|
2025-11-23 11:20:00 +01:00 |
|
Quentin Fuxa
|
6206fff118
|
0.2.15
|
2025-11-21 23:52:00 +01:00 |
|
Quentin Fuxa
|
b5067249c0
|
stt/diar/nllw alignment: internal rework 5
|
2025-11-20 23:52:00 +01:00 |
|
Quentin Fuxa
|
f4f9831d39
|
stt/diar/nllw alignment: internal rework 5
|
2025-11-20 23:52:00 +01:00 |
|
Quentin Fuxa
|
254faaf64c
|
stt/diar/nllw alignment: internal rework 5
|
2025-11-20 23:52:00 +01:00 |
|
Quentin Fuxa
|
8e7aea4fcf
|
internal rework 4
|
2025-11-20 23:45:20 +01:00 |
|
Quentin Fuxa
|
270faf2069
|
internal rework 3
|
2025-11-20 22:28:30 +01:00 |
|
Quentin Fuxa
|
b7c1cc77cc
|
internal rework 2
|
2025-11-20 22:06:38 +01:00 |
|
Quentin Fuxa
|
9a45ec221c
|
internal rework 1
|
2025-11-20 12:58:38 +01:00 |
|
Quentin Fuxa
|
b7d20a0ff0
|
segment attribution in result formatter
|
2025-11-19 21:10:28 +01:00 |
|
Quentin Fuxa
|
c1bb9c2bde
|
reduce flickering remaining_time_transcription
|
2025-11-19 19:09:37 +01:00 |
|
Quentin Fuxa
|
11e9def0b2
|
diarization corrections
|
2025-11-19 19:06:03 +01:00 |
|
Quentin Fuxa
|
3104f40f6e
|
fixes #279 #278
|
2025-11-19 18:17:50 +01:00 |
|
Quentin Fuxa
|
e9b4ceeee5
|
Add audio partial silence in chunks handling. bump to 0.2.14.post3
|
2025-11-17 22:52:00 +01:00 |
|
Quentin Fuxa
|
437641fb43
|
reduce min-chunk-size to 0.1, set default model to base
|
2027-04-25 23:52:00 +02:00 |
|
Quentin Fuxa
|
bfd60b3921
|
Add audio partial silence in chunks handling. bump to 0.2.14.post2
|
2025-11-17 22:52:00 +01:00 |
|
Quentin Fuxa
|
1e67bf97f0
|
improve buffering when use of heavy models
|
2027-04-25 23:52:00 +02:00 |
|
Quentin Fuxa
|
bbd4fd6cff
|
Merge branch 'improve_EOS_handling'
|
2025-11-16 22:30:31 +01:00 |
|
Quentin Fuxa
|
28985962a0
|
Silence handling: finish transcription even if not validated at the BEGINNING of the silence
|
2025-11-16 22:29:08 +01:00 |
|
Quentin Fuxa
|
a38c103fcd
|
simulstreaming coreml encoder compatibility
|
2025-11-16 21:24:14 +01:00 |
|
Quentin Fuxa
|
4d2ffb24f8
|
coreml conversion
|
2025-11-16 19:11:43 +01:00 |
|
Quentin Fuxa
|
1bbbb7903c
|
lora loader in shared whisper core
|
2025-11-16 18:44:35 +01:00 |
|
Quentin Fuxa
|
80b77998f9
|
Refactor backend handling
|
2025-11-15 19:51:41 +01:00 |
|
Quentin Fuxa
|
d310f7e25f
|
hf compatibility
|
2025-11-15 18:34:19 +01:00 |
|
Quentin Fuxa
|
8d9be88fe6
|
translation buffer is now displayed in frontend
|
2025-11-10 15:22:26 +01:00 |
|
Quentin Fuxa
|
16461052ed
|
task to direct-english-translation
|
2025-11-10 13:20:26 +01:00 |
|
Quentin Fuxa
|
5491dbd824
|
last_validated_token handled in state
|
2025-11-10 13:18:52 +01:00 |
|
Quentin Fuxa
|
13401ffe24
|
whisper core at root of wlk
|
2025-11-10 12:17:18 +01:00 |
|
Quentin Fuxa
|
7108d2ddc5
|
fixes https://github.com/QuentinFuxa/WhisperLiveKit/issues/269
|
2025-11-09 20:08:18 +01:00 |
|
Quentin Fuxa
|
a732e0903e
|
Add a script to detect alignement heads, usefull for distilled whisper
|
2025-11-09 18:12:09 +01:00 |
|
Quentin Fuxa
|
0491681be4
|
Distilled model compatibility with HF config.json to ModelDimensions
|
2025-11-08 20:20:05 +01:00 |
|
Quentin Fuxa
|
ffe5284764
|
_processing_tasks_done checks task completion
|
2025-11-05 23:34:00 +01:00 |
|
Quentin Fuxa
|
06b31f51eb
|
exception when translation and no nllw
|
2025-10-30 23:30:19 +01:00 |
|
Quentin Fuxa
|
ece02db6a3
|
Use optional new separate NLLW package for translation
|
2025-10-30 19:36:28 +01:00 |
|
Quentin Fuxa
|
939a7ebf8b
|
Translation Local Agreement + Cache optimization v0. Not connected yet
|
2025-10-28 00:16:52 +01:00 |
|
Quentin Fuxa
|
61edb70fff
|
audioProcessor state variables are now uniquely in State dataclass
|
2025-10-26 18:54:47 +01:00 |
|
Quentin Fuxa
|
4e455b8aab
|
translation now separates validated from output buffer tokens
|
2025-10-26 18:51:09 +01:00 |
|
Quentin Fuxa
|
9434390ad3
|
simplify task stopping condition
|
2025-10-26 17:26:43 +01:00 |
|
Quentin Fuxa
|
65250db92c
|
tensor to list at the stream end
|
2025-10-26 16:40:12 +01:00 |
|
Quentin Fuxa
|
416dce7975
|
fixes #261
Co-authored-by: yosagi <11404771+yosagi@users.noreply.github.com>"
|
2025-10-25 14:20:08 +02:00 |
|