diff --git a/README.md b/README.md index b9b2762..1ad1fee 100644 --- a/README.md +++ b/README.md @@ -3,7 +3,7 @@ This project is based on [Whisper Streaming](https://github.com/ufal/whisper_streaming) and lets you transcribe audio directly from your browser. Simply launch the local server and grant microphone access. Everything runs locally on your machine ✨

- Demo Screenshot + Demo Screenshot

### Differences from [Whisper Streaming](https://github.com/ufal/whisper_streaming) @@ -23,6 +23,11 @@ This project is based on [Whisper Streaming](https://github.com/ufal/whisper_str #### 🎙️ **Advanced Features** - **Real-Time Diarization** – Identify different speakers in real time using [Diart](https://github.com/juanmc2005/diart). +#### 🚀 **Coming Soon** + +- **Faster Word Validation** – Accelerate real-time transcription by validating high-confidence words immediately upon first appearance for whisper backends that returns words & segment probabilities +- **Enhanced Diarization Performance** – Optimize speaker identification by implementing longer steps for Diart processing and leveraging language-specific segmentation patterns to improve speaker boundary detection + ## Installation