

Update README.md
paper link
@d649c4b8c00eb7e4a4d09561e8d0583481a0167d
--- README.md
+++ README.md
@@ -1,6 +1,15 @@
 # whisper_streaming
 Whisper realtime streaming for long speech-to-text transcription and translation
 
+**Turning Whisper into Real-Time Transcription System**
+
+Demonstration paper, by Dominik Macháček, Raj Dabre, Ondřej Bojar, 2023
+
+Abstract: Whisper is one of the recent state-of-the-art multilingual speech recognition and translation models, however, it is not designed for real time transcription. In this paper, we build on top of Whisper and create Whisper-Streaming, an implementation of real-time speech transcription and translation of Whisper-like models. Whisper-Streaming uses local agreement policy with self-adaptive latency to enable streaming transcription. We show that Whisper-Streaming achieves high quality and 3.3 seconds latency on unsegmented long-form speech transcription test set, and we demonstrate its robustness and practical usability as a component in live transcription service at a multilingual conference.
+
+
+Pre-print: https://arxiv.org/abs/2307.14743
+
 Demo video: https://player.vimeo.com/video/840442741
 
 ## Installation
@@ -174,7 +183,7 @@
 
 ### Tests
 
-Rigorous quality and latency tests are pending.
+[See the results in paper.](https://arxiv.org/abs/2307.14743)
 
 ## Contact
 