We set out to develop SwiftScribe to fix a pain point: the time-consuming process of manually transcribing audio word by word.
Now, by combining Baidu’s state-of-the-art speech recognition technology with easy editing tools, SwiftScribe lets people transcribe voice recordings quickly and easily, increasing productivity and streamlining their workflow.
The core technology powering SwiftScribe is Baidu’s speech recognition engine, Deep Speech 2. Its neural network, which is trained on thousands of hours of labeled audio data, learns to associate sounds with certain words and phrases.
In addition to advanced automatic speech recognition (ASR) technology, we designed intuitive shortcut keys and innovative human-computer interaction to solve the problem of discontinuity, one of the biggest obstacles users face when transcribing.
Baidu SVAIL has developed every component of SwiftScribe, from the speech recognition system to the user interface. One big advantage of this approach is that, as users transcribe and make edits, the system can learn and improve along the way.
This sophisticated end-to-end approach is what sets SwiftScribe apart from competitors on the market.