Dong Yu, ofMicrosoft Research Redmond, andFrank Seide, ofMicrosoft Research Asia, have extended the state of the art in real-time, speaker-independent, automatic speech recognition to produce the Deep-Neural-Network Speech Recognition based MAVIS. Though, this concept has been in use for speech recognition for more than 20 years, computer scientists gained access to enough computing power to make it possible to build models only a few years ago. The Deep-Neural-Network Speech Recognition systems are characterized by improved accuracy and faster processor timing. In this process, an audio file is recognized multiple times, and after each time, the recognizer tunes itself a little more closely to the specific speaker or speakers in the file, so that the next time, it gets better.
Categorization (need >= 0.5 to match):