RNN-T Models Fail to Generalize to Out-of-Domain Audio: Causes and Solutions
SLT, pp. 873-880, 2021.
In recent years, all-neural end-to-end approaches have obtained state-of-the-art results on several challenging automatic speech recognition (ASR) tasks. However, most existing works focus on building ASR models where train and test data are drawn from the same domain. This results in poor generalization characteristics on mismatched-do...More
PPT (Upload PPT)