SDST: Successive Decoding for Speech-to-text Translation
arxiv(2020)
摘要
End-to-end speech-to-text translation (ST), which directly translates the source language speech to the target language text, has attracted intensive attention recently. However, the combination of speech recognition and machine translation in a single model poses a heavy burden on the direct cross-modal cross-lingual mapping. To reduce the learning difficulty, we propose SDST, an integral framework with \textbf{S}uccessive \textbf{D}ecoding for end-to-end \textbf{S}peech-to-text \textbf{T}ranslation task. This method is verified in two mainstream datasets. Experiments show that our proposed \method improves the previous state-of-the-art methods by big margins.
更多查看译文
关键词
successive decoding,sdst,translation,speech-to-text
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络