Exploring End-to-End Techniques for Low-Resource Speech Recognition.

Lecture Notes in Artificial Intelligence(2018)

引用 5|浏览35
暂无评分
摘要
In this work we present simple grapheme-based system for low-resource speech recognition using Babel data for Turkish spontaneous speech (80 h). We have investigated different neural network architectures performance, including fully-convolutional, recurrent and ResNet with GRU. Different features and normalization techniques are compared as well. We also proposed CTC-loss modification using segmentation during training, which leads to improvement while decoding with small beam size. Our best model achieved word error rate of 45.8%, which is the best reported result for end-to-end systems using in-domain data for this task, according to our knowledge.
更多
查看译文
关键词
Low-resource speech recognition,End-to-end speech recognition,Connectionist Temporal Classification
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要