Transcribing Radio News

ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4(1996)

引用 25|浏览9
暂无评分
摘要
We have recently extended the capabilities of BBN's large vocabulary discrete-utterance speech recognition system (BYBLOS) to operate on raw audio recordings of radio news programming. The recording are given to the system as large monolithic waveforms without any additional side-information. Our goal is to transcribe ail speech in the input with the highest accuracy possible. The problem is very challenging because radio news programming has frequent changes in speaker, speaking style, dialect, accent, topic, channel, and environmental conditions. Furthermore, the monolithic input presents new problems for recognition algorithms and language models since all useful boundaries (such as speaker turns or sentence ends) are unknown.
更多
查看译文
关键词
hidden markov models,channel,speech recognition,language model,system testing,accuracy,background noise,training data,telephony,natural languages,bandwidth,audio recording,radio broadcasting,language models
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要