Fine-grained Coordinated Cross-lingual Text Stream Alignment for Endless Language Knowledge Acquisition.

EMNLP(2018)

引用 23|浏览211
暂无评分
摘要
This paper proposes to study fine-grained coordinated cross-lingual text stream alignment through a novel information network decipherment paradigm. We use Burst Information Networks as media to represent text streams and present a simple yet effective network decipherment algorithm with diverse clues to decipher the networks for accurate text stream alignment. Experiments on Chinese-English news streams show our approach not only outperforms previous approaches on bilingual lexicon extraction from coordinated text streams but also can harvest high-quality alignments from large amounts of streaming data for endless language knowledge mining, which makes it promising to be a new paradigm for automatic language knowledge acquisition.
更多
查看译文
关键词
Burst Information Networks, Text stream alignment, Network decipherment, language knowledge mining
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要