Show Me A Story: Towards Coherent Neural Story Illustration

2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2018

Abstract
We propose an end-to-end network for visually illustrating a sequence of sentences that form a story. At the core of our model is the ability to model the inter-related nature of the sentences within a story, as well as the ability to learn coherence to support reference resolution. The framework takes the form of an encoder-decoder architecture: sentences are encoded using a hierarchical two-level sentence-story GRU, combined with an encoding of coherence, and sequentially decoded, via a predicted feature representation, into a consistent illustrative image sequence. We optimize all parameters of our network end-to-end with respect to an order-embedding loss that encodes entailment between images and sentences. Experiments on the VIST storytelling dataset [9] highlight the importance of our algorithmic choices and the efficacy of our overall model.
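As a rough illustration of what an order-embedding loss over images and sentences can look like, the sketch below uses the order-violation penalty commonly found in the order-embeddings literature, together with a max-margin contrastive term. The abstract does not spell out the exact form used in the paper, so the function names (`order_violation`, `contrastive_order_loss`), the direction of the ordering, and the `margin` value are all assumptions made for illustration only.

```python
def order_violation(x, y):
    """Order-violation penalty: zero when every coordinate of y
    is <= the matching coordinate of x, positive otherwise.
    (Assumed form; not confirmed by the abstract.)"""
    return sum(max(0.0, yi - xi) ** 2 for xi, yi in zip(x, y))


def contrastive_order_loss(image, sentence, wrong_images, margin=0.05):
    """Max-margin loss: the matching image should violate the order
    with its sentence less than each mismatched image does, by at
    least `margin` (hypothetical default)."""
    pos = order_violation(image, sentence)
    return sum(
        max(0.0, margin + pos - order_violation(neg, sentence))
        for neg in wrong_images
    )
```

With this penalty, a pair that already respects the assumed ordering contributes nothing, and the loss only pushes on pairs where a mismatched image fits the sentence about as well as the correct one.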
Keywords
end-to-end network, visual illustration, inter-related nature, reference resolution, encoder-decoder architecture, two-level sentence-story GRU, end-to-end fashion, sentences sequence, neural story illustration, image sequence, coherence encoding