Visual Recipe Flow: A Dataset for Learning Visual State Changes of Objects with Recipe Flows.

International Conference on Computational Linguistics(2022)

引用 0|浏览7
暂无评分
摘要
We present a new multimodal dataset called Visual Recipe Flow, which enables us to learn a cooking action result for each object in a recipe text. The dataset consists of object state changes and the workflow of the recipe text. The state change is represented as an image pair, while the workflow is represented as a recipe flow graph. We developed a web interface to reduce human annotation costs. The dataset allows us to try various applications, including multimodal information retrieval.
更多
查看译文
关键词
visual state changes,flows,dataset
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要