MP2D: An Automated Topic Shift Dialogue Generation Framework Leveraging Knowledge Graphs
arxiv(2024)
摘要
Despite advancements in on-topic dialogue systems, effectively managing topic
shifts within dialogues remains a persistent challenge, largely attributed to
the limited availability of training datasets. To address this issue, we
propose Multi-Passage to Dialogue (MP2D), a data generation framework that
automatically creates conversational question-answering datasets with natural
topic transitions. By leveraging the relationships between entities in a
knowledge graph, MP2D maps the flow of topics within a dialogue, effectively
mirroring the dynamics of human conversation. It retrieves relevant passages
corresponding to the topics and transforms them into dialogues through the
passage-to-dialogue method. Through quantitative and qualitative experiments,
we demonstrate MP2D's efficacy in generating dialogue with natural topic
shifts. Furthermore, this study introduces a novel benchmark for topic shift
dialogues, TS-WikiDialog. Utilizing the dataset, we demonstrate that even Large
Language Models (LLMs) struggle to handle topic shifts in dialogue effectively,
and we showcase the performance improvements of models trained on datasets
generated by MP2D across diverse topic shift dialogue tasks.
更多查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要