A Large-Scale Chinese Short-Text Conversation Dataset

Yida Wang
Yida Wang
Yinhe Zheng
Yinhe Zheng
Kaili Huang
Kaili Huang
Yong Jiang
Yong Jiang
Xiaoyan Zhu
Xiaoyan Zhu

international conference natural language processing, pp. 91-103, 2020.

Cited by: 4|Bibtex|Views25|Links

Abstract:

The advancements of neural dialogue generation models show promising results on modeling short-text conversations. However, training such models usually needs a large-scale high-quality dialogue corpus, which is hard to access. In this paper, we present a large-scale cleaned Chinese conversation dataset LCCC, which contains a base version...More

Code:

Data:

Your rating :
0

 

Tags
Comments