Improving dialogue classification using a topic space representation and a Gaussian classifier based on the decision rule

ICASSP(2014)

引用 43|浏览62
暂无评分
摘要
In this paper, we study the impact of dialogue representations and classification methods in the task of theme identification of telephone conversation services having highly imperfect automatic transcriptions. Two dialogue representations are firstly compared: the classical Term Frequency-Inverse Document Frequency with Gini purity criteria (TF-IDF-Gini) method and the Latent Dirichlet Allocation (LDA) approach. We then propose to study an original classification method that takes advantage of the LDA topic space representation, highlighted as the best dialogue representation. To do so, two assumptions about topic representation led us to choose a Gaussian process (GP) based method. This approach is compared with a Support Vector Machine (SVM) classification method. Results show that the GP approach is a better solution to deal with the multiple theme complexity of a dialogue, no matter the conditions studied (manual or automatic transcriptions). We finally discuss the impact of the topic space reduction on the classification accuracy.
更多
查看译文
关键词
svm classification method,speech recognition,topic space representation,latent dirichlet allocation,theme classification,decision rule,term frequency-inverse document frequency-gini purity criteria,pattern classification,svm,dialogue representation,gaussian process based method,speech analytics,gaussian process,support vector machine,multiple theme complexity,gaussian processes,gaussian classifier,tf-idf-gini method,interactive systems,telephone conversation services,theme identification,dialogue classification,support vector machines,lda approach,automatic transcriptions,vectors,resource management,speech,accuracy,semantics
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要