Which system differences matter? Using l1/l2 regularization to compare dialogue systems

SIGDIAL Conference (2011)

Abstract
We investigate how to jointly explain the performance and behavioral differences of two spoken dialogue systems. Our approach, Joint Evaluation and Differences Identification (JEDI), finds the differences between systems that are relevant to performance by formulating the problem as a multi-task feature selection question. JEDI provides evidence of the usefulness of a recent method, l1/lp-regularized regression (Obozinski et al., 2007). We evaluate against manually annotated success criteria from real users interacting with five different spoken user interfaces that give bus schedule information.
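To make the idea of l1/lp-regularized multi-task feature selection concrete, here is a minimal sketch for the p = 2 case named in the title. It is not the authors' implementation. It assumes a squared-error loss and a plain proximal-gradient solver, and the function names (`group_soft_threshold`, `l1_l2_multitask_regression`) and the parameter `lam` are hypothetical, chosen for illustration only. The penalty sums the l2 norms of the rows of the weight matrix, so a feature is either kept for all tasks or dropped for all tasks.

```python
import numpy as np


def group_soft_threshold(W, t):
    """Proximal operator of t * sum_j ||W[j, :]||_2 (row-wise shrinkage)."""
    norms = np.linalg.norm(W, axis=1, keepdims=True)
    # Rows whose l2 norm is below t are set exactly to zero.
    scale = np.maximum(0.0, 1.0 - t / np.maximum(norms, 1e-12))
    return W * scale


def l1_l2_multitask_regression(Xs, ys, lam, n_iter=500):
    """Minimise sum_k ||X_k w_k - y_k||^2 / (2 n_k) + lam * sum_j ||W[j, :]||_2.

    Xs: list of (n_k, d) arrays, one per task (assumed layout).
    ys: list of (n_k,) arrays, one per task.
    Returns W with shape (d, K); row j holds feature j's weight in each task.
    """
    d, K = Xs[0].shape[1], len(Xs)
    W = np.zeros((d, K))
    # Step size 1/L, where L bounds the Lipschitz constant of the smooth part.
    L = max(np.linalg.norm(X, 2) ** 2 / X.shape[0] for X in Xs)
    step = 1.0 / L
    for _ in range(n_iter):
        # Gradient of the squared loss, computed independently per task.
        G = np.column_stack([
            X.T @ (X @ W[:, k] - y) / X.shape[0]
            for k, (X, y) in enumerate(zip(Xs, ys))
        ])
        # Gradient step followed by group shrinkage across tasks.
        W = group_soft_threshold(W - step * G, step * lam)
    return W
```

Rows of the returned matrix that are entirely zero correspond to features the penalty has discarded for every task; the remaining rows are the features selected jointly.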
Keywords
Join Evaluation, real user, l2 regularization, behavioral difference, Differences Identification, multi-task feature selection question, bus schedule information, annotated success criterion, system differences matter, dialogue system, lp-regularized regression, recent method