Comparing Linguistic Features for Modeling Learning in Computer Tutoring

AIED(2007)

Abstract
We compare the relative utility of different automatically computable linguistic feature sets for modeling student learning in computer dialogue tutoring. We use the PARADISE framework (multiple linear regression) to build a learning model from each of 6 linguistic feature sets: 1) surface features, 2) semantic features, 3) pragmatic features, 4) discourse structure features, 5) local dialogue context features, and 6) all feature sets combined. We hypothesize that although more sophisticated linguistic features are harder to obtain, they will yield stronger learning models. We train and test our models on 3 different train/test dataset pairs derived from our 3 spoken dialogue tutoring system corpora. Our results show that more sophisticated linguistic features usually perform better than either a baseline model containing only pretest score or a model containing only surface features, and that semantic features generalize better than other linguistic feature sets.
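The comparison described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it only shows the general idea of fitting a multiple linear regression on one corpus and scoring it on another, once with pretest score alone (the baseline) and once with pretest plus one additional feature. The function names, the single placeholder feature, and all numbers are hypothetical and chosen purely for illustration.

```python
from typing import List, Sequence


def solve_linear_system(a: List[List[float]], b: List[float]) -> List[float]:
    """Solve a @ x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [list(row) + [rhs] for row, rhs in zip(a, b)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        if abs(m[pivot][col]) < 1e-12:
            raise ValueError("singular system: predictors are collinear")
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        tail = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - tail) / m[i][i]
    return x


def fit_ols(rows: Sequence[Sequence[float]], targets: Sequence[float]) -> List[float]:
    """Least-squares fit of y = b0 + b1*x1 + ... via the normal equations."""
    design = [[1.0] + list(r) for r in rows]  # prepend intercept column
    k = len(design[0])
    xtx = [[sum(d[i] * d[j] for d in design) for j in range(k)] for i in range(k)]
    xty = [sum(d[i] * y for d, y in zip(design, targets)) for i in range(k)]
    return solve_linear_system(xtx, xty)


def predict(coefs: Sequence[float], row: Sequence[float]) -> float:
    return coefs[0] + sum(c * x for c, x in zip(coefs[1:], row))


def r_squared(coefs, rows, targets) -> float:
    """Proportion of variance in targets explained by the fitted model."""
    mean_y = sum(targets) / len(targets)
    ss_tot = sum((y - mean_y) ** 2 for y in targets)
    ss_res = sum((y - predict(coefs, r)) ** 2 for r, y in zip(rows, targets))
    return 1.0 - ss_res / ss_tot


if __name__ == "__main__":
    # Hypothetical toy data: (pretest score, one placeholder linguistic feature).
    train_rows = [(0.30, 1.0), (0.45, 3.0), (0.50, 2.0), (0.60, 5.0), (0.70, 4.0)]
    train_post = [0.50, 0.68, 0.66, 0.85, 0.84]
    test_rows = [(0.35, 2.0), (0.55, 4.0), (0.65, 3.0), (0.40, 5.0)]
    test_post = [0.58, 0.80, 0.79, 0.75]

    # Baseline model: pretest score only.
    base_train = [(p,) for p, _ in train_rows]
    base_test = [(p,) for p, _ in test_rows]
    baseline = fit_ols(base_train, train_post)

    # Richer model: pretest score plus the placeholder feature.
    richer = fit_ols(train_rows, train_post)

    print("baseline test R^2:", r_squared(baseline, base_test, test_post))
    print("richer   test R^2:", r_squared(richer, test_rows, test_post))
```

Scoring on a held-out corpus rather than on the training data is what allows a statement about how well a feature set generalizes; a model with more predictors always fits its own training data at least as well as the baseline.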
Keywords
computer tutoring, pragmatic feature, surface feature, modeling learning, discourse structure feature, linguistic feature set, local dialogue context feature, semantic feature, linguistic features, computable linguistic feature set, sophisticated linguistic, baseline model, sophisticated linguistic feature, multiple linear regression, linguistics