A Data-Driven Methodology to Assess Text Complexity Based on Syntactic and Semantic Measurements.

IHIET (2019)

Abstract
In this paper, we propose a data-driven methodology to assess the text complexity of Spanish school texts. We model the problem as a classification task that can be solved with machine learning techniques. We show empirically that the discriminative power of the classifier depends on the school grade level. Our proposal includes multiple predictors that capture different dimensions of text complexity, such as coherence and cohesion. We provide an importance analysis of the predictors across several complexity levels. Finally, we assess model performance using accuracy and correlation measurements. The proposed model achieves accuracies of 0.7.
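As a rough illustration of the evaluation step mentioned in the abstract, the sketch below computes accuracy and a correlation coefficient between true and predicted grade levels. It is not the authors' code: the abstract does not say which correlation measure was used, so Pearson's r is an assumption here, and the function names and the toy grade labels are hypothetical.

```python
from math import sqrt


def accuracy(y_true, y_pred):
    """Fraction of texts whose predicted grade level equals the true one."""
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("inputs must be non-empty and of equal length")
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def pearson(xs, ys):
    """Pearson correlation between two equal-length numeric sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("inputs must have equal length of at least 2")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for constant input")
    return cov / sqrt(var_x * var_y)


# Hypothetical toy data: grade levels for eight texts (not from the paper).
true_grades = [1, 2, 3, 4, 5, 6, 7, 8]
pred_grades = [1, 2, 2, 4, 5, 7, 7, 8]

print("accuracy:", accuracy(true_grades, pred_grades))
print("pearson r:", round(pearson(true_grades, pred_grades), 3))
```

Reporting both numbers is useful for ordinal labels such as grade levels: accuracy counts only exact matches, while the correlation also credits predictions that miss by a single grade.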
Keywords
Text difficulty assessment, Natural language processing, Artificial intelligence, Machine learning, Educational systems