Machine Learning Support for EU Funding Project Categorization

COMPUTER JOURNAL(2019)

引用 1|浏览10
暂无评分
摘要
European Union reallocates its money to their member states using different kinds of funding. EU member states categorize EU funding projects using their own categorization system. While EU prepared an integrated European categorization system, many EU members do not use it in their reports. This hinders a straightforward fiscal analysis. The article aims at an automatic support for categorization of EU funding projects by Machine Learning. The experiments showed that Support Vector Machines (SVM) is the top performance Machine Learning algorithm for this task. We experimented with the SVM classifier and the results disclosed that by employing this approach we can classify EU funding projects using a lexical description better than a baseline (i.e. the classification to a major class). Further, we experienced that the approach using the natural language translator outperforms the approach using the word sense disambiguation. Finally, we investigated the influence of the length of project description on the performance of the classifier. The results showed that while there was a positive correlation between the length of project description and the classifier performance for project descriptions in English, in the case of project description in Non-English languages the classifier performed better for shorter project descriptions. In future, we plan to build a new online application which would use the classifier on the back-end and a user would get a category recommendation on the front-end using a visualization of the EU categorization system.
更多
查看译文
关键词
EU funding projects,open fiscal data,categorization,RDF code list,machine learning
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要