Deep neural network ensembles for detecting self-admitted technical debt

JOURNAL OF INTELLIGENT & FUZZY SYSTEMS(2022)

引用 0|浏览0
暂无评分
摘要
Effectively identifying self-admitted technical debt (SATD) from project source code comments helps developers quickly find and repay these debts, thereby reducing its negative impact. Previous studies used techniques based on patterns, text mining, natural language processing, and neural networks to detect SATD. Compared with these above, Convolutional Neural Networks (CNN) have the strong feature extraction ability. Deep network ensembles are demonstrated great potential for the task of sentences classification. In order to boost the performance of CNN-based SATD detecting, we propose a deep neural network ensemble contribute to ensemble learning in a simple yet effective way. Specifically, CNN, CNN-LSTM (convolutional neural network and long short-term memory), and DPCNN (Deep Pyramid Convolutional Neural Networks) are used as individual classifiers to diversify the deep network ensembles. In order to improve the explainability, we introduce attention to measure the contribution of feature words to SATD classification. 62,285 source code comments from 10 projects were used in our experiments. The results show that our approach can effectively reduce misjudgment and detect more SATD, especially for cross-project, so as to greatly improve the detection accuracy.
更多
查看译文
关键词
Self-admitted technical debt, ensemble learning, convolutional neural network, long short-term memory
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要