Accurate and Efficient Structural Characteristic Feature Extraction for Clone Detection

FUNDAMENTAL APPROACHES TO SOFTWARE ENGINEERING, PROCEEDINGS(2009)

引用 52|浏览0
暂无评分
摘要
Structure-oriented approaches in clone detection have become popular in both code-based and model-based clone detection. However, existing methods for capturing structural information in software artifacts are either too computationally expensive to be efficient or too light-weight to be accurate in clone detection. In this paper, we present Exas, an accurate and efficient structural characteristic feature extraction approach that better approximates and captures the structure within the fragments of artifacts. Exas structural features are the sequences of labels and numbers built from nodes, edges, and paths of various lengths of a graph-based representation. A fragment is characterized by a structural characteristic vector of the occurrence counts of those features. We have applied Exas in building two clone detection tools for source code and models. Our analytic study and empirical evaluation on open-source software show that Exas and its algorithm for computing the characteristic vectors are highly accurate and efficient in clone detection.
更多
查看译文
关键词
characteristic vector,clone detection,exas structural feature,model-based clone detection,open-source software show,structural information,efficient structural characteristic feature,clone detection tool,structural characteristic vector,present exas,source code,feature extraction
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要