Cross-architecture Binary Function Similarity Detection based on Composite Feature Model

Xiaonan Li,Guimin Zhang,Qingbao Li,Ping Zhang,Zhifeng Chen, Jinjin Liu, Shudan Yue

KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS(2023)

引用 0|浏览7
暂无评分
摘要
Recent studies have shown that the neural network-based binary code similarity detection technology performs well in vulnerability mining, plagiarism detection, and malicious code analysis. However, existing cross-architecture methods still suffer from insufficient feature characterization and low discrimination accuracy. To address these issues, this paper proposes a cross-architecture binary function similarity detection method based on composite feature model (SDCFM). Firstly, the binary function is converted into vector representation according to the proposed composite feature model, which is composed of instruction statistical features, control flow graph structural features, and application program interface calling behavioral features. Then, the composite features are embedded by the proposed hierarchical embedding network based on a graph neural network. In which, the block-level features and the function-level features are processed separately and finally fused into the embedding. In addition, to make the trained model more accurate and stable, our method utilizes the embeddings of predecessor nodes to modify the node embedding in the iterative updating process of the graph neural network. To assess the effectiveness of composite feature model, we contrast SDCFM with the state of art method on benchmark datasets. The experimental results show that SDCFM has good performance both on the area under the curve in the binary function similarity detection task and the vulnerable candidate function ranking in vulnerability search task.
更多
查看译文
关键词
Binary Similarity,Composite Feature Model,Cross-Architecture,Graph Embedding Network,Vulnerability Detection
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要