Retrieval on source code: a neural code search.

Saksham Sachdev,Hongyu Li, Sifei Luan,Seohyun Kim,Koushik Sen,Satish Chandra

PLDI '18: ACM SIGPLAN Conference on Programming Language Design and Implementation Philadelphia PA USA June, 2018(2018)

引用 186|浏览61
暂无评分
摘要
Searching over large code corpora can be a powerful productivity tool for both beginner and experienced developers because it helps them quickly find examples of code related to their intent. Code search becomes even more attractive if developers could express their intent in natural language, similar to the interaction that Stack Overflow supports. In this paper, we investigate the use of natural language processing and information retrieval techniques to carry out natural language search directly over source code, i.e. without having a curated Q&A forum such as Stack Overflow at hand. Our experiments using a benchmark suite derived from Stack Overflow and GitHub repositories show promising results. We find that while a basic word–embedding based search procedure works acceptably, better results can be obtained by adding a layer of supervision, as well as by a customized ranking strategy.
更多
查看译文
关键词
code search, word-embedding, TF-IDF
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要