Data-Driven Evidence-Based Syntactic Sugar Design
CoRR(2024)
摘要
Programming languages are essential tools for developers, and their evolution
plays a crucial role in supporting the activities of developers. One instance
of programming language evolution is the introduction of syntactic sugars,
which are additional syntax elements that provide alternative, more readable
code constructs. However, the process of designing and evolving a programming
language has traditionally been guided by anecdotal experiences and intuition.
Recent advances in tools and methodologies for mining open-source repositories
have enabled developers to make data-driven software engineering decisions. In
light of this, this paper proposes an approach for motivating data-driven
programming evolution by applying frequent subgraph mining techniques to a
large dataset of 166,827,154 open-source Java methods. The dataset is mined by
generalizing Java control-flow graphs to capture broad programming language
usages and instances of duplication. Frequent subgraphs are then extracted to
identify potentially impactful opportunities for new syntactic sugars. Our
diverse results demonstrate the benefits of the proposed technique by
identifying new syntactic sugars involving a variety of programming constructs
that could be implemented in Java, thus simplifying frequent code idioms. This
approach can potentially provide valuable insights for Java language designers,
and serve as a proof-of-concept for data-driven programming language design and
evolution.
更多查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要