FlakiMe: Laboratory-Controlled Test Flakiness Impact Assessment

Maxime Cordy, Renaud Rwemalika, Adriano Franci, Mike Papadakis, Mark Harman

2022 IEEE/ACM 44th International Conference on Software Engineering (ICSE) (2022)

Abstract

Much research on software testing makes an implicit assumption that test failures are deterministic such that they always witness the presence of the same defects. However, this assumption is not always true because some test failures are due to so-called flaky tests, i.e., tests with non-deterministic outcomes. To help testing researchers better investigate flakiness, we introduce a test flakiness assessment and experimentation platform, called FlakiMe. FlakiMe supports the seeding of a (controllable) degree of flakiness into the behaviour of a given test suite. Thereby, FlakiMe equips researchers with ways to investigate the impact of test flakiness on their techniques under laboratory-controlled conditions. To demonstrate the application of FlakiMe, we use it to assess the impact of flakiness on mutation testing and program repair (the PRAPR and ARJA methods). These results indicate that a 10% flakiness is sufficient to affect the mutation score, but the effect size is modest (2% – 5%), while it reduces the number of patches produced for repair by 20% up to 100% of repair problems; a devastating impact on this application of testing. Our experiments with FlakiMe demonstrate that flakiness affects different testing applications in very different ways, thereby motivating the need for a laboratory-controllable flakiness impact assessment platform and approach such as FlakiMe.
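
The idea of seeding a controllable degree of flakiness can be illustrated with a minimal Python sketch. It is not FlakiMe's implementation or API, which the abstract does not describe: the names `seed_flakiness`, `run_suite`, and `InjectedFlakyFailure` are hypothetical, and the sketch assumes flakiness is modelled as an independent per-execution failure probability drawn from a seeded random generator.

```python
import random
from typing import Callable, Iterable


class InjectedFlakyFailure(AssertionError):
    """Marks a failure that was injected, not caused by a real defect."""


def seed_flakiness(test: Callable[[], None], rate: float,
                   rng: random.Random) -> Callable[[], None]:
    """Wrap a test so that each execution fails with probability `rate`."""
    if not 0.0 <= rate <= 1.0:
        raise ValueError("rate must be in [0, 1]")

    def flaky_test() -> None:
        # rng.random() is uniform in [0, 1), so this branch is taken
        # with probability `rate` (assumed flakiness model).
        if rng.random() < rate:
            raise InjectedFlakyFailure("injected non-deterministic failure")
        test()  # otherwise the original test decides the outcome

    return flaky_test


def run_suite(tests: Iterable[Callable[[], None]]) -> int:
    """Run every test once and return the number of failures."""
    failures = 0
    for test in tests:
        try:
            test()
        except AssertionError:
            failures += 1
    return failures


def passing_test() -> None:
    """A deterministic test that always passes on its own."""
    assert 1 + 1 == 2


# A fixed seed keeps the experiment repeatable between runs.
rng = random.Random(42)
suite = [seed_flakiness(passing_test, 0.10, rng) for _ in range(1000)]
observed_failures = run_suite(suite)
```

With a 10% rate, roughly one in ten executions of an otherwise passing test fails. A technique that consumes test outcomes, such as mutation analysis or patch validation, can then be re-run at different rates to observe how its results change.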

Keywords

software testing, implicit assumption, test failures, flaky tests, nondeterministic outcomes, test flakiness assessment, experimentation platform, test suite, FlakiMe, mutation testing, program repair, laboratory-controlled test flakiness impact assessment, ARJA methods, PRAPR