Probabilistic and Systematic Coverage of Consecutive Test-Method Pairs for Detecting Order-Dependent Flaky Tests

Wei, Anjiang; Yi, Pu; Xie, Tao; Marinov, Darko; Lam, Wing

doi:10.1007/978-3-030-72016-2_15

Anjiang Wei¹⁰,
Pu Yi¹⁰,
Tao Xie¹⁰,
Darko Marinov¹¹ &
…
Wing Lam¹¹

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 12651))

Included in the following conference series:

International Conference on Tools and Algorithms for the Construction and Analysis of Systems

2836 Accesses
6 Citations

Abstract

Software developers frequently check their code changes by running a set of tests against their code. Tests that can nondeterministically pass or fail when run on the same code version are called flaky tests. These tests are a major problem because they can mislead developers to debug their recent code changes when the failures are unrelated to these changes. One prominent category of flaky tests is order-dependent (OD) tests, which can deterministically pass or fail depending on the order in which the set of tests are run. By detecting OD tests in advance, developers can fix these tests before they change their code. Due to the high cost required to explore all possible orders (n! permutations for n tests), prior work has developed tools that randomize orders to detect OD tests. Experiments have shown that randomization can detect many OD tests, and that most OD tests depend on just one other test to fail. However, there was no analysis of the probability that randomized orders detect OD tests. In this paper, we present the first such analysis and also present a simple change for sampling random test orders to increase the probability. We finally present a novel algorithm to systematically explore all consecutive pairs of tests, guaranteeing to detect all OD tests that depend on one other test, while running substantially fewer orders and tests than simply running all test pairs.

Tao Xie is with the Key Laboratory of High Confidence Software Technologies (Peking University), Ministry of Education, China.

Download to read the full chapter text

Chapter PDF

Initial Results on Counting Test Orders for Order-Dependent Flaky Tests Using Alloy

A revisit of three studies related to random testing

Article 23 March 2015

Excluding code from test coverage: practices, motivations, and impact

Article 07 December 2022

Keywords

References

Apache Hadoop (2020), https://github.com/apache/hadoop
Bell, J., Kaiser, G., Melski, E., Dattatreya, M.: Efficient dependency detection for safe Java test acceleration. In: ESEC/FSE (2015)
Google Scholar
Coefficient of variation (2020), https://en.wikipedia.org/wiki/Coefficient_of_variation
Cucumber (2020), https://cucumber.io/docs/cucumber
Facebook testing and verification request for proposals (2019), https://research.fb.com/programs/research-awards/proposals/facebook-testing-and-verification-request-for-proposals-2019
Gambi, A., Bell, J., Zeller, A.: Practical test dependency detection. In: ICST (2018)
Google Scholar
Golomb, S.W., Taylor, H.: Tuscan squares – A new family of combinatorialdesigns. Ars Combinatoria (1985)
Google Scholar
Google: Avoiding flakey tests (2008), http://googletesting.blogspot.com/2008/04/tott-avoiding-flakey-tests.html
Gyori, A., Shi, A., Hariri, F., Marinov, D.: Reliable testing: Detecting state-polluting tests to prevent test dependency. In: ISSTA (2015)
Google Scholar
Harman, M., O’Hearn, P.: From start-ups to scale-ups: Opportunities and open problems for static and dynamic program analysis. In: SCAM (2018)
Google Scholar
Herzig, K., Greiler, M., Czerwonka, J., Murphy, B.: The art of testing less without sacrificing quality. In: ICSE (2015)
Google Scholar
Herzig, K., Nagappan, N.: Empirically detecting false test alarms using association rules. In: ICSE (2015)
Google Scholar
Houston, R.: Tackling the minimal superpermutation problem (2014), arXiv
Google Scholar
Huo, C., Clause, J.: Improving oracle quality by detecting brittle assertions and unused inputs in tests. In: FSE (2014)
Google Scholar
iDFlakies: Flaky test dataset (2020), https://sites.google.com/view/flakytestdataset
Jiang, H., Li, X., Yang, Z., Xuan, J.: What causes my test alarm? Automatic cause analysis for test alarms in system and integration testing. In: ICSE (2017)
Google Scholar
JUnit (2020), https://junit.org
Kowalczyk, E., Nair, K., Gao, Z., Silberstein, L., Long, T., Memon, A.: Modeling and ranking flaky tests at Apple. In: ICSE SEIP (2020)
Google Scholar
Lam, W.: Illinois Dataset of Flaky Tests (IDoFT) (2020), http://mir.cs.illinois.edu/flakytests
Lam, W., Godefroid, P., Nath, S., Santhiar, A., Thummalapenta, S.: Root causing flaky tests in a large-scale industrial setting. In: ISSTA (2019)
Google Scholar
Lam, W., Muşlu, K., Sajnani, H., Thummalapenta, S.: A study on the lifecycle of flaky tests. In: ICSE (2020)
Google Scholar
Lam, W., Oei, R., Shi, A., Marinov, D., Xie, T.: iDFlakies: A framework for detecting and partially classifying flaky tests. In: ICST (2019)
Google Scholar
Lam, W., Shi, A., Oei, R., Zhang, S., Ernst, M.D., Xie, T.: Dependent-test-aware regression testing techniques. In: ISSTA (2020)
Google Scholar
Lam, W., Winter, S., Astorga, A., Stodden, V., Marinov, D.: Understanding reproducibility and characteristics of flaky tests through test reruns in Java projects. In: ISSRE (2020)
Google Scholar
Lam, W., Winter, S., Wei, A., Xie, T., Marinov, D., Bell, J.: A large-scale longitudinal study of flaky tests. In: OOPSLA (2020)
Google Scholar
Lucas, E.: Récréations mathématiques (1894)
Google Scholar
Luo, Q., Hariri, F., Eloussi, L., Marinov, D.: An empirical analysis of flaky tests. In: FSE (2014)
Google Scholar
Maven (2020), https://maven.apache.org
Maven Surefire plugin (2020), https://maven.apache.org/surefire/maven-surefire-plugin
Memon, A., Gao, Z., Nguyen, B., Dhanda, S., Nickell, E., Siemborski, R., Micco, J.: Taming Google-scale continuous testing. In: ICSE SEIP (2017)
Google Scholar
Micco, J.: The state of continuous integration testing at Google. In: ICST (2017)
Google Scholar
Muşlu, K., Soran, B., Wuttke, J.: Finding bugs by isolating unit tests. In: ESEC/FSE (2011)
Google Scholar
Nie, C., Leung, H.: A survey of combinatorial testing. ACM Comput. Surv. (2011)
Google Scholar
Ollis, M.: Sequenceable groups and related topics. Electronic Journal of Combinatorics (2013)
Google Scholar
pytest (2020), https://docs.pytest.org
RSpec (2020), https://rspec.info
Shi, A., Lam, W., Oei, R., Xie, T., Marinov, D.: iFixFlakies: A framework for automatically fixing order-dependent flaky tests. In: ESEC/FSE (2019)
Google Scholar
Spock (2019), http://docs.spockframework.org
StackExchange – Covering pairs with permutations (2020),https://math.stackexchange.com/questions/1769877/covering-pairs-with-permutations
Test Verification (2019), https://developer.mozilla.org/en-US/docs/Mozilla/QA/Test_Verification
TestNG (2019), https://testng.org/doc/documentation-main.html
Tillson, T.W.: A Hamiltonian decomposition of \(K_{2m}^{*}\), \(2m\ge 8\). Journal of Combinatorial Theory, Series B (1980)
Google Scholar
TotT: Avoiding flakey tests (2019), http://goo.gl/vHE47r
TuscanSquare (2020), https://github.com/Anjiang-Wei/TuscanSquare
Yoo, S., Harman, M.: Regression testing minimization, selection and prioritization: A survey. Software Testing, Verification & Reliability (2012)
Book Google Scholar
Zeller, A., Hildebrandt, R.: Simplifying and isolating failure-inducing input. TSE (2002)
Google Scholar
Zhang, S., Jalali, D., Wuttke, J., Muşlu, K., Lam, W., Ernst, M.D., Notkin, D.: Empirically revisiting the test independence assumption. In: ISSTA (2014)
Google Scholar
Ziftci, C., Reardon, J.: Who broke the build?: Automatically identifying changes that induce test failures in continuous integration at Google scale. In: ICSE (2017)
Google Scholar

Download references

Author information

Authors and Affiliations

Peking University, Beijing, China
Anjiang Wei, Pu Yi & Tao Xie
University of Illinois at Urbana-Champaign, Urbana, IL, USA
Darko Marinov & Wing Lam

Authors

Anjiang Wei
View author publications
You can also search for this author in PubMed Google Scholar
Pu Yi
View author publications
You can also search for this author in PubMed Google Scholar
Tao Xie
View author publications
You can also search for this author in PubMed Google Scholar
Darko Marinov
View author publications
You can also search for this author in PubMed Google Scholar
Wing Lam
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Tao Xie .

Editor information

Editors and Affiliations

Eindhoven University of Technology, Eindhoven, The Netherlands
Jan Friso Groote
Aalborg University, Aalborg East, Denmark
Kim Guldstrand Larsen

Rights and permissions

Open Access This chapter is licensed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made.

The images or other third party material in this chapter are included in the chapter's Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the chapter's Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder.

Reprints and permissions

Copyright information

About this paper

Cite this paper

Wei, A., Yi, P., Xie, T., Marinov, D., Lam, W. (2021). Probabilistic and Systematic Coverage of Consecutive Test-Method Pairs for Detecting Order-Dependent Flaky Tests. In: Groote, J.F., Larsen, K.G. (eds) Tools and Algorithms for the Construction and Analysis of Systems. TACAS 2021. Lecture Notes in Computer Science(), vol 12651. Springer, Cham. https://doi.org/10.1007/978-3-030-72016-2_15

Download citation

DOI: https://doi.org/10.1007/978-3-030-72016-2_15
Published: 20 March 2021
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-72015-5
Online ISBN: 978-3-030-72016-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Societies and partnerships

The European Joint Conferences on Theory and Practice of Software. (opens in a new tab)

Probabilistic and Systematic Coverage of Consecutive Test-Method Pairs for Detecting Order-Dependent Flaky Tests

Abstract

Chapter PDF

Similar content being viewed by others

Initial Results on Counting Test Orders for Order-Dependent Flaky Tests Using Alloy

A revisit of three studies related to random testing

Excluding code from test coverage: practices, motivations, and impact

Keywords

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Societies and partnerships

Navigation

Probabilistic and Systematic Coverage of Consecutive Test-Method Pairs for Detecting Order-Dependent Flaky Tests

Abstract

Chapter PDF

Similar content being viewed by others

Initial Results on Counting Test Orders for Order-Dependent Flaky Tests Using Alloy

A revisit of three studies related to random testing

Excluding code from test coverage: practices, motivations, and impact

Keywords

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Societies and partnerships

Search

Navigation