Techniques for Statistical Debugging

Liblit, Ben

doi:10.1007/978-3-540-71878-9_4

Ben Liblit¹

Part of the book series: Lecture Notes in Computer Science ((LNPSE,volume 4440))

321 Accesses

Abstract

Thus far we have focused on techniques for collecting sparsely sampled data from large numbers of users. However, this data is only as good as the sense we can make of it. This chapter presents several techniques for using sparsely sampled data to isolate the causes of bugs.

Sampled data is terribly incomplete. With sampling, 99% of everything that happens is not even seen. Thus, we do not give strict causes and effects as one might look for using a symbolic debugger. Instead we use statistical models to identify those behaviors that tend to be strongly predictive of failure over many runs. We refer to this body of techniques as statistical debugging. Statistical debugging reaps the benefits of the Bernoulli sampling transformation developed in Sect. 2.1.1. While the data is incomplete, it is incomplete in a fair, statistically unbiased way. Thus the observed data is a noisy but representative sample of the complete behavior, and failure trends identified in the former are equally applicable to the later.

Section 4.1 defines some basic notation and terminology that we will use throughout the remainder of this chapter. In Sect. 4.2 we describe an algorithm for isolating single, deterministic bugs using a process of elimination. Section 4.3 extends our scope to non-deterministic bugs using a general-purpose statistical regression model. This approach has certain limitations, which we discuss in greater depth in Sect. 4.3.4 and Sect. 4.4. Better understanding of these limitations leads us to develop an improved algorithm in Sect. 4.5 that combines statistical ranking techniques with an iterative bug elimination process to manage multiple unknown deterministic and non-deterministic bugs. The ranking and iterative elimination is our best known algorithm to date. Section 4.6 offers several case studies demonstrating how the algorithm has been used to successfully isolate both known and previously unknown bugs in real applications.

What is luck? Luck is probability taken personally. It is the excitement of bad math.

Penn Jillette

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Author information

Authors and Affiliations

University of Wisconsin-Madison, Computer Sciences Department, 1210 West Dayton Street, Madison, WI 53706-1685, USA
Ben Liblit

Authors

Ben Liblit
View author publications
You can also search for this author in PubMed Google Scholar

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Liblit, B. (2007). Techniques for Statistical Debugging. In: Cooperative Bug Isolation. Lecture Notes in Computer Science, vol 4440. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-71878-9_4

Download citation

DOI: https://doi.org/10.1007/978-3-540-71878-9_4
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-71877-2
Online ISBN: 978-3-540-71878-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics