Statistical methods for analyzing speedup learning experiments

Etzioni, Oren; Etzioni, Ruth

doi:10.1007/BF00993983

Technical Note
Published: March 1994

Machine Learning, volume 14, pages 333–347 (1994)

Oren Etzioni¹ & Ruth Etzioni²,³

Abstract

Speedup learning systems are typically evaluated by comparing their impact on a problem solver's performance. The impact is measured by running the problem solver, before and after learning, on a sample of problems randomly drawn from some distribution. Often, the experimenter imposes a bound on the CPU time the problem solver is allowed to spend on any individual problem. Segre et al. (1991) argue that the experimenter's choice of time bound can bias the results of the experiment. To address this problem, we present statistical hypothesis tests specifically designed to analyze speedup data and eliminate this bias. We apply the tests to the data reported by Etzioni (1990a) and show that most (but not all) of the speedups observed are statistically significant.
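
To make the setup concrete, the following is a minimal sketch of one way paired, time-bounded runtimes could be compared. It is an illustration under stated assumptions, not the tests developed in the article: it uses a plain one-sided sign test, treats any runtime at or above the bound as censored, and drops pairs whose order cannot be determined (including pairs where both runs hit the bound). The function name `censored_sign_test` and its parameters are hypothetical.

```python
from math import comb


def censored_sign_test(before, after, bound):
    """One-sided sign test on paired runtimes censored at `bound`.

    Illustrative sketch only (an assumption, not the article's procedure).
    before[i] and after[i] are the runtimes on problem i without and with
    learning; any value >= bound is treated as "did not finish in time".
    Returns (wins, losses, p_value), where a win means the run after
    learning was strictly faster.
    """
    wins = losses = 0
    for b, a in zip(before, after):
        # Clip at the bound: a censored run is only known to take >= bound.
        b_c, a_c = min(b, bound), min(a, bound)
        if a_c < b_c:
            wins += 1
        elif a_c > b_c:
            losses += 1
        # Equal clipped times (including both runs censored) carry no
        # ordering information and are dropped as ties.
    n = wins + losses
    if n == 0:
        return wins, losses, 1.0
    # P(X >= wins) for X ~ Binomial(n, 1/2), the null of "no speedup".
    p_value = sum(comb(n, k) for k in range(wins, n + 1)) / 2 ** n
    return wins, losses, p_value


# Hypothetical runtimes in CPU seconds with a 100-second bound.
before = [10, 50, 100, 100, 3, 7]
after = [2, 5, 40, 100, 4, 1]
print(censored_sign_test(before, after, bound=100))
```

In this sketch, a pair is counted only when its order is known despite censoring, so changing the bound can turn an informative pair into a tie (or the reverse) but cannot flip the direction of a counted pair.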

References

Brown, B.W. Jr., & Hollander, M. (1977). Statistics: A biomedical introduction. New York: Wiley.
Cohen, Paul R., & Kim, John B. (1993). A bootstrap test for comparing performance of programs when data are censored, and comparisons to Etzioni's test. Unpublished manuscript, University of Massachusetts, Amherst.
DeGroot, Morris H. (1986). Probability and statistics (2nd ed.). Reading, MA: Addison Wesley.
Etzioni, Oren. (1990a). A structural theory of explanation-based learning. Ph.D. dissertation, Carnegie Mellon University, Pittsburgh, PA. (Available as technical report CMU-CS-90-185.)
Etzioni, Oren. (1990b). Why Prodigy/EBL works. In Proceedings of AAAI-90.
Gibbons, Jean Dickinson. (1971). Nonparametric statistical inference. New York: McGraw-Hill.
Hajek, J., & Sidak, Z. (1967). Theory of rank tests. New York: Academic Press.
Hemelryk, J. (1952). A theorem on the sign test when ties are present. Indagationes Mathematica, 14, 322–326.
Holt, J.D., & Prentice, R.L. (1974). Survival analysis in twin studies and matched pair experiments. Biometrika, 61, 17–30.
Kalbfleisch, J.D., & Prentice, R.L. (1980). The statistical analysis of failure time data. New York: Wiley.
Kambhampati, Subbarao, & Chen, Jengchin. (1993). Relative utility of EBG based plan reuse in partial ordering vs. total ordering planning. In Proceedings of the 11th National Conference on Artificial Intelligence (AAAI-93). Cambridge, MA: MIT Press (AAAI).
Knoblock, Craig A. (1990). Learning abstraction hierarchies for problem solving. In Proceedings of the Eighth National Conference on Artificial Intelligence. Menlo Park, CA: AAAI Press.
Knoblock, Craig A. (In press). Automatically generating abstractions for planning. Artificial Intelligence.
Lehmann, E.L. (1975). Nonparametrics: Statistical methods based on ranks. San Francisco: Holden Day.
Minton, Steven. (1988a). Quantitative results concerning the utility of explanation-based learning. In Proceedings of AAAI-88 (pp. 564–569).
Minton, Steven. (1988b). Learning effective search control knowledge: An explanation-based approach. Ph.D. dissertation, Carnegie Mellon University, Pittsburgh, PA. (Available as technical report CMU-CS-88-133.)
Minton, Steven. (1993). Integrating heuristics for constraint satisfaction problems: A case study. In AAAI-93 Proceedings.
Mooney, Raymond J. (1989). The effect of rule use on the utility of explanation-based learning. In Proceedings of the Eleventh International Joint Conference on Artificial Intelligence (pp. 725–730).
O'Rorke, P. (1989). LT revisited: Explanation-based learning and the logic of Principia Mathematica. Machine Learning, 4(2), 117–160.
Segre, Alberto, Elkan, Charles, & Russell, Alexander. (1991). A critical look at experimental evaluations of EBL. Machine Learning, 6(2).
Shavlik, Jude W. (1990). Acquiring recursive concepts and iterative concepts with explanation-based learning. Machine Learning, 5(1).
Wilks, Samuel S. (1962). Mathematical statistics. New York: John Wiley & Sons.
Woolson, R.F., & Lachenbruch, P.A. (1980). Rank tests for censored matched pairs. Biometrika, 67, 597–606.

Author information

Authors and Affiliations

Department of Computer Science and Engineering, FR-35, University of Washington, Seattle, WA 98195
Oren Etzioni
Fred Hutchinson Cancer Research Center, Division of Public Health Sciences, Seattle, WA 98104
Ruth Etzioni
Department of Biostatistics, University of Washington, Seattle, WA 98195
Ruth Etzioni

About this article

Cite this article

Etzioni, O., Etzioni, R. Statistical methods for analyzing speedup learning experiments. Mach Learn 14, 333–347 (1994). https://doi.org/10.1007/BF00993983

Received: 09 July 1992
Accepted: 07 October 1992
Issue Date: March 1994
DOI: https://doi.org/10.1007/BF00993983
