Skip to main content

Inference Guided Data Exploration

  • Chapter
  • 1925 Accesses

Part of the book series: Statistics for Industry and Technology ((SIT))

Abstract

We consider comparing two treatments using a given hypothesis test on the full sample and on all possible subsets, and we separately consider restricting the subsets considered to be those defined based on half-intervals of a covariate. Rather than treating this as a family of hypothesis tests, we instead choose the minimum p-value from the group of hypothesis tests as our test statistic. Simulation is employed to find an approximate critical value to control the type I error for our novel test statistic. These techniques may be used as a rule of thumb for judging the potential significance of a result after a “fishing expedition” has been caried out on a dataset, i.e., a large number of tests of hypothesis were performed on subsets of the data or a subset was selected after inspecting the data. When the technique is restricted to subsets defined based on half-intervals of a covariate, it may be useful as a planned methodology for analyzing an experiment.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   129.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD   169.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Fleiss, J. L. (1986). The Design and Analysis of Clinical Experiments, John Wiley & Sons, New York.

    MATH  Google Scholar 

  2. Hsu, J. C. (1996). Multiple Comparisons, Chapman and Hall, New York.

    MATH  Google Scholar 

  3. Koziol, J. A. and Wu, S. H. (1996). Changepoint statistics for assessing a treatment-vovariate interaction, Biometrics, 52, 1147–1152.

    Article  MATH  Google Scholar 

  4. Mamounas, E. P. (1997). NSABP Protocol B-27: Preoperative doxorubicin plus cyclophosphamide followed by preoperative or postoperative docetaxel, Oncology, 11 (Suppl. No. 6), 37–40.

    Google Scholar 

  5. Miller R. G. (1981). Simultaneous Statistical Inference, Springer-Verlag, New York.

    MATH  Google Scholar 

  6. Potthoff, R. F. (1964). On the Johnson-Neyman technique and some extensions thereof, Psychometrika, 29, 241–256.

    Article  Google Scholar 

  7. Worsley, K. J. (1992). A three dimensional statistical analysis for CBF activation studies in human brain, Journal of Cerebral Blood Flow and Metabolism, 12, 900–918.

    Google Scholar 

  8. Yothers, G. (2003). Methodologies for Identifying Subsets of the Population Where Two Treatments Differ, Ph.D. Dissertation, University of Pittsburgh, Pittsburgh, Pennsylvania.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2005 Birkhäuser Boston

About this chapter

Cite this chapter

Yothers, G., Sampson, A.R. (2005). Inference Guided Data Exploration. In: Balakrishnan, N., Nagaraja, H.N., Kannan, N. (eds) Advances in Ranking and Selection, Multiple Comparisons, and Reliability. Statistics for Industry and Technology. Birkhäuser Boston. https://doi.org/10.1007/0-8176-4422-9_3

Download citation

Publish with us

Policies and ethics