Inference Guided Data Exploration

Yothers, Greg; Sampson, Allan R.

doi:10.1007/0-8176-4422-9_3

Inference Guided Data Exploration

Greg Yothers^5,6 &
Allan R. Sampson^5,6

Chapter

1925 Accesses

Part of the book series: Statistics for Industry and Technology ((SIT))

Abstract

We consider comparing two treatments using a given hypothesis test on the full sample and on all possible subsets, and we separately consider restricting the subsets considered to be those defined based on half-intervals of a covariate. Rather than treating this as a family of hypothesis tests, we instead choose the minimum p-value from the group of hypothesis tests as our test statistic. Simulation is employed to find an approximate critical value to control the type I error for our novel test statistic. These techniques may be used as a rule of thumb for judging the potential significance of a result after a “fishing expedition” has been caried out on a dataset, i.e., a large number of tests of hypothesis were performed on subsets of the data or a subset was selected after inspecting the data. When the technique is restricted to subsets defined based on half-intervals of a covariate, it may be useful as a planned methodology for analyzing an experiment.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Hardcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Fleiss, J. L. (1986). The Design and Analysis of Clinical Experiments, John Wiley & Sons, New York.
MATH Google Scholar
Hsu, J. C. (1996). Multiple Comparisons, Chapman and Hall, New York.
MATH Google Scholar
Koziol, J. A. and Wu, S. H. (1996). Changepoint statistics for assessing a treatment-vovariate interaction, Biometrics, 52, 1147–1152.
Article MATH Google Scholar
Mamounas, E. P. (1997). NSABP Protocol B-27: Preoperative doxorubicin plus cyclophosphamide followed by preoperative or postoperative docetaxel, Oncology, 11 (Suppl. No. 6), 37–40.
Google Scholar
Miller R. G. (1981). Simultaneous Statistical Inference, Springer-Verlag, New York.
MATH Google Scholar
Potthoff, R. F. (1964). On the Johnson-Neyman technique and some extensions thereof, Psychometrika, 29, 241–256.
Article Google Scholar
Worsley, K. J. (1992). A three dimensional statistical analysis for CBF activation studies in human brain, Journal of Cerebral Blood Flow and Metabolism, 12, 900–918.
Google Scholar
Yothers, G. (2003). Methodologies for Identifying Subsets of the Population Where Two Treatments Differ, Ph.D. Dissertation, University of Pittsburgh, Pittsburgh, Pennsylvania.
Google Scholar

Download references

Author information

Authors and Affiliations

National Surgical Adjuvant Breast and Bowel Project (NSABP), Pittsburgh, Pennsylvania, USA
Greg Yothers & Allan R. Sampson
Department of Statistics, University of Pittsburgh, Pennsylvania, USA
Greg Yothers & Allan R. Sampson

Authors

Greg Yothers
View author publications
You can also search for this author in PubMed Google Scholar
Allan R. Sampson
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Mathematics and Statistics, McMaster University, Hamilton, Ontario, L8S 4K1, Canada
N. Balakrishnan
Department of Statistics, Ohio State University, 19S8 Neil Avenue Cockins Hall, Room 404, Columbus, OH, 43210-1247, USA
H. N. Nagaraja
Department of Management Science and Statistics, University of Texas at San Antonio, 6900 N. Loop 1604 W, San Antonio, TX, 78249-0632, USA
N. Kannan

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Yothers, G., Sampson, A.R. (2005). Inference Guided Data Exploration. In: Balakrishnan, N., Nagaraja, H.N., Kannan, N. (eds) Advances in Ranking and Selection, Multiple Comparisons, and Reliability. Statistics for Industry and Technology. Birkhäuser Boston. https://doi.org/10.1007/0-8176-4422-9_3

Download citation

DOI: https://doi.org/10.1007/0-8176-4422-9_3
Publisher Name: Birkhäuser Boston
Print ISBN: 978-0-8176-3232-8
Online ISBN: 978-0-8176-4422-2
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)

Publish with us

Policies and ethics