Subgroup Discovery for Test Selection: A Novel Approach and Its Application to Breast Cancer Diagnosis

Mueller, Marianne; Rosales, Rómer; Steck, Harald; Krishnan, Sriram; Rao, Bharat; Kramer, Stefan

doi:10.1007/978-3-642-03915-7_11

Marianne Mueller²⁰,
Rómer Rosales²¹,
Harald Steck²¹,
Sriram Krishnan²¹,
Bharat Rao²¹ &
…
Stefan Kramer²⁰

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 5772))

Included in the following conference series:

International Symposium on Intelligent Data Analysis

1815 Accesses
10 Citations

Abstract

We propose a new approach to test selection based on the discovery of subgroups of patients sharing the same optimal test, and present its application to breast cancer diagnosis. Subgroups are defined in terms of background information about the patient. We automatically determine the best t subgroups a patient belongs to, and decide for the test proposed by their majority. We introduce the concept of prediction quality to measure how accurate the test outcome is regarding the disease status. The quality of a subgroup is then the best mean prediction quality of its members (choosing the same test for all). Incorporating the quality computation in the search heuristic enables a significant reduction of the search space. In experiments on breast cancer diagnosis data we showed that it is faster than the baseline algorithm APRIORI-SD while preserving its accuracy.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Andreassen, S.: Planning of therapy and tests in causal probabilistic networks. Artifical Intelligence in Medicine 4, 227–241 (1992)
Article Google Scholar
Doubilet, P.: A mathematical approach to interpretation and selection of diagnostic tests. Medical Decision Making 3, 177–195 (1983)
Article Google Scholar
Kavšek, B., Lavrač, N.: APRIORI-SD: Adapting association rule learning to subgroup discovery. Applied Artificial Intelligence 20(7), 543–583 (2006)
Article Google Scholar
Atzmüller, M., Puppe, F.: SD-map – A fast algorithm for exhaustive subgroup discovery. In: Fürnkranz, J., Scheffer, T., Spiliopoulou, M. (eds.) PKDD 2006. LNCS (LNAI), vol. 4213, pp. 6–17. Springer, Heidelberg (2006)
Chapter Google Scholar
BI-RADS Breast Imaging Reporting and Data System, Breast Imaging Atlas. 4th edn. American College of Radiology (2003)
Google Scholar
Agrawal, R., Srikant, R.: Fast algorithms for mining association rules. In: Proceedings of the 20th VLDB Conference, pp. 487–499 (1994)
Google Scholar
Klösgen, W.: Explora: a multipattern and multistrategy discovery assistant, 249–271 (1996)
Google Scholar
Lavrač, N., Kavšek, B., Flach, P., Todorovski, L.: Subgroup discovery with CN2-SD. Journal of Machine Learning Research (2004)
Google Scholar
Wrobel, S.: An algorithm for multi-relational discovery of subgroups. In: Komorowski, J., Żytkow, J.M. (eds.) PKDD 1997. LNCS, vol. 1263, pp. 78–87. Springer, Heidelberg (1997)
Chapter Google Scholar
Leman, D., Feelders, A., Knobbe, A.J.: Exceptional model mining. In: Daelemans, W., Goethals, B., Morik, K. (eds.) ECML PKDD 2008, Part II. LNCS (LNAI), vol. 5212, pp. 1–16. Springer, Heidelberg (2008)
Chapter Google Scholar
Mueller, M., Rosales, R., Steck, H., Krishnan, S., Rao, B., Kramer, S.: Data-efficient information-theoretic test selection. In: Proceedings of the 12th Conference on Artificial Intelligence in Medicine (AIME 2009), pp. 410–415 (2009)
Google Scholar

Download references

Author information

Authors and Affiliations

Institut für Informatik, Technische Universität München, 85748, Garching, Germany
Marianne Mueller & Stefan Kramer
IKM CAD and Knowledge Solutions, Siemens Healthcare, Malvern, PA, 19335, USA
Rómer Rosales, Harald Steck, Sriram Krishnan & Bharat Rao

Authors

Marianne Mueller
View author publications
You can also search for this author in PubMed Google Scholar
Rómer Rosales
View author publications
You can also search for this author in PubMed Google Scholar
Harald Steck
View author publications
You can also search for this author in PubMed Google Scholar
Sriram Krishnan
View author publications
You can also search for this author in PubMed Google Scholar
Bharat Rao
View author publications
You can also search for this author in PubMed Google Scholar
Stefan Kramer
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Mathematics, Imperial College London, South Kensington Campus, SW7 2PG, London, United Kingdom
Niall M. Adams
INSA Lyon, LIRIS CNRS UMR 5205, Bâtiment Blaise Pascal, University of Lyon, F-69621, Villeurbanne, France
Céline Robardet
Department of Information and Computer Science, Universiteit Utrecht, Utrecht, The Netherlands
Arno Siebes
INSA-Lyon, LIRIS CNRS UMR5205, University of Lyon, F-69621, Villeurbanne, France
Jean-François Boulicaut

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Mueller, M., Rosales, R., Steck, H., Krishnan, S., Rao, B., Kramer, S. (2009). Subgroup Discovery for Test Selection: A Novel Approach and Its Application to Breast Cancer Diagnosis. In: Adams, N.M., Robardet, C., Siebes, A., Boulicaut, JF. (eds) Advances in Intelligent Data Analysis VIII. IDA 2009. Lecture Notes in Computer Science, vol 5772. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-03915-7_11

Download citation

DOI: https://doi.org/10.1007/978-3-642-03915-7_11
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-03914-0
Online ISBN: 978-3-642-03915-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics