Subgroup and outlier detection analysis

Wu, Gang; Pawlikowska, Iwona; Gruber, Tanja; Downing, James; Zhang, Jinghui; Pounds, Stan

doi:10.1186/1471-2105-14-S17-A2

Meeting abstract
Open access
Published: 22 October 2013

Volume 14, article number A2, (2013)
BMC Bioinformatics

Gang Wu¹, Iwona Pawlikowska², Tanja Gruber³, James Downing⁴, Jinghui Zhang¹ & Stan Pounds²

Background

High-dimensional biological data present the opportunity to discover novel forms of biological heterogeneity, such as overexpression or suppression of a particular gene in a subset of a cohort. Such heterogeneity appears in the data as outliers or distinct subgroups. Here, we describe and evaluate three procedures for subgroup and outlier detection analysis (SODA): a leave-one-out (LOO) procedure that is widely used for outlier detection in the bioinformatics literature, and the least median of squares (LMS) procedure and the dip test (DT), both from the statistics literature. We also propose and evaluate the max spacing test (MST) as a novel SODA method.
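To make the LMS idea concrete, the following is a minimal sketch for a single gene's expression values, not the implementation evaluated in this abstract. It assumes the textbook univariate case, in which the LMS location estimate is the midpoint of the shortest interval covering just over half of the sorted observations. The function names, the 1.4826 consistency factor for the scale, the 2.5 cutoff, and the example values are illustrative assumptions.

```python
def lms_location(values):
    """Univariate least-median-of-squares location (textbook form).

    Returns (center, half_width), where center is the midpoint of the
    shortest interval containing h = n // 2 + 1 sorted observations.
    """
    xs = sorted(values)
    n = len(xs)
    h = n // 2 + 1
    # Slide a window over h consecutive order statistics; keep the narrowest.
    start = min(range(n - h + 1), key=lambda i: xs[i + h - 1] - xs[i])
    lo, hi = xs[start], xs[start + h - 1]
    return (lo + hi) / 2.0, (hi - lo) / 2.0


def flag_outliers(values, cutoff=2.5):
    """Flag values far from the LMS center (assumed rule, not from the abstract)."""
    center, half_width = lms_location(values)
    # Assumption: rescale the half-width so it estimates the standard
    # deviation when the bulk of the data is roughly normal.
    scale = 1.4826 * half_width
    if scale == 0:
        # Degenerate case: more than half the values are identical.
        return [v for v in values if v != center]
    return [v for v in values if abs(v - center) / scale > cutoff]


# Hypothetical expression values for one gene across eight samples.
expression = [5.1, 4.9, 5.0, 5.2, 4.8, 5.05, 4.95, 12.0]
print(flag_outliers(expression))
```

Because the center and scale come from the tightest half of the data, a few extreme samples do not distort them, which is why a procedure of this kind can pick out a small overexpressing subset.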

Results

In simulation studies, we found that LMS, DT, and MST are each the best method in specific settings. In an example analysis, we found that LMS and MST effectively identified confirmed fusion genes as outliers, and that DT and MST effectively identified genes that distinguish between two confirmed subtypes of pediatric acute megakaryoblastic leukemia. We conclude that LMS, DT, and MST are robust and complementary methods for SODA.

Acknowledgements

We gratefully acknowledge funding from ALSAC, which raises funds for St. Jude.

Author information

Authors and Affiliations

Department of Computational Biology, St. Jude Children’s Research Hospital, Memphis, TN, 38105, USA
Gang Wu & Jinghui Zhang
Department of Biostatistics, St. Jude Children’s Research Hospital, Memphis, TN, 38105, USA
Iwona Pawlikowska & Stan Pounds
Department of Oncology, St. Jude Children’s Research Hospital, Memphis, TN, 38105, USA
Tanja Gruber
Department of Pathology, St. Jude Children’s Research Hospital, Memphis, TN, 38105, USA
James Downing

Corresponding author

Correspondence to Stan Pounds.

About this article

Cite this article

Wu, G., Pawlikowska, I., Gruber, T. et al. Subgroup and outlier detection analysis. BMC Bioinformatics 14 (Suppl 17), A2 (2013). https://doi.org/10.1186/1471-2105-14-S17-A2
