Rank Aggregation for Candidate Gene Identification

Burkovski, Andre; Lausser, Ludwig; Kraus, Johann M.; Kestler, Hans A.

doi:10.1007/978-3-319-01595-8_31

Andre Burkovski^21,22,
Ludwig Lausser²¹,
Johann M. Kraus²¹ &
…
Hans A. Kestler²¹

Part of the book series: Studies in Classification, Data Analysis, and Knowledge Organization ((STUDIES CLASS))

5308 Accesses
1 Citations

Abstract

Differences of molecular processes are reflected, among others, by differences in gene expression levels of the involved cells. High-throughput methods such as microarrays and deep sequencing approaches are increasingly used to obtain these expression profiles. Often differences of gene expression across different conditions such as tumor vs inflammation are investigated. Top scoring differential genes are considered as candidates for further analysis. Measured differences may not be related to a biological process as they can also be caused by variation in measurement or by other sources of noise. A method for reducing the influence of noise is to combine the available samples. Here, we analyze different types of combination methods, early and late aggregation and compare these statistical and positional rank aggregation methods in a simulation study and by experiments on real microarray data.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Ailon, N., Charikar, M., & Newman, A. (2008). Aggregating inconsistent information: Ranking and clustering. Journal of the ACM, 55, 23:1–23:27.
Google Scholar
Alon, U., Barkai, N., Notterman, D. A., Gish, K., Ybarra, S., Mack, D., et al. (1999). Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays. Proceedings of the National Academy of Sciences of USA, 96(12), 6745–6750.
Article Google Scholar
Copeland, A. (1951). A ‘reasonable’ social welfare function. Seminar on Mathematics in Social Sciences, University of Michigan.
Google Scholar
DeConde, R. P., Hawley, S., Falcon, S., Clegg, N., Knudsen, B., & Etzioni, R. (2006). Combining results of microarray experiments: A rank aggregation approach. Statistical Applications in Genetics and Molecular Biology, 5, 1–23.
Article MathSciNet Google Scholar
Diaconis, P., & Graham, R. L. (1977). Spearman’s footrule as a measure of disarray. Journal of the Royal Statistical Society. Series B (Methodological), 39(2), 262–268.
MathSciNet MATH Google Scholar
Dwork, C., Kumar, R., Naor, M., & Sivakumar, D. (2001). Rank aggregation revisited. Systems Research, 13(2), 86–93.
Google Scholar
Fagin, R., Kumar, R., & Sivakumar, D. (2003). Comparing top k lists. In Proceedings of the Fourteenth Annual ACM-SIAM Symposium on Discrete Algorithms (pp. 28–36). Philadelphia: SIAM.
Google Scholar
Kolde, R., Laur, S., Adler, P., & Vilo, J. (2012). Robust rank aggregation for gene list integration and meta-analysis. Bioinformatics, 28(4), 573–580.
Article Google Scholar
Lin, S. (2010). Rank aggregation methods. Wiley Interdisciplinary Reviews: Computational Statistics, 2(5), 555–570.
Article Google Scholar
Pihur, V., Datta, S., & Datta, S. (2007). Weighted rank aggregation of cluster validation measures: A monte carlo cross-entropy approach. Bioinformatics 23(13), 1607–1615.
Article Google Scholar
Pihur, V., Datta, S., & Datta, S. (2008). Finding common genes in multiple cancer types through meta-analysis of microarray experiments: A rank aggregation approach. Genomics, 92(6), 400–403.
Article Google Scholar
Schalekamp, F., & Zuylen, A. (2009). Rank aggregation: Together we’re strong. In Proceedings of the 11th Workshop on Algorithm Engineering and Experiments (pp. 38–51). Philadelphia: SIAM.
Google Scholar
Shipp, M. A., Ross, K. N., Tamayo, P., Weng, A. P., Kutok, J. L., Aguiar, R. C., et al. (2002). Diffuse large b-cell lymphoma outcome prediction by gene-expression profiling and supervised machine learning. Nature Medicine, 8(1), 68–74.
Article Google Scholar
Smyth, G. K. (2004). Linear models and empirical Bayes methods for assessing differential expression in microarray experiments. Statistical Applications in Genetics and Molecular Biology, 3(1), 3.
Article MathSciNet Google Scholar
West, M., Blanchette, C., Dressman, H., Huang, E., Ishida, S., Spang, R., et al. (2001) Predicting the clinical status of human breast cancer by using gene expression profiles. Proceedings of the National Academy of Sciences of USA, 98(20), 11462–11467.
Article Google Scholar

Download references

Acknowledgements

This work was funded in part by the German federal ministry of education and research (BMBF) within the framework of the program of medical genome research (PaCa-Net; Project ID PKB-01GS08) and the framework GERONTOSYS 2 (Forschungskern SyStaR, Project ID 0315894A), and by the German Science Foundation (SFB 1074, Project Z1) and the International Graduate School in Molecular Medicine at Ulm University (GSC270). The responsibility for the content lies exclusively with the authors.

Author information

Authors and Affiliations

Research Group Bioinformatics and Systems Biology, Institute of Neural Information Processing, Ulm University, 89069, Ulm, Germany
Andre Burkovski, Ludwig Lausser, Johann M. Kraus & Hans A. Kestler
International Graduate School in Molecular Medicine, Ulm University, Ulm, Germany
Andre Burkovski

Authors

Andre Burkovski
View author publications
You can also search for this author in PubMed Google Scholar
Ludwig Lausser
View author publications
You can also search for this author in PubMed Google Scholar
Johann M. Kraus
View author publications
You can also search for this author in PubMed Google Scholar
Hans A. Kestler
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Hans A. Kestler .

Editor information

Editors and Affiliations

Faculty of Computer Science, Otto-von-Guericke-Universität Magdeburg, Magdeburg, Germany
Myra Spiliopoulou
Institute of Computer Science, University of Hildesheim, Hildesheim, Germany
Lars Schmidt-Thieme
Institute of Computer Science, University of Hildesheim, Hildesheim, Germany
Ruth Janning

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Burkovski, A., Lausser, L., Kraus, J.M., Kestler, H.A. (2014). Rank Aggregation for Candidate Gene Identification. In: Spiliopoulou, M., Schmidt-Thieme, L., Janning, R. (eds) Data Analysis, Machine Learning and Knowledge Discovery. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Cham. https://doi.org/10.1007/978-3-319-01595-8_31

Download citation

DOI: https://doi.org/10.1007/978-3-319-01595-8_31
Published: 10 October 2013
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-01594-1
Online ISBN: 978-3-319-01595-8
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)

Publish with us

Policies and ethics