Unsupervised Dense Regions Discovery in DNA Microarray Data

Yip, Andy M.; Wu, Edmond H.; Ng, Michael K.; Chan, Tony F.

doi:10.1007/978-3-540-28651-6_11

Andy M. Yip¹⁹,
Edmond H. Wu²⁰,
Michael K. Ng²⁰ &
…
Tony F. Chan¹⁹

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 3177))

Included in the following conference series:

International Conference on Intelligent Data Engineering and Automated Learning

1300 Accesses

Abstract

In this paper, we introduce the notion of dense regions in DNA microarray data and present algorithms for discovering them. We demonstrate that dense regions are of statistical and biological significance through experiments. A dataset containing gene expression levels of 23 primate brain samples is employed to test our algorithms. Subsets of potential genes distinguishing between species and a subset of samples with potential abnormalities are identified.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Cáceres, M., et al.: Elevated gene expression levels distinguish human from non-human primate brains. PNAS 100, 13030–13035 (2003)
Article Google Scholar
Dennis, G., et al.: DAVID: database for annotation, visualization, and integrated discovery. Genome Biology 4(5), 3 (2003)
Article Google Scholar
Eisen, M.B., Spellman, P.T., Brown, P.O., Botstein, D.: Cluster analysis and display of genome-wide expression patterns. PNAS 85, 14863–14868 (1998)
Article Google Scholar
Lee, M.T.: Analysis of microarray gene expression data. Kluwer Academic Publishers, Dordrecht (2004)
Google Scholar
Li, C., Wong, W.H.: Model-based analysis of oligonecleotide arrays: expression index computation and outlier detection. PNAS 36, 31–36 (2001)
Article Google Scholar
Yang, C., Fayyad, U., Bradley, P.S.: Efficient discovery of error-tolerant frequent itemsets in high dimensions. In: Proc. of ACM SIGKDD Intl. Conf. on Knowledge Discovery and Data Mining, San Francisco, California, pp. 194–203 (2001)
Google Scholar
Yip, M., Wu, E.H., Ng, M.K., Chan, T.F.: An efficient algorithm for dense regions discovery from large-scale data streams. In: Proc. of 8th Pacific-Asia Conf. on on Knowledge Discovery and Data Mining: Sydney, Australia, pp. 116–120 (2004); an extended version availiable at: UCLA CAM Reports 03-76, Math. Dept., University of California, Los Angeles, CA (2003)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Mathematics, University of California, 405 Hilgard Avenue, Los Angeles, CA, 90095-1555, USA
Andy M. Yip & Tony F. Chan
Department of Mathematics, The University of Hong Kong, Pokfulam Road, Hong Kong
Edmond H. Wu & Michael K. Ng

Authors

Andy M. Yip
View author publications
You can also search for this author in PubMed Google Scholar
Edmond H. Wu
View author publications
You can also search for this author in PubMed Google Scholar
Michael K. Ng
View author publications
You can also search for this author in PubMed Google Scholar
Tony F. Chan
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

School of Engineering, Computing, and Mathematics, University of Exeter, EX4 4QF, Exeter, UK
Zheng Rong Yang
School of Electrical and Electronic Engineering, University of Manchester, UK
Hujun Yin
School of Engineering, Computer Science and Mathematics, University of Exeter, EX4 4QF, UK
Richard M. Everson

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Yip, A.M., Wu, E.H., Ng, M.K., Chan, T.F. (2004). Unsupervised Dense Regions Discovery in DNA Microarray Data. In: Yang, Z.R., Yin, H., Everson, R.M. (eds) Intelligent Data Engineering and Automated Learning – IDEAL 2004. IDEAL 2004. Lecture Notes in Computer Science, vol 3177. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-28651-6_11

Download citation

DOI: https://doi.org/10.1007/978-3-540-28651-6_11
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-22881-3
Online ISBN: 978-3-540-28651-6
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics