An Approach Towards Most Cancerous Gene Selection from Microarray Data

Das, Sunanda; Das, Asit Kumar

doi:10.1007/978-81-322-2202-6_58

Sunanda Das⁷ &
Asit Kumar Das⁸

Part of the book series: Smart Innovation, Systems and Technologies ((SIST,volume 33))

1367 Accesses

Abstract

Microarray gene dataset is often very high-dimensional which presents complicated problems, like the degradation of data accessing, data manipulating and query processing performance. Dimensionality reduction efficiently tackles this problem and benefited us to visualize the intrinsic properties hidden in the dataset. Therefore, Rough set theory (RST) has been used for selecting only the relevant attributes of the dataset, called reduct, sufficient to characterize the information system. The investigation has been carried out on the publicly available microarray dataset. The analysis revealed that Rough Set using the concepts of dependency among genes is able to extract the various dominant genes in term of reducts which play an important role in causing the disease. Experimental results show the effectiveness of the algorithm.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Hardcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Chandrashekar, G., Sahin, F.: A survey on feature selection methods. Comput. Electr. Eng. 40(1), 16–28 (2014)
Article Google Scholar
Velayutham, C., Thangavel, K.: Unsupervised quick reduct algorithm using rough set theory. J. Electr. Sci. Technol. 9(3), 193–201 (2011)
Google Scholar
Lazar, C., Taminau, J., Meganck, S., Steenhoff, D., Coletta, A., Molter, C., de Schaetzen, V., Duque, R., Bersini, H., Nowe, A.: A survey on filter techniques for feature selection in gene expression microarray analysis. IEEE/ACM Trans. Comput. Biol. Bioinform. 9(4), 1106–1119 (2012)
Article Google Scholar
Dash, M., Liu, H.: Feature selection for classification. Intell. Data Anal. 1(3), 131–156 (1997)
Article Google Scholar
Kira, K., Rendell, L.A.: The feature selection problem: traditional methods and a new algorithm. In: Proceedings of Ninth National Conference on Artificial Intelligence, pp. 129–134 (1992)
Google Scholar
Langley, P.: Selection of relevant features in machine learning. In: Proceedings on AAAI Fall Symposium Relevance, pp. 1–5 (1994)
Google Scholar
Liu, H., Motoda, H.: Feature Extraction, Construction and Selection: A Data Mining Perspective (Kluwer International Series in Engineering & Computer Science). Academic Publishers, New York (1998)
Google Scholar
Miller A.J., Hall, C.: Subset Selection in Regression (1990)
Google Scholar
Pawlak, Z.: Rough Sets: Theoretical Aspects of Reasoning About Data. Kluwer Academic Publishing, Norwell (1991)
Google Scholar
Polkowski, L.: Rough Sets: Mathematical Foundations. Advances in Soft Computing. Physica Verlag, Heidelberg (2002)
Google Scholar
Baixeries, J.: A formal concept analysis framework to mine functional dependencies. In: Proceeding of the Workshop on Mathematical Methods for Learning (2004)
Google Scholar
Kerber, R., ChiMerge.: Discretization of Numeric Attributes. In: Proceedings of AAAI-92, Ninth International Conference on Artificial Intelligence, AAAI-Press, pp. 123–128 (1992)
Google Scholar
Yu, L., Liu, H.: Efficient feature selection via analysis of relevance and redundancy. J. Mach. Learn. Res. 5, 1205–1224 (2004)
MATH Google Scholar
Hall, M.A.: Correlation-based feature selection for machine learning. The University of Waikato, New Zealand (1999)
Google Scholar
Hall, M., Frank, E., Holmes, G., Pfahringer, B., Reutemann, P., Witten, I.H.: The WEKA data mining software: an update. SIGKDD Explor. 11(1), 10–18 (2009)
Google Scholar

Download references

Author information

Authors and Affiliations

Management and Science, Neotia Institute of Technology, Jhinga, Diamond Harbour, South 24-Pargana, Calcutta, West Bengal, 743368, India
Sunanda Das
Department of Computer Science and Technology, Indian Institute of Engineering Science and Technology, Shibpur, Howrah, 711103, India
Asit Kumar Das

Authors

Sunanda Das
View author publications
You can also search for this author in PubMed Google Scholar
Asit Kumar Das
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Sunanda Das .

Editor information

Editors and Affiliations

School of Electrical and Information Engineering, University of South Australia, South Australia, Australia
Lakhmi C. Jain
Computer Science and Engineering, Veer Surendra Sai University of Technolo, Sambalpur, Odisha, India
Himansu Sekhar Behera
Computer Science & Engineering, Kalyani University, Nadia, West Bengal, India
Jyotsna Kumar Mandal
Dept. of Computer Science and Eng., National Institute of Technology Rourkela, Rourkela, India
Durga Prasad Mohapatra

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Das, S., Das, A.K. (2015). An Approach Towards Most Cancerous Gene Selection from Microarray Data. In: Jain, L., Behera, H., Mandal, J., Mohapatra, D. (eds) Computational Intelligence in Data Mining - Volume 3. Smart Innovation, Systems and Technologies, vol 33. Springer, New Delhi. https://doi.org/10.1007/978-81-322-2202-6_58

Download citation

DOI: https://doi.org/10.1007/978-81-322-2202-6_58
Published: 12 December 2014
Publisher Name: Springer, New Delhi
Print ISBN: 978-81-322-2201-9
Online ISBN: 978-81-322-2202-6
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics