Soybean Genome Clustering Using Quantum-Based Fuzzy C-Means Algorithm

Rangoju, Sai Siddhartha Vivek Dhir; Garg, Keshav; Dandi, Rohith; Patel, Om Prakash; Bharill, Neha

doi:10.1007/978-981-99-8070-3_7

Sai Siddhartha Vivek Dhir Rangoju¹²,
Keshav Garg¹²,
Rohith Dandi¹²,
Om Prakash Patel¹² &
…
Neha Bharill¹²

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 14450))

Included in the following conference series:

International Conference on Neural Information Processing

510 Accesses

Abstract

Bioinformatics is a new area of research in which many computer scientists are working to extract some useful information from genome sequences in a very less time, whereas traditional methods may take years to fetch this. One of the studies that belongs to the area of Bioinformatics is protein sequence analysis. In this study, we have considered the soybean protein sequence which does not have class information therefore clustering of these sequences is required. As these sequences are very complex and consist of overlapping sequences, therefore Fuzzy C-Means algorithm may work better than crisp clustering. However, the clustering of these sequences is a very time-consuming process also the results are not up to the mark by using existing crisp and fuzzy clustering algorithms. Therefore we propose here a quantum Fuzzy c-Means algorithm that uses the quantum computing concept to represent the dataset in the quantum form. The proposed approach also use the quantum superposition concept which fastens the process and also gives better result than the FCM algorithm.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 69.99; Price excludes VAT (USA)

Softcover Book: USD 89.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Alawneh, L., Shehab, M.A., Al-Ayyoub, M., Jararweh, Y., Al-Sharif, Z.A.: A scalable multiple pairwise protein sequence alignment acceleration using hybrid CPU-GPU approach. Clust. Comput. 23, 2677–2688 (2020)
Article Google Scholar
de Almeida Paiva, V .: Protein structural bioinformatics: an overview. Comput. Biol. Med., 105695 (2022)
Google Scholar
Bystrof, C., Thorsson, V., Baker, D.: HMMSTR: a hidden Markov model for local sequence-structure correlations in proteins. J. Mol. Biol. 301(1), 173–190 (2000)
Article Google Scholar
Jha, P., Tiwari, A., Bharill, N., Ratnaparkhe, M., Mounika, M., Nagendra, N.: Apache Spark based kernelized fuzzy clustering framework for single nucleotide polymorphism sequence analysis. Comput. Biol. Chem. 92, 107454 (2021)
Article Google Scholar
Farhangi E., Ghadiri N., Asadi M., Nikbakht M.A., Pitre S.: Fast and scalable protein motif sequence clustering based on Hadoop framework. In: 3th International Conference on Web Research (ICWR), pp. 24–31. IEEE (2017)
Google Scholar
Bezde J.C.: Fuzzy-Mathematics In Pattern Classification. Cornell University (1973)
Google Scholar
Pakhira, M.K., Bandyopadhyay, S., Maulik, U.: A study of some fuzzy cluster validity indices, genetic clustering and application to pixel classification. Fuzzy Sets Syst. 155(2), 191–214 (2005)
Article MathSciNet Google Scholar
Shelokar, P.S., Jayaraman, V.K., Kulkarni, B.D.: An ant colony approach for clustering. Anal. Chim. Acta 509(2), 187–195 (2004)
Article Google Scholar
Kao, Y.T., Zahara, E., Kao, I.W.: A hybridized approach to data clustering. Expert Syst. Appli. 34(3), 1754–1762 (2008)
Article Google Scholar
Zhang, C., Ouyang, D., Ning, J.: An artificial bee colony approach for clustering. Expert Syst. Appli. 37(7), 4761–4767 (2010)
Article Google Scholar
Patel O.P., Bharill N., Tiwari A.: A Quantum-inspired fuzzy based evolutionary algorithm for data clustering. In: 2015 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE), Istanbul, Turkey, 2015, pp. 1–8 (2015). https://doi.org/10.1109/FUZZ-IEEE.2015.7337861
Bolshakova, N., Azuaje, F.: Cluster validation techniques for genome expression data. Signal Process. 83(4), 825–833 (2003)
Article MATH Google Scholar
Coelho G.P., Barbante C.C., Boccato L., Attux R.R., Oliveira J.R., Von Zuben F.J.: Automatic feature selection for BCI: an analysis using the davies-bouldin index and extreme learning machines. In: The 2012 international joint conference on neural networks (IJCNN), vol. 2012, pp. 1–8. IEEE (2012)
Google Scholar
Han, H.K., Kim, J.H.: Quantum-inspired evolutionary algorithm for a class of combinatorial optimization. IEEE Trans. Evol. Comput. 6(6), 580–593 (2002)
Article Google Scholar
Peter W.: Quantum machine learning: what quantum computing means to data mining, vol. 2014. Academic Press (2014)
Google Scholar
Wysmierski, P.T., Vello, N.A.: The genetic base of Brazilian soybean cultivars: evolution over time and breeding implications. Genet. Mol. Biol. 36, 547–555 (2013)
Article Google Scholar
Sedivy, E.J., Wu, F., Hanzawa, Y.: Soybean domestication: the origin, genetic architecture and molecular bases. New Phytol. 214(2), 539–553 (2017)
Article Google Scholar
Lee J.D., Shannon J.G., Vuong T.D., Nguyen H.T.: Inheritance of salt tolerance in wild soybean (Glycine soja Sieb. and Zucc.) accession PI483463. J. Heredity, 100(6), 798–801 (2009)
Google Scholar
Xie, M., et al.: A reference-grade wild soybean genome. Nat. Commun. 10(1), 1216 (2019)
Article Google Scholar
Yeung, K.Y., Ruzzo, W.L.: Details of the adjusted rand index and clustering algorithms, supplement to the paper an empirical study on principal component analysis for clustering gene expression data. Bioinformatics 17(9), 763–774 (2001)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science and Engineering, Ecole Centrale School of Engineering, Mahindra University, Hyderabad, India
Sai Siddhartha Vivek Dhir Rangoju, Keshav Garg, Rohith Dandi, Om Prakash Patel & Neha Bharill

Authors

Sai Siddhartha Vivek Dhir Rangoju
View author publications
You can also search for this author in PubMed Google Scholar
Keshav Garg
View author publications
You can also search for this author in PubMed Google Scholar
Rohith Dandi
View author publications
You can also search for this author in PubMed Google Scholar
Om Prakash Patel
View author publications
You can also search for this author in PubMed Google Scholar
Neha Bharill
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Om Prakash Patel .

Editor information

Editors and Affiliations

Central South University, Changsha, China
Biao Luo
Chinese Academy of Sciences, Beijing, China
Long Cheng
Zhejiang University, Hangzhou, China
Zheng-Guang Wu
Guangdong University of Technology, Guangzhou, China
Hongyi Li
UNSW Sydney, Sydney, NSW, Australia
Chaojie Li

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Rangoju, S.S.V.D., Garg, K., Dandi, R., Patel, O.P., Bharill, N. (2024). Soybean Genome Clustering Using Quantum-Based Fuzzy C-Means Algorithm. In: Luo, B., Cheng, L., Wu, ZG., Li, H., Li, C. (eds) Neural Information Processing. ICONIP 2023. Lecture Notes in Computer Science, vol 14450. Springer, Singapore. https://doi.org/10.1007/978-981-99-8070-3_7

Download citation

DOI: https://doi.org/10.1007/978-981-99-8070-3_7
Published: 15 November 2023
Publisher Name: Springer, Singapore
Print ISBN: 978-981-99-8069-7
Online ISBN: 978-981-99-8070-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics