Abstract
Particle Swarm Optimization (PSO) has a limitation of early convergence and needs to be improved to find the global optima. The main objective here is to improve its exploration capability without deteriorating the exploitation capability. For this purpose, a modified version of PSO, namely Elitist Random Swapped Particle Swarm Optimization (ERSPSO), has been proposed. The elitist (fittest) particles in the swarm guide the other particles to improve their position. To enhance exploration in the search process a swapping of the randomly selected parts of the elitist particle positions (candidate solution) has been made. Consequently, a perturbation is applied to further improve the exploration. The proposed ERSPSO has been applied to the full benchmark set of 25 functions (CEC 2005) as well as complex real life problems like ‘Gene selection by sample classification’. The new variant ERSPSO has been validated by the statistical metrics, convergence plot, sensitivity analysis using convergence behaviour, p-values using Wilcoxon rank sum test and Friedman rank test. For sample classification in Gene selection, VkNN (a new variant of kNN) is proposed which performs better than kNN in classification accuracy. The combined ERSPSO-VkNN is tested in 6 microarray datasets including 4 diseases. In most of the datasets (5 datasets out of 6) ERSPSO-VkNN performs better than the state-of-the-art methods. In different datasets, the percentage of classification accuracy of ERSPSO-VkNN varies between 89.29 and 100%. Finally, a biological verification is performed to show that many of the selected genes are biologically significant according to the reporting in current literature.
This is a preview of subscription content, access via your institution.





Data availability
The datasets generated and analysed during the current study are not publicly available due the fact that they constitute an excerpt of research in progress but are available from the corresponding author on reasonable request.
References
Allawh TC, Brown BS (2017) The clinical manifestations and genetic implications of Baraitser-Winter syndrome type 2. J Pediat Genet 6(2):107
Bansal JC et al. (2011) Inertia weight strategies in particle swarm optimization. In: 2011 Third world congress on nature and biologically inspired computing, October. IEEE, 633–640
Biswas S, Acharyya S (2016) Neural model of gene regulatory network: a survey on supportive meta-heuristics. Theory Biosci 135:1–19
Biswas S, Dutta S, Acharyya S (2019) Identification of disease critical genes using collective meta-heuristic approaches: an application to preeclampsia. Interdiscipl Sci: Computat Life Sci 11(3):444–459
Brew O (2018) Placental genomics: regulatory roles of histamine in preeclampsia (Doctoral dissertation, University of West London)
Buurma AJ et al (2013) Genetic variants in preeclampsia: a meta-analysis. Hum Reprod Update 19(3):289–303
Chen Y et al (2018) Particle swarm optimizer with crossover operation. Eng Appl Artif Intell 70:159–169
Chen CF, Zain AM, Mo LP, Zhou KQ (2020) A new hybrid algorithm based on ABC and PSO for function optimization. IOP Conf Series: Mater Sci Eng 864(1):012065. https://doi.org/10.1088/1757-899X/864/1/012065
Chen-Plotkin AS et al (2012) TMEM106B, the risk gene for frontotemporal dementia, is regulated by the microRNA-132/212 cluster and affects progranulin pathways. J Neurosci 32(33):11213–11227
Cho JH et al (2011) Systems biology of interstitial lung diseases: integration of mRNA and microRNA expression changes. BMC Med Genomics 4(1):8
Colas P (2020) Cyclin-dependent kinases and rare developmental disorders. Orphanet J Rare Dis 15(1):1–14
Craig VJ et al (2015) Matrix metalloproteinases as therapeutic targets for idiopathic pulmonary fibrosis. Am J Respir Cell Mol Biol 53(5):585–600
Das P, Jana B, Acharyya S (2021) A new variant of genetic algorithm for solving gene selection problem. In: Proceedings of the sixth international conference on mathematics and computing pp. 309–324. Springer, Singapore
Dashtban M, Balafar M, Suravajhala P (2018) Gene selection for tumor classification using a novel bio-inspired multi-objective approach. Genomics 110(1):10–17
De Luca P et al (2016) CtBP1 associates metabolic syndrome and breast carcinogenesis targeting multiple miRNAs. Oncotarget 7(14):18798
Dillies MA et al (2013) A comprehensive evaluation of normalization methods for Illumina high-throughput RNA sequencing data analysis. Brief Bioinform 14(6):671–683
Divorty N et al (2015) G protein-coupled receptor 35: an emerging target in inflammatory and cardiovascular disease. Front Pharmacol 6:41
Do C, Xing Z, Yu YE, Tycko B (2017) Trans-acting epigenetic effects of chromosomal aneuploidies: lessons from Down syndrome and mouse models. Epigenomics 9(2):189–207
Doubková M et al (2019) A novel germline mutation of the SFTPA1 gene in familial interstitial pneumonia. Human Genome Variat 6(1):1–6
Dutta P, Saha S (2017) Fusion of expression values and protein interaction information using multi-objective optimization for improving gene clustering. Comput Biol Med 89:31–43
Eberhart RC, Kennedy J (1995) A new optimizer using particle swarm theory, In: Proceedings of the sixth international symposium on micro machine and human science, IEEE 39–43
Eisen MB, Brown PO (1999) DNA arrays for analysis of gene expression. Methods in enzymology, vol 303. Academic Press, Cambridge, pp 179–205
Elsisi M (2020) Optimal design of nonlinear model predictive controller based on new modified multitracker optimization algorithm. Int J Intell Syst 35(11):1857–1878
Elsisi M, Tran MQ (2021) Development of an IoT architecture based on a deep neural network against cyber attacks for automated guided vehicles. Sensors 21(24):8467
Elsisi M, Zaini HG, Mahmoud K, Bergies S, Ghoneim SS (2021) Improvement of trajectory tracking by robot manipulator based on a new co-operative optimization algorithm. Mathematics 9(24):3231
Elsisi M (2022) Improved grey wolf optimizer based on opposition and quasi learning approaches for optimization: case study autonomous vehicle including vision system. Artificial intelligence review, 1–24
Engelbrecht AP, Cleghorn CW (2020) Recent advances in particle swarm optimization analysis and understanding. In: Proceedings of the 2020 genetic and evolutionary computation conference companion, July. 747–774
Fang X et al (2015) The NEK1 interactor, C21ORF2, is required for efficient DNA damage repair. Acta Biochim Biophys Sin 47(10):834–841
Gassner FJ et al (2018) Imprecision and DNA break repair biased towards incompatible end joining in leukemia. Mol Cancer Res 16(3):428–438
Ghosh AK, Kay NE (2013) Critical signal transduction pathways in CLL. In: Malek S (ed) Advances in chronic lymphocytic leukemia. Springer, New York, pp 215–239
Gricks CS et al (2004) Differential regulation of gene expression following CD40 activation of leukemic compared to healthy B cells. Blood 104(13):4002–4009
Guo H et al (2019) Disruptive mutations in TANC2 define a neurodevelopmental syndrome associated with psychiatric disorders. Nat Commun 10(1):1–17
Haklı H, Uğuz H (2014) A novel particle swarm optimization algorithm with Levy flight. Appl Soft Comput 23:333–345
Hamblin TJ et al (2002) CD38 expression and immunoglobulin variable region mutations are independent prognostic variables in chronic lymphocytic leukemia, but CD38 expression may vary during the course of the disease. Blood 99(3):1023–1029
Han F, Tang D, Sun YWT, Cheng Z, Jiang J, Li QW (2019) A hybrid gene selection method based on gene scoring strategy and improved particle swarm optimization. BMC Bioinfo 20(8):1–13
Hemalatha R, Prakash R, Sivapragash C (2020) Analysis on energy consumption in smart grid WSN using path operator calculus centrality based HSA-PSO algorithm. Soft Comput 24(14):10771–10783
Hromadnikova I et al (2014) First trimester screening of circulating C19MC microRNAs can predict subsequent onset of gestational hypertension. PLoS ONE 9(12):e113735
Ishibashi O et al (2012) Hydroxysteroid (17-β) dehydrogenase 1 is dysregulated by miR-210 and miR-518c that are aberrantly expressed in preeclamptic placentas: a novel marker for predicting preeclampsia. Hypertension 59(2):265
Izzo A et al (2017) Overexpression of chromosome 21 miRNAs may affect mitochondrial function in the hearts of down syndrome fetuses. Int J Geno. https://doi.org/10.1155/2017/8737649
Jain M, Singh V, Rani A (2019) A novel nature-inspired algorithm for optimization: squirrel search algorithm. Swarm Evol Comput 44:148–175
Jana B, Mitra S, Acharyya S (2019) Repository and mutation based particle swarm optimization (RMPSO): a new PSO variant applied to reconstruction of gene regulatory network. Appl Soft Comput 74:330–355
Jebbink J et al (2012) Molecular genetics of preeclampsia and HELLP syndrome—a review. Biochimica et Biophysica Acta (BBA)-Molec Basis Dis 1822(12):1960–1969
Jee AS et al (2017) Role of autoantibodies in the diagnosis of connective-tissue disease ILD (CTD-ILD) and interstitial pneumonia with autoimmune features (IPAF). J Clin Med 6(5):51
Jelinek DF et al (2003) Identification of a global gene expression signature of b-chronic lymphocytic leukemia1 1 Mayo Comprehensive Cancer Center, National Cancer Institute CA91542 (awarded to NE Kay), and generous philanthropic support provided by Edson Spencer. Mol Can Res 1(5):346–361
Jensi R, Jiji GW (2016) An enhanced particle swarm optimization with levy flight for global optimization. Appl Soft Comput 43:248–261
Jordehi AR (2015) Enhanced leader PSO (ELPSO): a new PSO variant for solving global optimisation problems. Appl Soft Comput 26:401–417
Kar S, Sharma KD, Maitra M (2015) Gene selection from microarray gene expression data for classification of cancer subgroups employing PSO and adaptive K-nearest neighborhood technique. Expert Syst Appl 42(1):612–627
Kennedy J, Eberhart RC (1995) Particle swarm optimization, In: Proceeding of the international conference on neural networks, IEEE, 4, 1942–1948
Khan AH, Ahmed S, Bera SK, Mirjalili S, Oliva D, Sarkar R (2022) Enhancing the contrast of the grey-scale image based on meta-heuristic optimization algorithm. Soft Comp. https://doi.org/10.1007/s00500-022-07033-8
Kimura K et al (2020) ARL4C is associated with initiation and progression of lung adenocarcinoma and represents a therapeutic target. Cancer Sci 111(3):951
Kipps TJ et al (2017) Chronic lymphocytic leukaemia. Nat Rev Dis Primers 3(1):1–22
Kołodziejczyk J, Tarasenko Y (2021) Particle swarm optimization and L´ evy flight integration. Procedia Comp Sci 192:4658–4671
Lee S et al (2004) Frameshift mutation in the Dok1 gene in chronic lymphocytic leukemia. Oncogene 23(13):2287–2297
Lee CP et al (2011) Gene selection and sample classification on microarray data based on adaptive genetic algorithm/k-nearest neighbor method. Expert Syst Appl 38(5):4661–4667
Li L, Weinberg CR, Darden TA, Pedersen LG (2001) Gene selection for sample classification based on gene expression data: study of sensitivity to choice of parameters of the GA/KNN method. Bioinformatics 17(12):1131–1142
Li L, Pedersen LG, Darden TA, Weinberg CR (2002) Computational analysis of leukemia microarray expression data using the GA/KNN method. Methods of microarray data analysis. Springer, Boston, pp 81–95
Li XL, Serra R, Julien O (2019) Effects of the particle swarm optimization parameters for structural dynamic monitoring of cantilever beam, July
Lin A et al (2019) Adaptive comprehensive learning particle swarm optimization with cooperative archive. Appl Soft Comput 77:533–546
Maity A, Das S (2020) Efficient hybrid local search heuristics for solving the travelling thief problem. Appl Soft Comput 93:106284
Mansouri N, Javidi MM (2020) A review of data replication based on meta-heuristics approach in cloud computing and data grid. Soft Comput 24(19):14503–14530
Mateos MK et al (2015) Down syndrome and leukemia: insights into leukemogenesis and translational targets. Translat Pediat 4(2):76
Matveeva A et al (2017) The TGF-beta—SMAD pathway is inactivated in cronic lymphocytic leukemia cells. Exp Oncol 39(4):286–290
McDonough JE et al (2019) Transcriptional regulatory model of fibrosis progression in the human lung. JCI Insight 4(22):e131597
Minami T (2011). Down syndrome expressed protein; DSCR-1 Deters Cancer and Septic Inflammation. Gene Etiol Down Syndrome, 121
Mohamad MS, Omatu S, Deris S, Yoshioka M, Zainal A (2009) An improved binary particle swarm optimisation for gene selection in classifying cancer classes. In: International work-conference on artificial neural networks, June, Springer, Berlin, Heidelberg, pp. 495–502
Moore SW, Zaahl MG (2012) Intronic RET gene variants in Down syndrome–associated Hirschsprung disease in an African population. J Pediatr Surg 47(2):299–302
Moore AW (2001) Cross-validation for detecting and preventing overfitting, School of Computer Science Carneigie Mellon University
Moslehi F, Haeri A, Martínez-Álvarez F (2020) A novel hybrid GA–PSO framework for mining quantitative association rules. Soft Comput 24(6):4645–4666
Mowery CT et al (2018) Trisomy of a Down syndrome critical region globally amplifies transcription via HMGN1 overexpression. Cell Rep 25(7):1898–1911
Myers JE et al (2013) Integrated proteomics pipeline yields novel biomarkers for predicting preeclampsia. Hypertension 61(6):1281–1288
Nakashima T et al (2008) Suppressor of cytokine signaling 1 inhibits pulmonary inflammation and fibrosis. J Allergy Clin Immunol 121(5):1269–1276
National Institute of General Medical Sciences, October, 2017,<https://www.nigms.nih.gov/education/Documents/pharma/newline cogenomics1.pdf
Nevado J et al (2015) PIAS4 is associated with macro/microcephaly in the novel interstitial 19p13. 3 microdeletion/microduplication syndrome. Eur J Human Genet 23(12):1615–1626
Nofrini V, Di Giacomo D, Mecucci C (2016) Nucleoporin genes in human diseases. Eur J Hum Genet 24(10):1388–1395
Peng X et al (2016) Plexin C1 deficiency permits synaptotagmin 7–mediated macrophage migration and enhances mammalian lung fibrosis. FASEB J 30(12):4056–4070
Prasad Y, Biswas KK, Hanmandlu M (2018) A recursive PSO scheme for gene selection in microarray data. Appl Soft Comput 71:213–225
Priya JS, Femina MA, Samuel RA (2020) APSO-MVS: an adaptive particle swarm optimization incorporating multiple velocity strategies for optimal leader selection in hybrid MANETs. Soft Comput 24(24):18349–18365
Puri V, Chauhan YK (2022) Offline parameter estimation of a modified permanent magnet generator using GSA and GSA-PSO. Soft Comput. https://doi.org/10.1007/s00500-021-06610-7
Quackenbush J (2002) Microarray data normalization and transformation. Nat Genet 32(4s):496
Ramaswamy R, Kandhasamy P, Palaniswamy S (2021) Feature selection for Alzheimer’s gene expression data using modified binary particle swarm optimization. IETE J Res. https://doi.org/10.1080/03772063.2021.1962747
Rauf HT, Shoaib U, Lali MI, Alhaisoni M, Irfan MN, Khan MA (2020) Particle swarm optimization with probability sequence for global optimization. IEEE Access 8:110535–110549
Rotoli BM et al (2007) Alveolar macrophages from normal subjects lack the NOS-related system y+ for arginine transport. Am J Respir Cell Mol Biol 37(1):105–112
Saha S, Biswas S, Acharyya S (2016) Gene selection by sample classification using k nearest neighbor and meta-heuristic algorithms. In: 2016 IEEE 6th International conference on advanced computing (IACC), IEEE, February. 250–255
Sahu B, Mishra D (2012) A novel feature selection algorithm using particle swarm optimization for cancer microarray data. Procedia Engineering 38:27–31
Saldarriaga MM et al (2019) A case of CCDC6-RET fusion mutation in adult acute lymphoblastic leukemia (ALL), a known activating mutation reported in ALL. Front Oncol. https://doi.org/10.3389/fonc.2019.01303
Shao Y, Chen J, Zheng J, Liu CR (2017) Effect of histone deacetylase HDAC3 on cytokines IL-18, IL-12 and TNF-α in patients with intrahepatic cholestasis of pregnancy. Cell Physiol Biochem 42(4):1294–1302
Sharp TV et al (2008) The chromosome 3p21 3-encoded gene, LIMD1, is a critical tumor suppressor involved in human lung cancer development. Proceed Nat Acad Sci 105(50):19932–19937
Shi Y, Eberhart RC (1998) A modified particle swarm optimizer, In: Proceedings of the international conference on evolutionary computation, IEEE, 69–73
Slikker W Jr (2018) Biomarkers and their impact on precision medicine. Exp Biol Med 243(3):211
Stock CJ et al (2019) Bromodomain and extraterminal (BET) protein inhibition restores redox balance and inhibits myofibroblast activation. BioMed Research International
Suganthan PN et al (2005) Problem definitions and evaluation criteria for the CEC 2005 special session on real-parameter optimization. KanGAL Report, 2005005 (2005), 2005
Sullivan KD et al (2016) Trisomy 21 consistently activates the interferon response. Elife 5:e16220
Taskiran EZ et al (2017) Homozygous indel mutation in CDH11 as the probable cause of Elsahy-Waters syndrome. Am J Med Genet A 173(12):3143–3152
Thanendrarajan S, Kim Y, Schmidt-Wolf IGH (2011) Understanding and targeting the Wnt/β-catenin signaling pathway in chronic leukemia. Leukemia research and treatment
Tran MQ, Elsisi M, Liu MK (2021) Effective feature selection with fuzzy entropy and similarity classifier for chatter vibration diagnosis. Measurement 184:109962
Tran MQ, Liu MK, Elsisi M (2022) Effective multi-sensor data fusion for chatter detection in milling process. ISA Trans 125:514–527
US Food and Drug Administration (2019) FDA warns about rare but severe lung inflammation with Ibrance, Kisqali, and Verzenio for breast cancer
Utami DA, Rustam Z (2019) Gene selection in cancer classification using hybrid method based on Particle Swarm Optimization (PSO), Artificial Bee Colony (ABC) feature selection and support vector machine. In: AIP conference proceedings (Vol. 2168, No. 1, p. 020047), November. AIP Publishing LLC
Valenzuela FJ et al (2012) Pathogenesis of preeclampsia: the genetic component. J Pregn. https://doi.org/10.1155/2012/632732
Vilardell M et al (2011) Meta-analysis of heterogeneous Down Syndrome data reveals consistent genome-wide dosage effects related to neurological processes. BMC Genomics 12(1):229
Walters DM et al (2014) Genetic susceptibility to interstitial pulmonary fibrosis in mice induced by vanadium pentoxide (V2O5). FASEB J 28(3):1098–1112
Wang L, Feng Y et al (2006) Prolylcarboxypeptidase gene, chronic hypertension, and risk of preeclampsia. Am J Obstet Gynecol 195(1):162–171
Wang L et al (2011) SF3B1 and other novel cancer genes in chronic lymphocytic leukemia. N Engl J Med 365(26):2497–2506
Wang H, Liang M, Sun C, Zhang G, Xie L (2021) Multiple-strategy learning particle swarm optimization for large-scale optimization problems. Complex & Intelligent Systems 7(1):1–16
White WM et al (2013) Genome-wide methylation profiling demonstrates hypermethylation in maternal leukocyte DNA in preeclamptic compared to normotensive pregnancies. Hypertens Pregnancy 32(3):257–269
Williams PJ, Pipkin FB (2011) The genetics of pre-eclampsia and other hypertensive disorders of pregnancy. Best Pract Res Clin Obstet Gynaecol 25(4):405–417
Xia X, Li S (2020) Research on improved chaotic particle optimization algorithm based on complex function. Frontiers in Physics 8:368
Ye W, Feng W, Fan S (2017) A novel multi-swarm particle swarm optimization with dynamic learning strategy. Appl Soft Comput 61:832–843
Ye M, Wang W, Yao C, Fan R, Wang P (2019) Gene selection method for microarray data classification using particle swarm optimization and neighborhood rough set. Curr Bioinform 14(5):422–431
Yepes S, Torres MM, López-Kleine L (2015) Regulatory network reconstruction reveals genes with prognostic value for chronic lymphocytic leukemia. BMC Genomics 16(1):1002
Yildiz M et al (2015) Activating STAT6 mutations in follicular lymphoma Blood. J Am Soci Hematol 125(4):668–679
Zhang JG, Deng HW (2007) Gene selection for classification of microarray data based on the Bayes error. BMC Bioinformatics 8(1):370
Zhang X, Lin Q (2022) Three-learning strategy particle swarm algorithm for global optimization problems. Inf Sci 593:289–313
Zhang K, Huang Q, Zhang Y (2019) Enhancing comprehensive learning particle swarm optimization with local optima topology. Inf Sci 47:1–18
Zhang X, Zhao K, Wang L, Wang Y, Niu Y (2020) An improved squirrel search algorithm with reproductive behavior. IEEE Access 8:101118–101132
Zheng T, Luo W (2019) An improved squirrel search algorithm for optimization. Complexity. https://doi.org/10.1155/2019/6291968
Zuo X, Xiao L (2014) A DE and PSO based hybrid algorithm for dynamic optimization problems. Soft Comput 18(7):1405–1424
Acknowledgements
We are grateful to TEQIP-III Maulana AbulKalam Azad University of Technology (MAKAUT), West Bengal, India for supporting our research.
Funding
The authors have no affiliation with any organization with a direct or indirect financial interest in the subject matter discussed in the manuscript.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Jana, B., Acharyya, S. Elitist random swapped particle swarm optimization embedded with variable k-nearest neighbour classification: a new PSO variant applied to gene identification. Soft Comput 27, 3169–3201 (2023). https://doi.org/10.1007/s00500-022-07515-9
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00500-022-07515-9
Keywords
- Gene identification
- Meta-heuristics
- PSO variant
- Sample classification
- Variable kNN