Skip to main content
Log in

AFP-SRC: identification of antifreeze proteins using sparse representation classifier

  • Original Article
  • Published:
Neural Computing and Applications Aims and scope Submit manuscript

Abstract

Species living in the extreme cold environment fight against the harsh conditions using antifreeze proteins (AFPs), which manipulate the freezing mechanism of water in more than one way. This amazing nature of AFPs turns out to be extremely useful in several industrial and medical applications. The lack of similarity in their structure and sequence makes their prediction an arduous task, and identifying them experimentally in the wet laboratory is time-consuming and expensive. In this research, we propose a computational framework for the prediction of AFPs, which is essentially based on a sample-specific classification method using sparse reconstruction. A linear model and an over-complete dictionary matrix (ODM) of known AFPs are used to predict a sparse class-label vector that provides a sample-association score. Delta rule is applied for the reconstruction of two pseudo-samples using lower and upper parts of the sample-association vector and based on the minimum recovery score, class labels are assigned. We compare our approach with contemporary methods on a standard dataset. The proposed method outperforms the contemporary methods in terms of balanced accuracy and Youden’s index. The MATLAB implementation of the proposed method is available at the author’s GitHub page (https://github.com/Shujaat123/AFP-SRC).

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8

Similar content being viewed by others

References

  1. Prathalingam NS, Holt WV, Revell SG, Mirczuk S, Fleck RA, Watson PF (2006) Impact of antifreeze proteins and antifreeze glycoproteins on bovine sperm during freeze-thaw. Theriogenology 66(8):1894–1900

    Article  Google Scholar 

  2. Qadeer S, Khan MA, Ansari MS, Rakha BA, Ejaz R, Iqbal R, Younis M, Ullah N, DeVries Arthur L, Akhter S (2015) Efficiency of antifreeze glycoproteins for cryopreservation of nili-ravi (bubalus bubalis) buffalo bull sperm. Animal Reprod Sci 157:56–62

  3. Aung HN, Chang KK (2019) A brief review of applications of antifreeze proteins in cryopreservation and metabolic genetic engineering. 3Biotech 9(9):329

  4. Griffith M, Vanya Ewart K (1995) Antifreeze proteins and their potential use in frozen foods. Biotechnol Adv 13(3):375–402

    Article  Google Scholar 

  5. Duman JG, Wisniewski Michael J (2014) The use of antifreeze proteins for frost protection in sensitive crop plants. Environ Exp Bot 106:60–69

    Article  Google Scholar 

  6. Rubinsky B, Arav A, Hong JS, Lee CY (1994) Freezing of mammalian livers with glycerol and antifreeze proteins. Biochem Biophys Res Commun 200(2):732–741

    Article  Google Scholar 

  7. Kumar BR, Sudipto S, Santasabuj D (2014) Prediction of interactions between viral and host proteins using supervised machine learning methods. PloS One 9(11):e112034

    Article  Google Scholar 

  8. Khan S (2015) Machine learning approaches for the prediction of protein sequences. Master’s thesis, Karachi Institute of Economics and Technology

  9. Park S, Khan S, Wahab A (2020) E3-targetpred: prediction of e3-target proteins using deep latent space encoding. arXiv preprint arXiv:2007.12073

  10. Swarup KG, Anupam G (2020) Classification of gene expression patterns using a novel type-2 fuzzy multigranulation-based svm model for the recognition of cancer mediating biomarkers. Neural Comput Appl 1–19

  11. Marika K, Katrin SB, Mirko W, Mateusz K, Kaja G, Jacek B, Thomas V (2021) Learning vector quantization as an interpretable classifier for the detection of sars-cov-2 types based on their rna sequences. Neural Comput Appl 1–12

  12. Khan S, Huh J, Ye J. C (2021) Variational formulation of unsupervised deep learning for ultrasound image artifact removal. IEEE Trans Ultrasonics Ferroelec Freq Control

  13. Vives-BV, Ruiz-FD (2021) Fundamentals of artificial metaplasticity in radial basis function networks for breast cancer classification. Neural Comput Appl 1–12

  14. Wang F, Guan Z-X, Dao F-Y, Ding H (2019) A brief review of the computational identification of antifreeze protein. Curr Org Chem 23(15):1671–1680

    Article  Google Scholar 

  15. Kandaswamy KK, Kuo-Chen C, Thomas M, Steffen M, Suganthan PN, Sridharan S, Pugalenthi G (2011) AFP-Pred: a random forest approach for predicting antifreeze proteins from sequence-derived. J Theor Biol 270:56–62

    Article  Google Scholar 

  16. Chin-Sheng Yu, Chih-Hao L (2011) Identification of antifreeze proteins and their functional residues by support vector machine and genetic algorithms based on n-peptide compositions. PloS One 6(5):e20445

    Article  Google Scholar 

  17. Xiaowei Z, Zhiqiang M, Minghao Y (2012) Using support vector machine and evolutionary profiles to predict antifreeze protein sequences. Int J Mol Sci 13:2196–2207

    Article  Google Scholar 

  18. Sukanta M, Pai Priyadarshini P (2014) Chou’s pseudo amino acid composition improves sequence-based antifreeze protein prediction. J Theor Biol 356:30–35

  19. Eslami M, Zade RSH, Takalloo Z, Mahdevar G, Emamjomeh A, Sajedi RH, Zahiri J (2018) afpcool: a tool for antifreeze protein prediction. Heliyon 4(7):e00705

    Article  Google Scholar 

  20. Deepak G, Parashjyoti B, Usha MS, Mukesh P (2021) Data-driven mechanism based on fuzzy lagrangian twin parametric-margin support vector machine for biomedical data analysis. Neural Comput Appl 1–11

  21. Popa Iolanda V, Burlacu A, Gavrilescu O, Dranga M, Prelipcean Cristina C, Mihai C (2021) A new approach to predict ulcerative colitis activity through standard clinical–biological parameters using a robust neural network model. Neural Comput Appl 1–14

  22. Huang Y-A, You Z-H, Chen X, Chan K, Luo X (2016) Sequence-based prediction of protein-protein interactions using weighted sparse representation model combined with global encoding. BMC Bioinform 17(1):1–11

    Article  Google Scholar 

  23. Khan S, Huh J, Ye Jong C (2020) Adaptive and compressive beamforming using deep learning for medical ultrasound. IEEE Trans Ultrasonics Ferroelect Freq Control 1

  24. Kong M, Zhang Y, Da X, Chen W, Dehmer M (2020) Fctp-wsrc: protein-protein interactions prediction via weighted sparse representation based classification. Front Genetics 11:18

    Article  Google Scholar 

  25. Naseem I, Khan S, Togneri R, Bennamoun M (2017) Ecmsrc: a sparse learning approach for the prediction of extracellular matrix proteins. Curr Bioinform 12(4):361–368

    Article  Google Scholar 

  26. Naseem I, Togneri R, Bennamoun M (2008) Sparse representation for ear biometrics. In: International symposium on visual computing. Springer, pp 336–345

  27. Naseem I, Togneri R, Bennamoun M (2010) Sparse representation for speaker identification. In: 2010 20th international conference on pattern recognition. IEEE, pp 4460–4463

  28. Naseem I, Togneri R, Bennamoun M (2012) Robust regression for face recognition. Patt Recogn 45(1):104–118

    Article  Google Scholar 

  29. Wang Z, Li Y, You Z-H, Li L-P, Zhan X-K, Pan J (2020) Prediction of protein-protein interactions from protein sequences by combining matpca feature extraction algorithms and weighted sparse representation models. Math Probl Eng

  30. Hun YY, Shujaat K, Jaeyoung H, Chul YJ (2018) Efficient b-mode ultrasound image reconstruction from sub-sampled rf data using deep learning. IEEE Trans Med Imaging 38(2):325–336

    Article  Google Scholar 

  31. Elad M (2010) Sparse and redundant representations: from theory to applications in signal and image processing. Springer Science & Business Media

  32. Bengio Y, Courville A, Vincent P (2013) Representation learning: a review and new perspectives. IEEE Trans Patt Anal Mach Intell 35(8):1798–1828

    Article  Google Scholar 

  33. Xiyi H, Fang-Xiang W (2009) Sparse representation for classification of tumors using gene expression data. J Biomed Biotechnol

  34. Li Y, Ngom A (2013) Sparse representation approaches for the classification of high-dimensional biological data. BMC Syst Biol 7(4):1–14

    Google Scholar 

  35. Elad M, Aharon M (2006) Image denoising via learned dictionaries and sparse representation. In: 2006 IEEE computer society conference on computer vision and pattern recognition (CVPR’06), vol 1. IEEE, pp 895–900

  36. Li Y, Ngom A (2013) The non-negative matrix factorization toolbox for biological data mining. Source Code Biol Med 8(1):1–15

    Article  Google Scholar 

  37. Abavisani M, Patel Vishal M (2019) Deep sparse representation-based classification. IEEE Sig Process Lett 26(6):948–952

    Article  Google Scholar 

  38. Alim A, Abdul R, Naseem I (2020) Pogb-pred: prediction of antifreeze proteins sequences using amino acid composition with feature selection followed by a sequential-based ensemble approach. Curr Bioinform 15(1)

  39. Khan S, Naseem I, Togneri R, Bennamoun M (2016) Rafp-pred: robust prediction of antifreeze proteins using localized analysis of n-peptide compositions. IEEE/ACM Trans Comput Biol Bioinform 15(1):244–250

    Article  Google Scholar 

  40. Usman M, Khan S, Lee J-A (2020) Afp-lse: Antifreeze proteins prediction using latent space encoding of composition of k-spaced amino acid pairs. Sci Rep 10(1):1–13

    Article  Google Scholar 

  41. Usman M, Lee JA (2019) Afp-cksaap: prediction of antifreeze proteins using composition of k-spaced amino acid pairs with deep neural network. In: 2019 IEEE 19th international conference on bioinformatics and bioengineering (BIBE), pp 38–43

  42. Park S, Khan S, Moinuddin M, Al-Saggaf Ubaid M (2020) Gssmd: a new standardized effect size measure to improve robustness and interpretability in biological applications. In: 2020 IEEE international conference on bioinformatics and biomedicine (BIBM). IEEE, pp 1096–1099

  43. Ringnér M (2008) What is principal component analysis? Nat Biotechnol 26(3):303

    Article  Google Scholar 

  44. Liaqat A, Iram W, Noorbakhsh AG, Fazel K, Syed ACB (2021) Lda–ga–svm: improved hepatocellular carcinoma prediction through dimensionality reduction and genetically optimized support vector machine. Neural Comput Appl 33(7):2783–2792

  45. Erkan U (2020) A precise and stable machine learning algorithm: eigenvalue classification (eigenclass). Neural Comput Appl 1–12

  46. Donoho DL (2006) For most large underdetermined systems of linear equations the minimal \(l_1\)-norm solution is also the sparsest solution. Commun Pure Appl Math 59(6):797–829

    MathSciNet  MATH  Google Scholar 

  47. Candès E (2008) The restricted isometry property and its implications for compressed sensing. C R Acad Sci Paris Ser I 346:589–592

  48. Candès E, Romberg J, Tao T (2006) Stable signal recovery from incomplete and inaccurate measurements. Commun Pure Appl Math 59(8):1207–1223

    Article  MathSciNet  MATH  Google Scholar 

  49. Ma Y, Zhu X, Zhu S, Keshou W, Chen Y (2018) Combating the class imbalance problem in sparse representation learning. J Intell Fuzzy Syst 35(2):1865–1874

    Article  Google Scholar 

  50. Liu Z, Gao C, Yang H, He Q (2016) A cost-sensitive sparse representation based classification for class-imbalance problem. Sci Programm

  51. Ting S, Bob Z, Yan TY (2018) Sparse supervised representation-based classifier for uncontrolled and imbalanced classification. IEEE Trans Neural Netw Learn Syst 31(8):2847–2856

    MathSciNet  Google Scholar 

  52. Priya A, Ashutosh M, Avleen M (2021) N-semble-based method for identifying parkinson’s disease genes. Neural Comput Appl 1–11

  53. Zou X, Feng Y, Li H, Jiang S (2017) Srot: sparse representation-based over-sampling technique for classification of imbalanced dataset. In: IOP conference series: earth and environmental science, vol 81. IOP Publishing, p 012201

  54. Elyan E, Moreno-Garcia CF, Jayne C (2021) Cdsmote: class decomposition and synthetic minority class oversampling technique for imbalanced-data classification. Neural Comput Appl 33(7):2839–2851

    Article  Google Scholar 

  55. Sun J, Wang X, Xiong N, Shao J (2018) Learning sparse representation with variational auto-encoder for anomaly detection. IEEE Access 6:33353–33361

    Article  Google Scholar 

  56. Gong D, Liu L, Le V, Saha B, Mansour Moussa R, Venkatesh S, van den Hengel (2019) Memorizing normality to detect anomaly: Memory-augmented deep autoencoder for unsupervised anomaly detection. In: Proceedings of the IEEE/CVF international conference on computer vision, pp 1705–1714

Download references

Acknowledgements

The authors would like to thank Misbah Tariq for helping in visualization. We would also like to thank learned referees and editor for their valuable and useful suggestions.

Funding

The authors received no financial support for the research, authorship, and/or publication of this article.

Author information

Authors and Affiliations

Authors

Contributions

MU and SK contributed equally, they designed the research, conducted the experiments and wrote the manuscript, SP conceived the DeepSRC experiments and performed analysis, AW supervised the study and analyzed the results. All authors discussed the results and reviewed the manuscript.

Corresponding author

Correspondence to Shujaat Khan.

Ethics declarations

Conflict of Interest

The authors reported no conflict of interest.

Code Availability

All the data used in this study along with the MATLAB and python implementations of the SRC and DeepSRC methods are available at the author’s GitHub page (https://github.com/Shujaat123/AFP-SRC).

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Usman, M., Khan, S., Park, S. et al. AFP-SRC: identification of antifreeze proteins using sparse representation classifier. Neural Comput & Applic 34, 2275–2285 (2022). https://doi.org/10.1007/s00521-021-06558-7

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00521-021-06558-7

Keywords

Navigation