Skip to main content


This chapter is based on the prediction of MoRF regions within the intrinsically disordered protein sequence. Disordered proteins have molecular recognition regions (MoRF) making them highly attractive to bind with protein pairs. Thus, as they combine with other protein pairs, they undergo disorder-to-order transition making them essential for various biological functions. Therefore, the project is tasked to obtain structural information of the disordered protein sequence and perform machine learning techniques to predict the MoRF regions in disordered protein sequences. The proposed method for the project will focus on programming and simulation analysis using the MATLAB software for which structural information will be extracted from the disordered protein sequences. Using these sequences, the project is aimed to perform training and testing implementation. Two test methods are used to evaluate the performance of the trained SVM models. Analysis has shown that the cross-validation test method outperforms the independent test method.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
USD 229.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD 299.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions


  1. Sharma R, Kumar S, Tsunoda T, Patil A, Sharma A (2016) Predicting MoRFs in protein sequences using HMM profiles. BMC Bioinform 17(19). Available:

  2. Sharma R, Sharma A, Patil A, Tsunoda T (2019) Discovering MoRFs by trisecting intrinsically disordered protein sequence into terminals and middle regions. BMC Bioinform 19(13). Available:

  3. Sharma R, Raicar G, Tsunoda T, Patil A, Sharma A (2018) OPAL: prediction of MoRF regions in intrinsically disordered protein sequences. Bioinformatics 34(11):1850–1858. Available:

  4. Malhis N, Jacobson M, Gsponer J (2016) MoRFchibi SYSTEM: software tools for the identification of MoRFs in protein sequences. Nucleic Acids Res 44(W1):W488–W493

    Article  Google Scholar 

  5. Sharma R, Bayarjargal M, Tsunoda T, Patil A, Sharma A (2018) MoRFPred-plus: computational identification of MoRFs in protein sequences using physicochemical properties and HMM profiles. J Theoret Biol 437:9–16. Available:

  6. Midic U, Oldfield C, Dunker A, Obradovic Z, Uversky V (2009) Protein disorder in the human diseasome: unfoldomics of human genetic diseases. BMC Genom 10(1):S12. Available

  7. Uversky V et al (2009) Unfoldomics of human diseases: linking protein intrinsic disorder with diseases. BMC Genom 10(1):S7. Available:

  8. Al-Tabbakh SM, Mohamed HM, El ZH (2018) Machine learning techniques for analysis of Egyptian flight delay. Int J Data Mining Knowledge Managem Process 8(3):01–14. Available

  9. Ryan MM, Shobha G, Rangaswamy S (2020) Supervised learning—an overview | ScienceDirect Topics. 2020. [Online]. Available Accessed 1 Mar 2020

  10. Mishra S (2020) Unsupervised learning and data clustering. Medium 2020. [Online]. Available: Accessed 1 Mar 2020

  11. Hsu W et al (2020) Intrinsic protein disorder and protein-protein interactions. In: Pacific symposium on biocomputing. Pacific symposium on biocomputing, pp 1–13. Available: Accessed 20 Feb 2020

  12. Mohan A et al (2006) Analysis of molecular recognition features (MoRFs). J Molecular Biol 362(5):1043–1059. Available:

  13. He H, Zhao J, Sun G (2019) Prediction of MoRFs in protein sequences with MLPs based on sequence properties and evolution information. Entropy 21(7):635. Available:

  14. Hanson J, Litfin T, Paliwal K, Zhou Y (2019) Identifying molecular recognition features in intrinsically disordered regions of proteins by transfer learning. Bioinformatics. Available

  15. Wang Y, Guo Y, Pu X, Li M (2017) A sequence-based computational method for prediction of MoRFs. RSC Adv 7(31):18937–18945. Available

  16. EL‐Manzalawy Y, Dobbs D, Honavar V (2008) Predicting flexible length linear B-cell epitopes. J Molecular Recogn 21(4):121–132. Available:

  17. Reddy H, Sharma A, Dehzangi A, Shigemizu D, Chandra A, Tsunoda T (2019) GlyStruct: glycation prediction using structural properties of amino acid residues. BMC Bioinform 19(13). Available

  18. Team D (2020) Kernel functions-introduction to SVM Kernel & examples—dataflair. DataFlair, 2020 [Online]. Available Accessed 28 May 2020

  19. Understanding AUC—ROC Curve, Medium (2020) [Online]. Available Accessed 22 May 2020

Download references

Author information

Authors and Affiliations


Corresponding author

Correspondence to Bibhya Sharma .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2023 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Elisha, D., Sanau, J., Assaf, M.H., Kumar, R.R., Sharma, B., Sharma, R. (2023). Molecular Recognition and Feature Extraction System. In: Yadav, A., Nanda, S.J., Lim, MH. (eds) Proceedings of International Conference on Paradigms of Communication, Computing and Data Analytics. PCCDA 2023. Algorithms for Intelligent Systems. Springer, Singapore.

Download citation

Publish with us

Policies and ethics