Prediction of Protein–Protein Interaction Sites with Machine-Learning-Based Data-Cleaning and Post-Filtering Procedures

Liu, Guang-Hui; Shen, Hong-Bin; Yu, Dong-Jun

doi:10.1007/s00232-015-9856-z

Prediction of Protein–Protein Interaction Sites with Machine-Learning-Based Data-Cleaning and Post-Filtering Procedures

Published: 12 November 2015

Volume 249, pages 141–153, (2016)
Cite this article

The Journal of Membrane Biology Aims and scope Submit manuscript

Guang-Hui Liu^1,2,
Hong-Bin Shen³ &
Dong-Jun Yu¹

1236 Accesses
31 Citations
Explore all metrics

Abstract

Accurately predicting protein–protein interaction sites (PPIs) is currently a hot topic because it has been demonstrated to be very useful for understanding disease mechanisms and designing drugs. Machine-learning-based computational approaches have been broadly utilized and demonstrated to be useful for PPI prediction. However, directly applying traditional machine learning algorithms, which often assume that samples in different classes are balanced, often leads to poor performance because of the severe class imbalance that exists in the PPI prediction problem. In this study, we propose a novel method for improving PPI prediction performance by relieving the severity of class imbalance using a data-cleaning procedure and reducing predicted false positives with a post-filtering procedure: First, a machine-learning-based data-cleaning procedure is applied to remove those marginal targets, which may potentially have a negative effect on training a model with a clear classification boundary, from the majority samples to relieve the severity of class imbalance in the original training dataset; then, a prediction model is trained on the cleaned dataset; finally, an effective post-filtering procedure is further used to reduce potential false positive predictions. Stringent cross-validation and independent validation tests on benchmark datasets demonstrated the efficacy of the proposed method, which exhibits highly competitive performance compared with existing state-of-the-art sequence-based PPIs predictors and should supplement existing PPI prediction methods.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Automated feature engineering improves prediction of protein–protein interactions

Article 05 July 2019

In silico prediction methods of self-interacting proteins: an empirical and academic survey

Article 06 October 2022

Selecting high-quality negative samples for effectively predicting protein-RNA interactions

Article Open access 14 March 2017

References

Agrawal NJ, Helk B, Trout BL (2014) A computational tool to predict the evolutionarily conserved protein–protein interaction hot-spot residues from the structure of the unbound protein. FEBS Lett 588:326–333
Article CAS PubMed Google Scholar
Ahmed Z, Tetlow IJ, Ahmed R, Morell MK, Emes MJ (2015) Protein–protein interactions among enzymes of starch biosynthesis in high-amylose barley genotypes reveal differential roles of heteromeric enzyme complexes in the synthesis of A and B granules. Plant Sci 233:95–106
Article CAS PubMed Google Scholar
Ako-Adjei D, Fu W, Wallin C, Katz KS, Song G, Darji D, Brister JR, Ptak RG, Pruitt KD (2015) HIV-1, human interaction database: current status and new features. Nucleic Acids Res 43:D566–D570
Article PubMed PubMed Central Google Scholar
Altschul SF, Madden TL, Schäffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ (1997) Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res 25:3389–3402
Article CAS PubMed PubMed Central Google Scholar
Baldi P, Brunak S, Chauvin Y, Andersen CA, Nielsen H (2000) Assessing the accuracy of prediction algorithms for classification: an overview. Bioinformatics 16:412–424
Article CAS PubMed Google Scholar
Betel D, Breitkreuz KE, Isserlin R, Dewar-Darch D, Tyers M, Hogue CW (2007) Structure-templated predictions of novel protein interactions from sequence information. PLoS Comput Biol 3:e182
Article PubMed Central Google Scholar
Bock JR, Gough DA (2001) Predicting protein–protein interactions from primary structure. Bioinformatics 17:455–460
Article CAS PubMed Google Scholar
Bradford JR, Westhead DR (2005) Improved prediction of protein–protein binding sites using a support vector machines approach. Bioinformatics 21:1487–1494
Article CAS PubMed Google Scholar
Bradford JR, Needham CJ, Bulpitt AJ, Westhead DR (2006) Insights into protein–protein interfaces using a Bayesian network prediction method. J Mol Biol 362:365–386
Article CAS PubMed Google Scholar
Breiman L (2001) Random forests. Mach Learn 45:5–32
Article Google Scholar
Burgoyne NJ, Jackson RM (2006) Predicting protein interaction sites: binding hot-spots in protein–protein and protein–ligand interfaces. Bioinformatics 22:1335–1342
Article CAS PubMed Google Scholar
Chen X-W, Jeong J-C (2009) Sequence-based prediction of protein interaction sites with an integrative method. Bioinformatics 25:585–591
Article PubMed Google Scholar
Chen C-T, Peng H-P, Jian J-W, Tsai K-C, Chang J-Y, Yang E-W, Chen J-B, Ho S-Y, Hsu W-L, Yang A-S (2012) Protein-protein interaction site predictions with three-dimensional probability distributions of interacting atoms on protein surfaces. PLoS One 7:e37706
Article CAS PubMed PubMed Central Google Scholar
Chen W, Feng PM, Lin H, Chou KC (2013) iRSpot-PseDNC: identify recombination spots with pseudo dinucleotide composition. Nucleic Acids Res 41:e68
Article CAS PubMed PubMed Central Google Scholar
Chen W, Feng P-M, Deng E-Z, Lin H, Chou K-C (2014) iTIS-PseTNC: a sequence-based predictor for identifying translation initiation site in human genes using pseudo trinucleotide composition. Anal Biochem 462:76–83
Article CAS PubMed Google Scholar
Chen W, Feng P, Ding H, Lin H, Chou K-C (2015) iRNA-methyl: identifying N 6-methyladenosine sites using pseudo nucleotide composition. Anal Biochem 490:26–33
Article CAS PubMed Google Scholar
Chothia C, Janin J (1975) Principles of protein-protein recognition. Nature 256:705–708
Article CAS PubMed Google Scholar
Chou K (2001) Using subsite coupling to predict signal peptides. Protein Eng 14:75–79
Article CAS PubMed Google Scholar
Chou KC (2011) Some remarks on protein attribute prediction and pseudo amino acid composition. J Theor Biol 273:236–247
Article CAS PubMed Google Scholar
Chou KC (2013) Some remarks on predicting multi-label attributes in molecular biosystems. Mol Biosyst 9:1092–1100
Article CAS PubMed Google Scholar
Chou K-C (2015) Impacts of bioinformatics to medicinal chemistry. Med Chem 11:218–234
Article CAS PubMed Google Scholar
Cukuroglu E, Gursoy A, Nussinov R, Keskin O (2014) Non-redundant unique interface structures as templates for modeling protein interactions. PLoS One 9:e86738
Article PubMed PubMed Central Google Scholar
DeLano WL (2002) The PyMOL molecular graphics system, http://www.pymol.org
Dhole K, Singh G, Pai PP, Mondal S (2014) Sequence-based prediction of protein-protein interaction sites with L1-logreg classifier. J Theor Biol 348:47–54
Article CAS PubMed Google Scholar
Ding H, Deng E-Z, Yuan L-F, Liu L, Lin H, Chen W, Chou K-C (2014) iCTX-Type: a sequence-based predictor for identifying the types of conotoxins in targeting ion channels. BioMed Res Int. doi:10.1155/2014/286419
Google Scholar
Drewes G, Bouwmeester T (2003) Global approaches to protein–protein interactions. Curr Opin Cell Biol 15:199–205
Article CAS PubMed Google Scholar
Edwards AM, Kus B, Jansen R, Greenbaum D, Greenblatt J, Gerstein M (2002) Bridging structural biology and genomics: assessing protein interaction data with known complexes. Trends Genet 18:529–536
Article CAS PubMed Google Scholar
Ertekin S, Huang J, Bottou L, Giles L (2007a). Learning on the border: active learning in imbalanced data classification. In: ACM Conference on Information and Knowledge Management, pp 127–136
Ertekin S, Huang J, Giles CL (2007b) Active learning for class imbalance problem. In: Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval. ACM, Amsterdam, pp 823–824
Estabrooks A, Jo TH, Japkowicz N (2004) A multiple resampling method for learning from imbalanced data sets. Comput Intell 20:18–36
Article Google Scholar
Fariselli P, Pazos F, Valencia A, Casadio R (2002) Prediction of protein–protein interaction sites in heterocomplexes with neural networks. Eur J Biochem 269:1356–1361
Article CAS PubMed Google Scholar
Friedrich T, Pils B, Dandekar T, Schultz J, Müller T (2006) Modelling interaction sites in protein domains with interaction profile hidden Markov models. Bioinformatics 22:2851–2857
Article CAS PubMed Google Scholar
Fry DC (2015) Targeting protein-protein interactions for drug discovery. Protein Protein Interact Methods Appl 1278:93–106
CAS Google Scholar
Gallet X, Charloteaux B, Thomas A, Brasseur R (2000) A fast method to predict protein interaction sites from sequences. J Mol Biol 302:917–926
Article CAS PubMed Google Scholar
Gromiha MM, Yokota K, Fukui K (2009) Energy based approach for understanding the recognition mechanism in protein–protein complexes. Mol Biosyst 5:1779–1786
Article PubMed Google Scholar
Guo SH, Deng EZ, Xu LQ, Ding H, Lin H, Chen W, Chou KC (2014) iNuc-PseKNC: a sequence-based predictor for predicting nucleosome positioning in genomes with pseudo k-tuple nucleotide composition. Bioinformatics 30:1522–1529
Article CAS PubMed Google Scholar
Hall DA, Ptacek J, Snyder M (2007) Protein microarray technology. Mech Ageing Dev 128:161–167
Article CAS PubMed PubMed Central Google Scholar
He H-B, Garcia EA (2009a) Learning from imbalanced data. IEEE Trans Knowl Data Eng 21:1263–1284
Article Google Scholar
He H, Garcia EA (2009b) Learning from Imbalanced Data. IEEE Trans Knowl Data Eng 21:1263–1284
Article Google Scholar
He X, Han K, Hu J, Yan H, Yang J-Y, Shen H-B, Yu D-J (2015) TargetFreeze: identifying antifreeze proteins via a combination of weights using sequence evolutionary information and pseudo amino acid composition. J Membr Biol 19(1):1–10
Google Scholar
Hong X, Chen S, Harris CJ (2007) A kernel-based two-class classifier for imbalanced data sets. IEEE Trans Neural Networks 18:28–41
Article PubMed Google Scholar
Hu L, Huang T, Shi X, Lu W-C, Cai Y-D, Chou K-C (2011) Predicting functions of proteins in mouse based on weighted protein-protein interaction network and protein hybrid properties. PLoS One 6:e14556
Article CAS PubMed PubMed Central Google Scholar
Hu J, He X, Yu D-J, Yang X-B, Yang J-Y, Shen H-B (2014) A new supervised over-sampling algorithm with application to protein-nucleotide binding residue prediction. PLoS One 9(9):107676
Article Google Scholar
Hubbard SJ, Thornton JM (1993) Naccess. Computer Program, vol 2, Department of Biochemistry and Molecular Biology, University College, London
Hwang H, Pierce B, Mintseris J, Janin J, Weng Z (2008) Protein–protein docking benchmark version 3.0. Proteins Struct Function Bioinform 73:705–709
Article CAS Google Scholar
Ito T, Tashiro K, Muta S, Ozawa R, Chiba T, Nishizawa M, Yamamoto K, Kuhara S, Sakaki Y (2000) Toward a protein–protein interaction map of the budding yeast: a comprehensive system to examine two-hybrid interactions in all possible combinations between the yeast proteins. Proc Natl Acad Sci 97:1143–1147
Article CAS PubMed PubMed Central Google Scholar
Ito T, Chiba T, Ozawa R, Yoshida M, Hattori M, Sakaki Y (2001) A comprehensive two-hybrid analysis to explore the yeast protein interactome. Proc Natl Acad Sci 98:4569–4574
Article CAS PubMed PubMed Central Google Scholar
Jia J, Liu Z, Xiao X, Liu B, Chou K-C (2015a) Identification of protein-protein binding sites by incorporating the physicochemical properties and stationary wavelet transforms into pseudo amino acid composition. J Biomol Struct Dyn. doi:10.1080/07391102.2015.1095116
PubMed Google Scholar
Jia J, Liu Z, Xiao X, Liu B, Chou K-C (2015b) iPPI-Esml: An ensemble classifier for identifying the interactions of proteins by incorporating their physicochemical properties and wavelet transforms into PseAAC. J Theor Biol 377:47–56
Article CAS PubMed Google Scholar
Jia J, Xiao X, Liu B (2015c) Prediction of protein-protein interactions with physicochemical descriptors and wavelet transform via random forests. J Lab Autom. doi:10.1177/2211068215581487
PubMed Google Scholar
Jones S, Thornton JM (1995) Protein-protein interactions: a review of protein dimer structures. Prog Biophys Mol Biol 63:31–65
Article CAS PubMed Google Scholar
Jones S, Thornton JM (1997a) Analysis of protein-protein interaction sites using surface patches. J Mol Biol 272:121–132
Article CAS PubMed Google Scholar
Jones S, Thornton JM (1997b) Prediction of protein-protein interaction sites using patch analysis. J Mol Biol 272:133–143
Article CAS PubMed Google Scholar
Joo K, Lee SJ, Lee J (2012) Sann: solvent accessibility prediction of proteins by nearest neighbor method. Proteins Struct Function Bioinform 80:1791–1797
CAS Google Scholar
Kang PS, Cho SZ (2006) EUS SVMs: ensemble of under-sampled SVMs for data imbalance problems. Neural Inf Process Proc 4232(1):837–846
Google Scholar
Kyte J, Doolittle RF (1982) A simple method for displaying the hydropathic character of a protein. J Mol Biol 157:105–132
Article CAS PubMed Google Scholar
Laurikkala J (2001) Improving identification of difficult small classes by balancing class distribution. Artif Intell Med Proc 2101:63–66
Article Google Scholar
Lin WZ, Fang JA, Xiao X, Chou KC (2013) iLoc-Animal: a multi-label learning classifier for predicting subcellular localization of animal proteins. Mol BioSyst 4:634–644
Article Google Scholar
Lin H, Deng E-Z, Ding H, Chen W, Chou K-C (2014) iPro54-PseKNC: a sequence-based predictor for identifying sigma-54 promoters in prokaryote with pseudo k-tuple nucleotide composition. Nucleic Acids Res 42:12961–12972
Article PubMed PubMed Central Google Scholar
Liu B, Xu J, Lan X, Xu R, Zhou J, Wang X, Chou K-C (2014) iDNA-Prot|dis: identifying DNA-binding proteins by incorporating amino acid distance-pairs and reduced alphabet profile into the general pseudo amino acid composition. PLoS One 9(9):e106691
Article PubMed PubMed Central Google Scholar
Liu B, Fang L, Liu F, Wang X, Chen J, Chou K-C (2015a) Identification of real microRNA precursors with a pseudo structure status composition approach. PLoS One 10:e0121501
Article PubMed PubMed Central Google Scholar
Liu B, Fang L, Wang S, Wang X, Li H, Chou K-C (2015b) Identification of microRNA precursor with the degenerate K-tuple or Kmer strategy. J Theor Biol 385:153–159
Article CAS PubMed Google Scholar
Liu Z, Xiao X, Qiu W-R, Chou K-C (2015c) iDNA-Methyl: identifying DNA methylation sites via pseudo trinucleotide composition. Anal Biochem 474:69–77
Article CAS PubMed Google Scholar
Marceau AH, Bernstein DA, Walsh BW, Shapiro W, Simmons LA, Keck JL (2013) Protein interactions in genome maintenance as novel antibacterial targets. PLoS One 8(3):e58765
Article CAS PubMed PubMed Central Google Scholar
Mihel J, Šikić M, Tomić S, Jeren B, Vlahoviček K (2008) PSAIA–protein structure and interaction analyzer. BMC Struct Biol 8:21
Article PubMed PubMed Central Google Scholar
Murakami Y, Mizuguchi K (2010a) Applying the Naïve Bayes classifier with kernel density estimation to the prediction of protein–protein interaction sites. Bioinformatics 26:1841–1848
Article CAS PubMed Google Scholar
Murakami Y, Mizuguchi K (2010b) Applying the Naive Bayes classifier with kernel density estimation to the prediction of protein-protein interaction sites. Bioinformatics 26:1841–1848
Article CAS PubMed Google Scholar
Ofran Y, Rost B (2003) Predicted protein–protein interaction sites from local sequence information. FEBS Lett 544:236–239
Article CAS PubMed Google Scholar
Ofran Y, Rost B (2007) ISIS: interaction sites identified from sequence. Bioinformatics 23:e13–e16
Article CAS PubMed Google Scholar
Porollo A, Meller J (2007) Prediction-based fingerprints of protein–protein interactions. Proteins Struct Function Bioinform 66:630–645
Article CAS Google Scholar
Russell RB, Aloy P (2008) Targeting and tinkering with interaction networks. Nat Chem Biol 4:666–673
Article CAS PubMed Google Scholar
Schäffer AA, Aravind L, Madden TL, Shavirin S, Spouge JL, Wolf YI, Koonin EV, Altschul SF (2001) Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements. Nucleic Acids Res 29:2994–3005
Article PubMed PubMed Central Google Scholar
Sharon M, Sinz A (2015). Studying protein–protein interactions by combining native mass spectrometry and chemical cross-linking. Analyzing biomolecular interactions by mass spectrometry, pp 55–79
Šikić M, Tomić S, Vlahoviček K (2009) Prediction of protein-protein interaction sites in sequences and 3D structures by random forests. PLoS Comput Biol 5:e1000278
Article PubMed PubMed Central Google Scholar
Singh G, Dhole K, Pai PP, Mondal S (2014) SPRINGS: prediction of protein-protein interaction sites using artificial neural networks. PeerJ 1:7
Google Scholar
Skrabanek L, Saini HK, Bader GD, Enright AJ (2008) Computational prediction of protein–protein interactions. Mol Biotechnol 38:1–17
Article CAS PubMed Google Scholar
Sudha G, Nussinov R, Srinivasan N (2014) An overview of recent advances in structural bioinformatics of protein–protein interactions and a guide to their principles. Prog Biophys Mol Biol 116:141–150
Article CAS PubMed Google Scholar
Ting KM (2002) An instance-weighting method to induce cost-sensitive trees. IEEE Trans Knowl Data Eng 14:659–665
Article Google Scholar
Von Mering C, Krause R, Snel B, Cornell M, Oliver SG, Fields S, Bork P (2002) Comparative assessment of large-scale data sets of protein–protein interactions. Nature 417:399–403
Article Google Scholar
Wang BX, Japkowicz N (2010) Boosting support vector machines for imbalanced data sets. Knowl Inf Syst 25:1–20
Article Google Scholar
Wang B, Chen P, Huang D-S, J-j Li, Lok T-M, Lyu MR (2006) Predicting protein interaction sites from residue spatial sequence profile and evolution rate. FEBS Lett 580:380–384
Article CAS PubMed Google Scholar
Wu G, Chang EY (2005) KBA: kernel boundary alignment considering imbalanced data distribution. IEEE Trans Knowl Data Eng 17:786–795
Article Google Scholar
Xiao X, Wang P, Lin WZ, Jia JH, Chou KC (2013) iAMP-2L: a two-level multi-label classifier for identifying antimicrobial peptides and their functional types. Anal Biochem 436:168–177
Article CAS PubMed Google Scholar
Xiao X, Min J-L, Lin W-Z, Liu Z, Cheng X, Chou K-C (2015a) iDrug-Target: predicting the interactions between drug compounds and target proteins in cellular networking via benchmark dataset optimization approach. J Biomol Struct Dyn 33(10):2221–2233
Article CAS PubMed Google Scholar
Xiao X, Zou H-L, Lin W-Z (2015b) iMem-Seq: a multi-label learning classifier for predicting membrane proteins types. J Membr Biol 248:745–752
Article CAS PubMed Google Scholar
Xu Y, Wen X, Wen L-S, Wu L-Y, Deng N-Y, Chou K-C (2014) iNitro-Tyr: prediction of nitrotyrosine sites in proteins with general pseudo amino acid composition. PLoS One 9:e105018
Article PubMed PubMed Central Google Scholar
Yan C, Dobbs D, Honavar V (2003) Identification of surface residues involved in protein-protein interaction—a support vector machine approach, intelligent systems design and applications. Springer, Berlin, pp 53–62
Google Scholar
Yan C, Dobbs D, Honavar V (2004) A two-stage classifier for identification of protein–protein interface residues. Bioinformatics 20:i371–i378
Article CAS PubMed Google Scholar
Yu D-J, Shen H-B, Yang J-Y (2011) SOMRuler: a novel interpretable transmembrane helices predictor. IEEE Trans NanoBiosci 10:121–129
Google Scholar
Yu D-J, Hu J, Wu X-W, Shen H-B, Chen J, Tang Z-M, Yang J, Yang J-Y (2013a) Learning protein multi-view features in complex space. Amino Acids 44:1365–1379
Article CAS PubMed Google Scholar
Yu DJ, Hu J, Huang Y, Shen HB, Qi Y, Tang ZM, Yang JY (2013b) TargetATPsite: a template-free method for ATP-binding sites prediction with residue evolution image sparse representation and classifier ensemble. J Comput Chem 34:974–985
Article PubMed Google Scholar
Yu DJ, Hu J, Tang ZM, Shen HB, Yang J, Yang JY (2013c) Improving protein-ATP binding residues prediction by boosting SVMs with random under-sampling. Neurocomputing 104:180–190
Article Google Scholar
Yugandhar K, Gromiha MM (2014a) Feature selection and classification of protein–protein complexes based on their binding affinities using machine learning approaches. Proteins Struct Funct Bioinform 82:2088–2096
Article CAS Google Scholar
Yugandhar K, Gromiha MM (2014b) Protein-protein binding affinity prediction from amino acid sequence. Bioinformatics 30(24):3583–3589
Article CAS PubMed Google Scholar
Zhou ZH, Liu XY (2010) On multi-class cost-sensitive learning. Comput Intell 26:232–257
Article CAS Google Scholar
Zhou H-X, Shan Y-B (2001) Prediction of protein interaction sites from sequence profile and residue neighbor list. Proteins Struct Funct Bioinform 44:336–343
Article CAS Google Scholar
Zou H-L, Xiao X (2015) A new multi-label classifier in identifying the functional types of human membrane proteins. J Membr Biol 248:179–186
Article CAS PubMed Google Scholar

Download references

Acknowledgments

This work was supported by the National Natural Science Foundation of China (No. 61373062, 61233011, and 61222306), the Jiangsu Postdoctoral Science Foundation (No. 1201027C), the Natural Science Foundation of Jiangsu (No. BK20141403), the China Postdoctoral Science Foundation (No. 2014T70526 and 2013 M530260), the Fundamental Research Funds for the Central Universities (No. 30920130111010), and “The Six Top Talents” of Jiangsu Province (No. 2013-XXRJ-022).

Author information

Authors and Affiliations

School of Computer Science and Engineering, Nanjing University of Science and Technology, Xiaolingwei 200, Nanjing, 210094, China
Guang-Hui Liu & Dong-Jun Yu
School of Information Engineering, Nanjing University of Finance & Economics, Nanjing, 210046, China
Guang-Hui Liu
Institute of Image Processing and Pattern Recognition, Shanghai Jiao Tong University, Dongchuan Road 800, Shanghai, 200240, China
Hong-Bin Shen

Authors

Guang-Hui Liu
View author publications
You can also search for this author in PubMed Google Scholar
Hong-Bin Shen
View author publications
You can also search for this author in PubMed Google Scholar
Dong-Jun Yu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Dong-Jun Yu.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Liu, GH., Shen, HB. & Yu, DJ. Prediction of Protein–Protein Interaction Sites with Machine-Learning-Based Data-Cleaning and Post-Filtering Procedures. J Membrane Biol 249, 141–153 (2016). https://doi.org/10.1007/s00232-015-9856-z

Download citation

Received: 07 August 2015
Accepted: 03 November 2015
Published: 12 November 2015
Issue Date: April 2016
DOI: https://doi.org/10.1007/s00232-015-9856-z

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Prediction of Protein–Protein Interaction Sites with Machine-Learning-Based Data-Cleaning and Post-Filtering Procedures

Abstract

Access this article

Similar content being viewed by others

Automated feature engineering improves prediction of protein–protein interactions

In silico prediction methods of self-interacting proteins: an empirical and academic survey

Selecting high-quality negative samples for effectively predicting protein-RNA interactions

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Prediction of Protein–Protein Interaction Sites with Machine-Learning-Based Data-Cleaning and Post-Filtering Procedures

Abstract

Access this article

Similar content being viewed by others

Automated feature engineering improves prediction of protein–protein interactions

In silico prediction methods of self-interacting proteins: an empirical and academic survey

Selecting high-quality negative samples for effectively predicting protein-RNA interactions

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation