Skip to main content
Log in

GIpred: a computational tool for prediction of GIGANTEA proteins using machine learning algorithm

  • Research Article
  • Published:
Physiology and Molecular Biology of Plants Aims and scope Submit manuscript

Abstract

In plants, GIGANTEA (GI) protein plays different biological functions including carbon and sucrose metabolism, cell wall deposition, transpiration and hypocotyl elongation. This suggests that GI is an important class of proteins. So far, the resource-intensive experimental methods have been mostly utilized for identification of GI proteins. Thus, we made an attempt in this study to develop a computational model for fast and accurate prediction of GI proteins. Ten different supervised learning algorithms i.e., SVM, RF, JRIP, J48, LMT, IBK, NB, PART, BAGG and LGB were employed for prediction, where the amino acid composition (AAC), FASGAI features and physico-chemical (PHYC) properties were used as numerical inputs for the learning algorithms. Higher accuracies i.e., 96.75% of AUC-ROC and 86.7% of AUC-PR were observed for SVM coupled with AAC + PHYC feature combination, while evaluated with five-fold cross validation. With leave-one-out cross validation, 97.29% of AUC-ROC and 87.89% of AUC-PR were respectively achieved. While the performance of the model was evaluated with an independent dataset of 18 GI sequences, 17 were observed as correctly predicted. We have also performed proteome-wide identification of GI proteins in wheat, followed by functional annotation using Gene Ontology terms. A prediction server “GIpred” is freely accessible at http://cabgrid.res.in:8080/gipred/ for proteome-wide recognition of GI proteins.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5

Similar content being viewed by others

Availability of data and material

All the datasets used in this study are available at http://cabgrid.res.in:8080/gipred/dataset.html.

Code availability

A prediction server “GIpred” has been developed for the prediction of GIGANTEA proteins. The server is freely accessible at http://cabgrid.res.in:8080/gipred/index.html.

References

Download references

Funding

This study was supported by ICAR CABin Scheme Network project on Agricultural Bioinformatics and Computational Biology (F. No. Agril. Edn. 14/2/2017-A&P dated 02.08.2017), received from Indian Council of Agricultural Research (ICAR), New Delhi. The funder had no role in the design of the study and collection, analysis, and interpretation of data and in writing the manuscript.

Author information

Authors and Affiliations

Authors

Contributions

Conceptualization: PKM; Data curation: SD, SS and TKS; Formal analysis: PKM; Investigation: PKM and SD; Methodology: PKM and SD; Software: TKS and PKM; Validation: SS, SD, TKS and SP; Visualization: PKM, SS and TKS; Roles/Writing—original draft: PKM, SD, SP and SS; Writing—review and editing: PKM and SP.

Corresponding author

Correspondence to Prabina Kumar Meher.

Ethics declarations

Conflicts of interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Ethics approval

Not applicable.

Consent to participate

Not applicable.

Consent for publication

Not applicable.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary file 1 (DOCX 36 kb)

Supplementary file 2 (DOCX 12 kb)

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Meher, P.K., Dash, S., Sahu, T.K. et al. GIpred: a computational tool for prediction of GIGANTEA proteins using machine learning algorithm. Physiol Mol Biol Plants 28, 1–16 (2022). https://doi.org/10.1007/s12298-022-01130-6

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s12298-022-01130-6

Keywords

Navigation