Skip to main content

An improved unsupervised learning approach for potential human microRNA–disease association inference using cluster knowledge

Abstract

Recently, microRNAs (miRNAs) have been shown to play significant roles in the progression of major human diseases. Identifying associations between human diseases and miRNAs using computational tools have attracted considerable attention. Experimental identification and validation of disease–miRNA associations are rare and time consuming. We propose a computational method for predicting associations to understand pathogenicity and uncover prognosis markers. Existing methods employed approaches based on network and machine learning to predict miRNA disease. However, these approaches do not consider the cluster information between miRNAs. Performance results of these methods’ predictions are not satisfactory due to more number of false positives and false negatives. The proposed methodology comprises of two phases: first, we use miRNA cluster information in addition to miRNA-functional similarity and disease semantic similarity. MiRNAs and diseases are placed in the same cluster by transforming the miRNA–disease association data matrix into a low-rank model using Principal Component Analysis (PCA). Second, the Adamic/Adar index was applied that computes the closeness of miRNAs and diseases based on shared neighbors, improving the prediction results. The problem of overestimation is resolved by incorporating similarity information about miRNA and disease. Case studies on Leukemia, Carcinoma, Glioma, Pancreatic Neoplasms, and Melanoma exhibit satisfactory performance with Area Under Curve (AUC) values ranging from 0.736 to 0.834. miRNAs predicted using the proposed method are ascertained to be matching the known associations found in the database of HMDD V3.2, dbDEMC V2.0, and PhenomiR V2.0.

This is a preview of subscription content, access via your institution.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8

References

Download references

Acknowledgements

We would like to thank the management of Vellore Institute of Technology for providing computation help to carry out our research work.

Funding

AA gratefully acknowledges ICMR for the adhoc research grant IRIS ID:2019-0810.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Anand Anbarasu.

Ethics declarations

Conflict of interest

The authors declare no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Rajapandy, M., Anbarasu, A. An improved unsupervised learning approach for potential human microRNA–disease association inference using cluster knowledge. Netw Model Anal Health Inform Bioinforma 10, 21 (2021). https://doi.org/10.1007/s13721-021-00292-9

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s13721-021-00292-9

Keywords

  • microRNA
  • Principal component analysis
  • Association prediction
  • Disease
  • miRNA cluster information
  • Leave-one-out cross validation