Integrating Association Rules Mined from Health-Care Data with Ontological Information for Automated Knowledge Generation

  • John Heritage
  • Sharon McDonald
  • Ken McGarry
Conference paper
Part of the Advances in Intelligent Systems and Computing book series (AISC, volume 650)


Association rule mining can be combined with complex network theory to automatically build a knowledge base that reveals how drugs cause side-effects when they interact with other drugs taken by patients who have two or more diseases. Drugs interact with on-target and off-target proteins, often in unpredictable ways. A computational approach is therefore needed to unravel the complex relationships between disease comorbidities. We built statistical models from the publicly available FDA Adverse Event Reporting System (FAERS) dataset to reveal interesting and potentially harmful drug combinations, based on side-effects and on relationships between co-morbid diseases. This information can help medical practitioners tailor patient prescriptions for optimal therapy.


Comorbidity, Side-effect, Association rules, Support, Confidence, Pharmaco-epidemiology
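
To make the support and confidence measures concrete, the following is a minimal Python sketch of scoring rules of the form {drug pair} → side-effect. It is an illustration under stated assumptions, not the authors' pipeline: the report records, the drug and side-effect names, the function name mine_pair_rules and the thresholds are all hypothetical, and real FAERS data would need considerable cleaning first. Support is taken here as the fraction of all reports containing both the drug pair and the side-effect, and confidence as the fraction of reports containing the drug pair that also list the side-effect.

```python
from collections import Counter
from itertools import combinations

# Hypothetical toy reports (NOT FAERS data): each record pairs the set of
# drugs a patient was taking with the set of side-effects reported.
reports = [
    ({"drug_a", "drug_b"}, {"nausea"}),
    ({"drug_a", "drug_b"}, {"nausea", "dizziness"}),
    ({"drug_a"}, {"headache"}),
    ({"drug_b"}, {"nausea"}),
    ({"drug_a", "drug_b"}, {"headache"}),
]


def mine_pair_rules(reports, min_support=0.2, min_confidence=0.5):
    """Score rules {drug pair} -> side-effect by support and confidence.

    The thresholds are illustrative assumptions, not values from the paper.
    """
    n = len(reports)
    pair_counts = Counter()   # reports containing a given drug pair
    rule_counts = Counter()   # reports containing the pair AND the effect

    for drugs, effects in reports:
        for pair in combinations(sorted(drugs), 2):
            pair_counts[pair] += 1
            for effect in effects:
                rule_counts[(pair, effect)] += 1

    rules = []
    for (pair, effect), count in rule_counts.items():
        support = count / n                     # P(pair and effect)
        confidence = count / pair_counts[pair]  # P(effect | pair)
        if support >= min_support and confidence >= min_confidence:
            rules.append((pair, effect, support, confidence))

    # Highest-confidence rules first; ties broken by effect name.
    return sorted(rules, key=lambda r: (-r[3], r[1]))


rules = mine_pair_rules(reports)
for pair, effect, support, confidence in rules:
    print(f"{pair} -> {effect}: support={support:.2f}, confidence={confidence:.2f}")
```

On this toy data only the rule for nausea passes both thresholds. In practice such rules would be candidates for further filtering, for example against ontological information on the diseases involved, rather than conclusions in themselves.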



Copyright information

© Springer International Publishing AG 2018

Authors and Affiliations

  1. School of Pharmacy and Pharmaceutical Sciences, Faculty of Health Sciences and Wellbeing, University of Sunderland, Sunderland, UK
  2. Faculty of Computing, University of Sunderland, Sunderland, UK
