Abstract
Traditional machine learning algorithms depend heavily on the assumption that there is sufficient data to learn a reliable model. This is not always the case, and in situations where data is limited, transfer learning can be applied to compensate for the lack of information by learning from several sources. In this work, we present a novel methodology for inducing a Temporal Nodes Bayesian Network (TNBN) when training data is scarce by applying a transfer learning strategy. A TNBN is a probabilistic graphical model that offers a compact representation for dynamic domains by defining multiple time intervals in which events can occur. Learning a TNBN poses additional challenges to learning traditional Bayesian networks due to the incorporation of time intervals. Our proposal incorporates novel approaches to transfer knowledge from several TNBNs to learn the structure, parameters and intervals of a target TNBN. To evaluate our algorithm, we performed experiments with a synthetic network, where we created auxiliary models by altering the structure, parameters and temporal intervals of the original model. Results show that the proposed algorithm is capable of retrieving a reliable model even when few records are available for the target domain. We also performed experiments with a real-world data set belonging to the medical domain of HIV, where we were able to learn some documented mutational pathways and their temporal relations by applying transfer learning.
Similar content being viewed by others
Notes
The decision to have the target data account for at least 25 % of the total was chosen empirically, and will not necessarily be the best value for other domains. To find a good percentage split, a cross validation procedure can be carried out, where the percentage assigned to the target data is increased with every new cross validation. The percentage split that averaged the best results is used.
For our experiments, we approximate a data set of sufficient size to be that which holds at least 10 records per each probability value present in the model. A data set with fewer than this minimum of records is considered to have “scarce” records.
Dr. Santiago Avila from the Research Center for Infectious Diseases (or CIENI) in Mexico city provided his assistance for these experiments.
References
Arroyo-Figueroa G, Sucar LE (1999) A temporal Bayesian network for diagnosis and prediction. In: Proceedings of the fifteenth conference on uncertainty in artificial intelligence. Morgan Kaufmann Publishers Inc, pp 13–20
Arroyo-Figueroa G, Sucar LE, Villavicencio A (1998) Probabilistic temporal reasoning and its application to fossil power plant operation. Expert Syst Appl 15(3):317–324
Beerenwinkel N, Schmidt B, Walter H, Kaiser R, Lengauer T, Hoffmann D, Korn K, Selbig J (2002) Diversity and complexity of HIV-1 drug resistance: a bioinformatics approach to predicting phenotype from genotype. Proc National Acad Sci 99(12):8271–8276
Brier GW (1950) Verification of forecasts expressed in terms of probability. Mon Weather Rev 78(1):1–3
Caruana R (1997) Multitask learning. Mach Learn 28(1):41–75
Consortium E, et al (2002) Elvira: An environment for creating and using probabilistic graphical models. In: Proceedings of the first European workshop on probabilistic graphical models, pp. 222–230
Cooper GF, Herskovits E (1992) A Bayesian method for the induction of probabilistic networks from data. Mach Learn 9(4):309–347
Dai W, Xue GR, Yang Q, Yu Y (2007) Transferring naive Bayes classifiers for text classification. In: Proceedings of the national conference on artificial intelligence, vol 22. Menlo Park, CA; Cambridge, MA; London; AAAI Press; MIT Press; 1999, p 540
Drȧghici S, Potter RB (2003) Predicting HIV drug resistance with neural networks. Bioinformatics 19(1):98–107
Dunn JC (1973) A fuzzy relative of the ISODATA process and its use in detecting compact well-separated clusters. Cybern Syst 3(3):32–57
Fiedler Cameras LJ, Sucar LE, Morales EF (2013) A transfer learning approach for learning temporal nodes Bayesian networks. In: The twenty-sixth international FLAIRS conference, pp 637–640
Friedman N, Yakhini Z (1996) On the sample complexity of learning bayesian networks. In: Proceedings of the twelfth international conference on uncertainty in artificial intelligence. Morgan Kaufmann Publishers Inc, pp 274–282
Hanks S, Madigan D, Gavrin J (1995) Probabilistic temporal reasoning with endogenous change. In: Proceedings of the eleventh conference on uncertainty in artificial intelligence. Morgan Kaufmann Publishers Inc, pp 245–254
Hernandez-Leal P, Gonzalez JA, Morales EF, Enrique Sucar L (2013) Learning temporal nodes Bayesian networks. Int J Approx Reason 54(8):956–977
Hernandez-Leal P, Rios-Flores A, Ávila-Rios S, Reyes-Terán G, Gonzalez JA, Fiedler-Cameras L, Orihuela-Espina F, Morales EF, Sucar LE (2013) Discovering human immunodeficiency virus mutational pathways using temporal Bayesian networks. Artif Intell Med 57(3):185–195
Hernandez-Leal P, Sucar LE, Gonzalez JA, Morales EF, Ibarguengoytia PH (2011) Learning temporal bayesian networks for power plant diagnosis. In: Modern approaches in applied intelligence. Springer, pp 39–48
Luis R, Sucar LE, Morales EF (2010) Inductive transfer for learning Bayesian networks. Mach Learn 79(1–2):227–255
Masquelier B, Breilh D, Neau D, Lawson-Ayayi S, Lavignolle V, Ragnaud JM, Dupon M, Morlat P, Dabis F, Fleury H et al (2002) Human immunodeficiency virus type 1 genotypic and pharmacokinetic determinants of the virological response to lopinavir-ritonavir-containing therapy in protease inhibitor-experienced patients. Antimicrob Agents Chemother 46(9):2926–2932
Niculescu-Mizil A, Caruana R (2007) Inductive transfer for Bayesian network structure learning. J Mach Learn Res Proc Track 2:339–346
Oniśko A, Druzdzel MJ, Wasyluk H. (2001) Learning Bayesian network parameters from small data sets: application of noisy-or gates. Int J Approx Reason 27(2):165–182
Pan SJ, Yang Q (2010) A survey on transfer learning. IEEE Trans Knowledge Data Eng 22(10):1345–1359
Rhee SY, Gonzales MJ, Kantor R, Betts BJ, Ravela J, Shafer RW (2003) Human immunodeficiency virus reverse transcriptase and protease sequence database. Nucleic Acids Res 31(1):298–303
Singh M, Valtorta M (1993) An algorithm for the construction of bayesian network structures from data. In: Proceedings of the ninth international conference on uncertainty in artificial intelligence. Morgan Kaufmann Publishers Inc, pp 259–265
Spirtes P, Glymour C, Scheines R (2000) Causation, prediction, and search, vol 81. The MIT Press
Sun S, Xu Z, Yang M (2013) Transfer learning with part-based ensembles. In: Multiple classifier systems. Springer, pp 271– 282
Tonda AP, Lutton E, Reuillon R, Squillero G, Wuillemin PH (2012) Bayesian network structure learning from limited datasets through graph evolution. In: EuroGP. Springer Verlag, pp 254– 265
Xu Z, Sun S (2012) Multi-source transfer learning with multi-view adaboost. In: Neural information processing. Springer, pp 332–339
Acknowledgments
The first author is thankful to CONACyT for the financial support given to her through scholarship 261257.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Fiedler, L.J., Sucar, L.E. & Morales, E.F. Transfer learning for temporal nodes Bayesian networks. Appl Intell 43, 578–597 (2015). https://doi.org/10.1007/s10489-015-0662-1
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10489-015-0662-1