Abstract
In this paper, we review the task of inductive process modeling, which uses domain knowledge to compose explanatory models of continuous dynamic systems. Next we discuss approaches to learning with missing values in time series, noting that these efforts are typically applied for descriptive modeling tasks that use little background knowledge. We also point out that these methods assume that data are missing at random—a condition that may not hold in scientific domains. Using experiments with synthetic and natural data, we compare an expectation maximization approach with one that simply ignores the missing data. Results indicate that expectation maximization leads to more accurate models in most cases, even though its basic assumptions are unmet. We conclude by discussing the implications of our findings along with directions for future work.
Chapter PDF
Similar content being viewed by others
Keywords
- Expectation Maximization
- Expectation Maximization Algorithm
- Inductive Logic Programming
- Natural Data
- Continuous Dynamic System
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
Åström, K.J., Eykhoff, P.: System identification—a survey. Automatica 7, 123–167 (1971)
Bradley, E., Easley, M., Stolle, R.: Reasoning about nonlinear system identification. Artificial Intelligence 133, 139–188 (2001)
Dempster, A.P., Laird, N.M., Rubin, D.B.: Maximum likelihood from incomplete data via the EM algorithm. Journal of the Royal Statistical Society. Series B (Methodological) 39, 1–38 (1977)
Džeroski, S., Todorovski, L.: Discovering dynamics: From inductive logic programming to machine discovery. Journal of Intelligent Information Systems 4, 89–108 (1995)
Falkenhainer, B., Forbus, K.D.: Compositional modeling: Finding the right model for the job. Artificial Intelligence 51, 95–143 (1991)
Forbus, K.: Qualitative process theory. Artificial Intelligence 24, 85–168 (1984)
Holling, C.S.: Some characteristics of simple types of predation and parasitism. Canadian Entomologist 91, 385–398 (1959)
Isaksson, A.: System identification subject to missing data. In: American Control Conference, pp. 693–698 (1991)
Jost, C., Ellner, S.: Testing for predator dependence in predator-prey dynamics: A non-parametric approach. Proceedings of the Royal Society of London B 267, 1611–1620 (2000)
Langley, P.: Data-driven discovery of physical laws. Cognitive Science 5, 31–54 (1981)
Langley, P., Sánchez, J., Todorovski, L., Džeroski, S.: Inducing process models from continuous data. In: The Nineteenth International Conference on Machine Learning, pp. 347–354 (2002)
Little, R.J., Rubin, D.B.: Statistical analysis with missing data, 2nd edn. John Wiley, Hoboken
Stoica, P., Xu, L., Li, J.: Parameter estimation with missing data via equalization-maximization. In: IEEE International Conference on Acoustics, Speech, and Signal Processing (2005), pp. IV–57–IV–60 (2005)
Todorovski, L.: Using domain knowledge for automated modeling of dynamic systems with equation discovery Doctoral dissertation, Faculty of computer and information science, University of Ljubljana (2003)
Todorovski, L., Bridewell, W., Shiran, O., Langley, P.: Inducing hierarchical process models in dynamic domains. In: The Twentieth National Conference on Artificial Intelligence, pp. 892–897 (2005)
Veilleux, B.G.: The analysis of a predatory interaction between didinium and paramecium. Master’s thesis, University of Alberta (1976)
Washio, T., Motoda, H., Niwa, Y.: Enhancing the plausibility of law equation discovery. In: The Seventeenth International Conference on Machine Learning, pp. 1127–1134 (2000)
Żytkow, J.M., Zhu, J., Hussam, A.: Automated discovery in a chemistry laboratory. In: The Eighth National Conference on Artificial Intelligence, pp. 889–894 (1990)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Bridewell, W., Langley, P., Racunas, S., Borrett, S. (2006). Learning Process Models with Missing Data. In: Fürnkranz, J., Scheffer, T., Spiliopoulou, M. (eds) Machine Learning: ECML 2006. ECML 2006. Lecture Notes in Computer Science(), vol 4212. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11871842_52
Download citation
DOI: https://doi.org/10.1007/11871842_52
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-45375-8
Online ISBN: 978-3-540-46056-5
eBook Packages: Computer ScienceComputer Science (R0)