Learning Process Models with Missing Data

Abstract

In this paper, we review the task of inductive process modeling, which uses domain knowledge to compose explanatory models of continuous dynamic systems. Next, we discuss approaches to learning with missing values in time series, noting that these efforts are typically applied to descriptive modeling tasks that use little background knowledge. We also point out that these methods assume that data are missing at random, a condition that may not hold in scientific domains. Using experiments with synthetic and natural data, we compare an expectation maximization approach with one that simply ignores the missing data. Results indicate that expectation maximization leads to more accurate models in most cases, even though its basic assumptions are unmet. We conclude by discussing the implications of our findings along with directions for future work.

Publication
In Proceedings of the 17th European Conference on Machine Learning
Will Bridewell
Research Scientist in Artificial Intelligence

My research interests include the relationship between attention, cognition, and intentional action.