ForCE: Is Estimation of Data Completeness Through Time Series Forecasts Feasible?
- Cite this paper as:
- Endler G., Baumgärtel P., Wahl A.M., Lenz R. (2015) ForCE: Is Estimation of Data Completeness Through Time Series Forecasts Feasible? In: Tadeusz M., Valduriez P., Bellatreche L. (eds) Advances in Databases and Information Systems. ADBIS 2015. Lecture Notes in Computer Science, vol 9282. Springer, Cham
Measuring the completeness of a data population often requires either expert knowledge or reference data. If neither is available, measuring population completeness becomes nontrivial. We present ForCE (Forecasting for Completeness Estimation), a method that estimates the completeness of timestamped data using time series forecasting. We evaluate the method’s feasibility on a real-world dataset from the medical domain, which we provide for download. In this evaluation, ForCE outperforms all three baselines it is compared against.
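The general idea of estimating completeness from a forecast can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify the forecasting model or the completeness formula, so the seasonal-average forecaster, the ratio of observed to forecast record counts, the function names, and the example counts below are all assumptions made for illustration.

```python
from statistics import mean


def seasonal_naive_forecast(history, season_length):
    """Forecast the next period's record count.

    Assumption: a simple seasonal average stands in for whatever
    forecasting model is actually used; it averages past counts that
    fall at the same position in the season as the period to forecast.
    """
    if season_length <= 0:
        raise ValueError("season_length must be positive")
    if len(history) < season_length:
        raise ValueError("need at least one full season of history")
    # The next period has index len(history); pick past periods that
    # share its position within the season.
    start = len(history) % season_length
    return mean(history[start::season_length])


def estimate_completeness(observed_count, expected_count):
    """Estimate completeness as observed / expected, capped at 1.0.

    Assumption: the forecast is treated as the expected number of
    records for the period; this ratio is illustrative only.
    """
    if expected_count <= 0:
        raise ValueError("expected_count must be positive")
    return min(1.0, observed_count / expected_count)


# Hypothetical record counts per period, two seasons of four periods.
history = [100, 120, 80, 60, 104, 116, 84, 56]
expected = seasonal_naive_forecast(history, season_length=4)
completeness = estimate_completeness(observed_count=51, expected_count=expected)
print(expected, completeness)
```

With these made-up counts, the forecast for the next period is the average of the two earlier first-of-season counts, and the completeness estimate is the share of that forecast actually observed. Capping at 1.0 is a design choice of this sketch: an observed count above the forecast is read as "complete" rather than as more than complete.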