Uncertain Data Mining: An Example in Clustering Location Data
Data uncertainty is an inherent property in various applications due to reasons such as outdated sources or imprecise measurement. When data mining techniques are applied to these data, their uncertainty has to be considered to obtain high quality results. We present UK-means clustering, an algorithm that enhances the K-means algorithm to handle data uncertainty. We apply UK-means to the particular pattern of moving-object uncertainty. Experimental results show that by considering uncertainty, a clustering algorithm can produce more accurate results.
Unable to display preview. Download preview PDF.
- 2.Chau, M., Cheng, R., Kao, B.: Uncertain Data Mining: A New Research Direction. In: Proc. Workshop on the Sciences of the Artificial, Hualien, Taiwan (2005)Google Scholar
- 3.Cheng, R., Kalashnikov, D., Prabhakar, S.: Querying Imprecise Data in Moving Object Environments. IEEE TKDE 16(9), 1112–1127 (2004)Google Scholar
- 4.Cheng, R., Xia, X., Prabhakar, S., Shah, R., Vitter, J.: Efficient Indexing Methods for Probabilistic Threshold Queries over Uncertain Data. In: Proc. VLDB (2004)Google Scholar
- 5.Hamdan, H., Govaert, G.: Mixture Model Clustering of Uncertain Data. IEEE International Conference on Fuzzy Systems, 879–884 (2005)Google Scholar
- 6.Wolfson, O., Sistla, P., Chamberlain, S., Yesha, Y.: Updating and Querying Databases that Track Mobile Units. Distributed and Parallel Databases 7(3) (1999)Google Scholar