Abstract
The sampling rate of variables collected in the hospital setting is dependent on several factors. Patients have different lengths of stay in the hospital, during which distinct physiological parameters are measured. The frequency of measurements depends ultimately in the type of variable and in the patient condition. Hence, when performing data based modeling for knowledge discovery in medical databases, one should have in consideration the heterogeneity of variables. This paper proposes an extension of a mixed fuzzy clustering algorithm in order to handle time invariant and time variant features of unequal lengths. Additionally, a novel approach for deriving Takagi-Sugeno fuzzy models, based on feature transformation using fuzzy c-means is implemented and compared with approaches based on mixed fuzzy clustering. The proposed approaches are tested on real data for mortality prediction in intensive care units of patients diagnosed with acute kidney injury and for ICU readmission prediction. Overall, mixed fuzzy clustering yields better results than fuzzy c-means. Moreover, the proposed extension for time series of unequal lengths improves previous results. Mortality is classified with an AUC of 0.73 and readmissions with an AUC of 0.64.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Celi, L.A., Mark, R.G., Stone, D.J., Montgomery, R.A.: Big data in the intensive care unit closing the data loop. Am. J. Respir. Crit. Med. 187(11), 1157–1160 (2013)
Liao, T.W.: Clustering of time series data - a survey. Pattern Recogn. 38, 1857–1874 (2005)
Ferreira, M.C., Salgado, C.M., Viegas, J.L., Schafer, H., Azevedo, C.S., Vieira, S.M., Sousa, J.: Fuzzy modeling based on mixed fuzzy clustering for health care applications. In: 2015 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE), pp. 1–5. IEEE (2015)
Bezdek, J.C., Ehrlich, R., Full, W.: FCM: the fuzzy c-means clustering algorithm. Comput. Geosci. 10(2), 191–203 (1984)
Izakian, H., Pedrycz, W., Jamal, I.: Clustering spatiotemporal data: an augmented fuzzy C-means. IEEE Trans. Fuzzy Syst. 21(5), 855–868 (2013)
Takagi, T., Sugeno, M.: Fuzzy Identification of Systems and Its Application to Modeling and Control (1985)
Saeed, M., Villarroel, M., Reisner, A.T., Clifford, G., Lehman, L., Moody, G., Heldt, T., Kyaw, T.H., Moody, B., Mark, R.G.: Multiparameter intelligent monitoring in intensive care ii (mimic-ii): a public-access intensive care unit database. Crit. Care Med. 39(5), 952 (2011)
Goldberger, A.L., Amaral, L.A.N., Glass, L., Hausdorff, J.M., Ivanov, P.C., Mark, R.G., Mietus, J.E., Moody, G.B., Peng, C., Stanley, H.E.: Physiobank, physiotoolkit, and physionet components of a new research resource for complex physiologic signals. Circulation 101(23), 215–220 (2000)
Overview of the mimic-iii data (2015). http://mimic.physionet.org/gettingstarted/overview/
Fuchs, L., Lee, J., Novack, V., Baumfeld, Y., Scott, D., Celi, L., Mandelbaum, T., Howell, M., Talmor, D.: Severity of acute kidney injury and two-year outcomes in critically ill patients. Chest 144(3), 866–875 (2013)
Mandelbaum, T., Scott, D.J., Lee, J., Mark, R.G., Malhotra, A., Waikar, S.S., Howell, M.D., Talmor, D.: Outcome of critically ill patients with acute kidney injury using the akin criteria. Crit. Care Med. 39(12), 2659–2664 (2011)
Celi, L.A.G., Tang, R.J., Villarroel, M.C., Davidzon, G.A., Lester, W.T., Chueh, H.C.: A clinical database-driven approach to decision support: predicting mortality among patients with acute kidney injury. J. Healthc. Eng. 2(1), 97–110 (2011). 22844575[pmid]
Fialho, A.S., Cismondi, F., Vieira, S.M., Reti, S.R., Sousa, J.M.C., Finkelstein, S.N.: Data mining using clinical physiology at discharge to predict ICU readmissions. Expert Syst. Appl. 39(18), 13158–13165 (2012)
Pereira, R.D.M.A., Salgado, C.M., Dejam, A., Reti, S.R., Vieira, S.M., Sousa, J.M.C., Celi, L.A., Finkelstein, S.N.: Fuzzy modeling to predict severely depressed left ventricular ejection fraction following admission to the intensive care unit using clinical physiology. Sci. World J. 2015, 1–9 (2015). 26345130[pmid]
Hanley, J.A., McNeil, B.J.: The meaning and use of the area under a receiver operating characteristic (ROC) curve. Radiology 143(4), 29–36 (1982)
Celi, L., Tang, R., Villarroel, M., Davidzon, G., Lester, W., Chueh, H.: A clinical database-driven approach to decision support: predicting mortality among patients with acute kidney injury. J. Healthc. Eng. 2(1), 97–110 (2011)
Acknowledgments
This work was supported by FCT, through IDMEC, under project iDecision4Care (IF/00833/2014/CP1238/CT0002). S. Vieira acknowledges support by Program Investigador FCT (IF/00833/ 2014) from FCT, co-funded by the European Social Fund (ESF) through the Operational Program Human Potential (POPH). This work was supported by FCT, through IDMEC, under LAETA, project UID/EMS/50022/2013.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Salgado, C.M., Vieira, S.M., Sousa, J.M.C. (2016). Fuzzy Modeling Based on Mixed Fuzzy Clustering for Multivariate Time Series of Unequal Lengths. In: Carvalho, J., Lesot, MJ., Kaymak, U., Vieira, S., Bouchon-Meunier, B., Yager, R. (eds) Information Processing and Management of Uncertainty in Knowledge-Based Systems. IPMU 2016. Communications in Computer and Information Science, vol 611. Springer, Cham. https://doi.org/10.1007/978-3-319-40581-0_60
Download citation
DOI: https://doi.org/10.1007/978-3-319-40581-0_60
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-40580-3
Online ISBN: 978-3-319-40581-0
eBook Packages: Computer ScienceComputer Science (R0)