Handling Missing Data in Observational Clinical Studies Concerning Cardiovascular Risk: An Insight into Critical Aspects

Solaro, Nadia; Lucini, Daniela; Pagani, Massimo

doi:10.1007/978-3-319-55723-6_14

Handling Missing Data in Observational Clinical Studies Concerning Cardiovascular Risk: An Insight into Critical Aspects

Nadia Solaro²¹,
Daniela Lucini²² &
Massimo Pagani²²

Conference paper
First Online: 05 July 2017

3502 Accesses
1 Citations

Part of the book series: Studies in Classification, Data Analysis, and Knowledge Organization ((STUDIES CLASS))

Abstract

In observational clinical studies, subjects’ health status is empirically assessed according to research protocols that prescribe aspects to investigate and methods for investigation. Commonly to many fields of research, these studies are frequently affected by incompleteness of information, a problem that, if not duly handled, may seriously invalidate conclusions drawn from investigations. Regarding cardiovascular risk assessment, coronary risk factors (e.g. high blood pressure) and proxies of neurovegetative domain (e.g. heart rate variability) are individually evaluated through direct measurements taken in laboratory. A major cause of missingness can be ascribed to the fact that overall sets of collected data typically derive from aggregation of a multitude of sub-studies, undertaken at different times and under slightly different protocols that might not involve the same variables. Data on certain variables can thus be missing if such variables were not included in all protocols. This issue is addressed by referring to a clinical case study concerning the role of Autonomic Nervous System in the evaluation of subjects’ health status.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 109.00; Price excludes VAT (USA)

Softcover Book: USD 139.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

Bowman, A.W., Azzalini, A.: R package ‘sm’: nonparametric smoothing methods (2014). Version 2.2-5.4 http://www.stats.gla.ac.uk/~adrian/sm
Dempster, A.P., Laird, N.M., Rubin, D.B.: Maximum likelihood estimation from incomplete data via the EM algorithm. J. Roy. Stat. Soc. B 39, 1–38 (1977)
MATH Google Scholar
D’Orazio, M.: StatMatch: statistical matching (2015). R package version 1.2.3 http://CRAN.R-project.org/package=StatMatch
D’Orazio, M., Di Zio, M., Scanu, M.: Statistical Matching Theory and Practice. Wiley, New York (2006)
Book MATH Google Scholar
González, I., Déjean, S.: CCA: canonical correlation analysis (2012). R package version 1.2. http://CRAN.R-project.org/package=CCA
Honaker, J., King, G.: What to do about missing values in time-series cross-section data. Am. J. Polit. Sci. 54, 561–581 (2010)
Article Google Scholar
Honaker, J., King, G., Blackwell, M.: Amelia II: A program for missing data. J. Stat. Softw. 45, 1–47 (2011). http://www.jstatsoft.org/v45/i07/
Article Google Scholar
Hothorn, T., Hornik, K., van de Wiel, M.A., Zeileis, A.: Implementing a class of permutation tests: the coin package. J. Stat. Softw. 28, 1–23 (2008). http://www.jstatsoft.org/v28/i08/
Article Google Scholar
Husson, F., Josse, J.: missMDA: handling missing values with/in multivariate data analysis (principal component methods) (2015). R package version 1.8.2. http://CRAN.R-project.org/package=missMDA
Istat.it: Noi Italia – 100 statistiche per capire il Paese in cui viviamo. 2016 edition: http://noi-italia.istat.it/. 2015 edition: http://noi-italia2015.istat.it/
Josse, J., Pagès, J., Husson, F.: Multiple imputation in principal component analysis. Adv. Data Anal. Classif. 5, 231–246 (2011)
Article MathSciNet MATH Google Scholar
Little, R.J.A., Rubin, D.B.: Statistical Analysis with Missing Data, 2nd edn. Wiley, New York (2002)
MATH Google Scholar
Lucini, D., Solaro, N., Pagani, M.: May autonomic indices from cardiovascular variability help identify hypertension? J. Hypertens. 32, 363–373 (2014)
Article Google Scholar
Molenberghs, G., Kenward, M.G.: Missing Data in Clinical Studies. Wiley, Chichester (2007)
Book Google Scholar
R Core Team: R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna (2016). http://www.R-project.org
Saporta, G.: Data fusion and data grafting. Comput. Stat. Data An. 38, 465–473 (2002)
Article MathSciNet MATH Google Scholar
Solaro, N., Barbiero, A., Manzi, G., Ferrari, P.A.: GenForImp: a sequential distance-based approach for imputing missing data (2015). R package version 1.0.0. http://CRAN.R-project.org/package=GenForImp
Solaro, N., Barbiero, A., Manzi, G., Ferrari, P.A.: A sequential distance-based approach for imputing missing data: forward imputation. Adv. Data Anal. Classif. 1–20 (2016) doi:10.1007/s11634-016-0243-0
Google Scholar
Townsend, N., Nichols, M., Scarborough, P., Rayner, M.: Cardiovascular disease in Europe – epidemiological update 2015. Eur. Heart J. 36, 2696–2705 (2015)
Article Google Scholar
Venkatraman, E.S.: clinfun: Clinical trial design and data analysis functions (2015). R package version 1.0.10. http://CRAN.R-project.org/package=clinfun

Download references

Author information

Authors and Affiliations

Department of Statistics and Quantitative Methods, University of Milano-Bicocca, Milano, Italy
Nadia Solaro
BIOMETRA Department, University of Milan, Milano, Italy
Daniela Lucini & Massimo Pagani

Authors

Nadia Solaro
View author publications
You can also search for this author in PubMed Google Scholar
Daniela Lucini
View author publications
You can also search for this author in PubMed Google Scholar
Massimo Pagani
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Nadia Solaro .

Editor information

Editors and Affiliations

Department of Political Sciences, University of Naples Federico II, Napoli, Italy
Francesco Palumbo
Department of Statistical Sciences Paolo Fortunati, Alma Mater Studiorum, University of Bologna, Bologna, Italy
Angela Montanari
Department of Statistical Sciences, Sapienza University of Rome, Rome, Italy
Maurizio Vichi

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Solaro, N., Lucini, D., Pagani, M. (2017). Handling Missing Data in Observational Clinical Studies Concerning Cardiovascular Risk: An Insight into Critical Aspects. In: Palumbo, F., Montanari, A., Vichi, M. (eds) Data Science . Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Cham. https://doi.org/10.1007/978-3-319-55723-6_14

Download citation

DOI: https://doi.org/10.1007/978-3-319-55723-6_14
Published: 05 July 2017
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-55722-9
Online ISBN: 978-3-319-55723-6
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)

Publish with us

Policies and ethics