Abstract
The past decade has seen explosive growth in digitized medical data. This trend offers medical practitioners an unparalleled opportunity to identify effectiveness of treatments for patients using summary statistics and to offer patients more personalized medical treatments based on predictive analytics. To exploit this opportunity, statisticians and computer scientists need to work and communicate effectively with medical practitioners to ensure proper measurement data, collection of sufficient volumes of heterogeneous data to ensure patient privacy, and understanding of probabilities and sources of errors associated with data sampling. Interdisciplinary collaborations between scientists are likely to lead to the development of more effective methods for explaining probabilities, possible errors, and risks associated with treatment options to patients. This chapter introduces some online resources to help medical practitioners with little or no background in summary and predictive statistics learn basic statistical concepts and implement data analysis on their personal computers using R, a high-level computer language that requires relatively little training. Readers who are only interested in understanding basic statistical concepts may want to skip the subsection on R.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Report on BridgeHead’s 2010 Data Management Healthcheck Survey Results, http://www.bridgeheadsoftware.com/resources/category/reports/ (accessed February 19, 2014)
The BridgeHead Software 2011 International Healthcheck Data Management Survey, http://www.bridgeheadsoftware.com/resources/category/reports/ (accessed February 19, 2014)
IBM Big Data Website, http://www-01.ibm.com/software/data/bigdata/industry-healthcare.html#2 (accessed February 19, 2014)
Creswell, J.: A digital shift on health data swells profits in an industry. NY Times on-line, http://ww.nytimes.com/2013/02/20/business/a-digital-shift-on-health-data-swells-profits.html?pagewanted=all&_r=0 (February 19, 2013)
Kerr, W., Lau, E., Owens, G., Trefler, A.: The future of medical diagnostics: large digitized databases. Yale Journal of Biology and Medicine 85(3), 363–377 (2012), http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3447200/
Holzinger, A., Dehmer, M., Jurisica, I.: Knowledge discovery and interactive data mining in bioinformatics – state-of-the-art, future challenges and research directions. BMC Bioinformatics 15(suppl. 6) (2014)
Hood, L., Rowen, L., Galas, D., Aitchison, J.: Systems biology at the Institute of Systems Biology. Briefings in Functional Genomics and Proteomics 7(4), 239–248 (2008)
Gibbs, W.: Medicine gets up close and personal. Nature, http://www.nature.com/news/medicine-gets-up-close-and-personal-1.14702 (accessed February 11, 2014)
Holzinger, A.: Human-computer interaction and knowledge discovery (HCI-KDD): What is the benefit of bringing those two fields to work together? In: Cuzzocrea, A., Kittl, C., Simos, D.E., Weippl, E., Xu, L. (eds.) CD-ARES 2013. LNCS, vol. 8127, pp. 319–328. Springer, Heidelberg (2013)
The R project for statistical computing, http://www.r-project.org/ (accessed February 11, 2014)
Holzinger, A.: Biomedical Informatics: Computational Sciences meets Life Sciences. BoD, Norderstedt (2012)
Wikipedia, http://en.wikipedia.org/wiki/Cloud_computing (February 11, 2014)
Wikipedia, http://en.wikipedia.org/wiki/Massive_Open_Online_Course (February 11, 2014)
Merriam-Webster on-line, http://www.merriam-webster.com/dictionary/statistics
Wikipedia, http://en.wikipedia.org/wiki/Summary_statistics (February 11, 2014)
Wikipedia, http://en.wikipedia.org/wiki/Statistical_inference (February 11, 2014)
Merriam-Webster, http://www.merriam-webster.com/dictionary/correlation (February 11, 2014)
Wolfram MathWorldTM, http://mathworld.wolfram.com/Probability.html
Evfimievski, A., Grandison, T.: Privacy-preserving data mining. In: Ferraggine, V., Doorn, J., Rivero, L. (eds.) Handbook of Research on Innovations in Database Technologies and Applications, pp. 527–536. IGI Global, Hershey (2009)
Aggarwal, C., Yu, P.: A general survey of privacy-preserving data mining models and algorithms. In: Aggarwal, C. Yu, P. (eds.), pp. 11–52. Springer, New York (2008)
Pasierb, K., Kajdanowicz, T., Kazienko, P.: Privacy-preserving data mining, sharing and publishing. Journal of Medical Informatics and Technologies 18, 69–76 (2011)
Vaidya, J., Clifton, C., Zhu, M.: Privacy and data mining. In: Vaidya, J., Clifton, C., Zhu, M. (eds.) Privacy Preserving Data Mining, ch. 1. Springer, NY (2006)
Coursera website, http://www.coursera.org
MIT OpenCourseware website, ocw.mit.edu/index.htm
CMU’s Open Learning Initiative, oli.cmu.edu
Udacity website, http://www.udacity.com
Chase, W., Bown, F.: General Statistics, 3rd edn. John Wiley and Sons, NY (1997)
Kerns, G.: Introduction to Probability and Statistics Using R. G. Jay Kerns publisher (2010), http://www.amazon.com , incomplete, rough draft available on-line, cran.r-project.org/web/packages/IPSUR/vignettes/IPSUR.pdf (accessed February 19, 2014)
Der, G., Everitt, J.: Applied Medical Statistics using SAS. CRC Press, Boca Raton (2013)
IBM SPSS, http://www.ibm.com/software/analytics/spss/products/modeler/
StatSoft STATISTICA, http://www.statsoft.com/Solutions/Healthcare
Athena Health Electronic Medical Record (EMR), http://www.athenahealth.com/Clinicals
Wikipedia entry for S, http://en.wikipedia.org/wiki/S_%28programming_language%29
Becker, R.: A brief history of S, kabah.lcg.unam.mx/~lcollado/R/resources/history_of_S.pdf (accessed February 20, 2014)
Try R, http://tryr.codeschool.com/
Code School, http://www.codeschool.com/
O’Reilly, http://oreilly.com/
Computing for Data Analysis, Coursera, http://www.coursera.org/course/compdata
Data Analysis, Coursera, http://www.coursera.org/course/dataanalysis
Core Concepts in Data Analysis, Coursera, http://www.coursera.org/course/datan
Apple, http://www.apple.com
Prinzel, Y.: How Steve Jobs’ health problems have impacted Applestock over the years. Covestor.com (January 11, 2011), http://investing.covestor.com/2011/01/how-steve-jobs-health-problems-have-impacted-apple-stock-over-the-years-aapl
Ovide, S.: Amazon CEO Jeff Bezos evacuated for kidney stone attack. Wall Street Jour. (January 11, 2011), blogs.wsj.com/digits/2014/01/04/amazon-ceo-gives-kidney-stones-zero-stars/
Herper, M.: From fitbits to clinical studies: how big data could change medicine. Forbes on-line (December 16, 2013), http://www.forbes.com/sites/matthewherper/2013/12/16/from-fitbits-to-clinical-studies-how-big-data-could-change-medicine/
Athena Health website, http://www.athenahealth.com/our-company/about-us/medical-practice-management.php
Netlib, http://www.netlib.org/
Dongarra, J., Grosse, E.: Distribution of mathematical software via electronic mail. Comm. of the ACM 30(5), 403–407 (1987)
Sang-Hun, C.: Disgraced cloning expert convicted in South Korea. NY Times (October 26, 2009), http://www.nytimes.com/2009/10/27/world/asia/27clone.html?ref=hwangwoosuk&_r=0
Agence France-Presse, Tokyo: Japan looks into claim ‘falsified’ data was used in Alzheimer’s drug study (January 10, 2014), http://www.scmp.com/news/asia/article/1402542/japan-looks-claim-falsified-data-was-used-alzheimers-drug-study
Takenaka, K., Kelland, K.: Japanese scientist urges withdrawal of own ‘breakthrough’ stem cell research. InterAksyon.com (March 11, 2014), http://www.interaksyon.com/article/82452/japanese-scientist-urges-withdrawal-of-own-breakthrough-stem-cell-research
J-ADNI homepage, http://www.j-adni.org/etop.html
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Kobayashi, M. (2014). Resources for Studying Statistical Analysis of Biomedical Data and R. In: Holzinger, A., Jurisica, I. (eds) Interactive Knowledge Discovery and Data Mining in Biomedical Informatics. Lecture Notes in Computer Science, vol 8401. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-43968-5_10
Download citation
DOI: https://doi.org/10.1007/978-3-662-43968-5_10
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-662-43967-8
Online ISBN: 978-3-662-43968-5
eBook Packages: Computer ScienceComputer Science (R0)