Ethics and Epistemology in Big Data Research
Biomedical innovation and translation are increasingly emphasizing research using “big data.” The hope is that big data methods will both speed up research and make its results more applicable to “real-world” patients and health services. While big data research has been embraced by scientists, politicians, industry, and the public, numerous ethical, organizational, and technical/methodological concerns have also been raised. With respect to technical and methodological concerns, there is a view that these will be resolved through sophisticated information technologies, predictive algorithms, and data analysis techniques. While such advances will likely go some way towards resolving technical and methodological issues, we believe that the epistemological issues raised by big data research have important ethical implications and raise questions about the very possibility of big data research achieving its goals.
Keywords: Big data; Real world data; Ethics; Epistemology
We would like to thank Associate Professor Ainsley Newson for her helpful guidance on an earlier version of this article.
Compliance with Ethical Standards
Research related to this article has been funded by the National Health and Medical Research Council (Career Development Fellowship APP1036539 and Project Grant APP1083980).
Conflict of Interest
The authors have no conflicts of interest.