Abstract
Big data consist of multiple fractions of small data. If you wish your big data to be valid, then you will, first, have to make sure, that the fractions are validated:
by the use of the scientific rules for clinical trials, and, in addition,
by the use of traditional diagnostic test validations.
Once this is all done well and good, only then you will be at the starting point of a serious big data analysis. Unfortunately, this is a pretty laborious scenario, and, although, currently, many data bases of big data do exist, most of them are, documentedly, of a poor quality and un-validated. Big data analyses tend to suffer from too many null-values, lack of experienced analysis teams, lacking validation tools, limited validation checklists. Big data tools are in expensive commercial software, and have not been judged by Academia. The best approach to big data analyses may be the use of large checklists, multiple analysis teams, and the use of multiple independent computers with simple programs rather than supercomputers with complex programs.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Becker D, King T, Mcmullen B (2015) Big data, big data quality problem. Proceedings of IEEE international conference on bog data, pp 2264–3053.
Gao G, Xie C, Tao C (2016) Big data validation and quality assurance issues, challenges and needs. In: Proceedings of IEEE symposium on service oriented system engineering, Oxford, UK, pp 433–441.
Gassman J, Owens W, Kuntz T, Martin J, Amoroso W (1995) Data quality assurance, monitoring, and reporting. Control Clin Trials 16:104–136
Laranjeiro N, Soydemir S, Bernardino J (2015) A survey on data quality, classifying poor data. In: Proceedings of IEEE 21st Pacific Rim international symposium, pp 179–188.
Woodall P, Gao J, Parlikad A, Koronios A (2015) Classifying data quality problems in asset management. Springer Publications, Heidelberg
Author information
Authors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this chapter
Cite this chapter
Cleophas, T.J., Zwinderman, A.H. (2019). Validating Big Data, a Big Issue. In: Efficacy Analysis in Clinical Trials an Update. Springer, Cham. https://doi.org/10.1007/978-3-030-19918-0_20
Download citation
DOI: https://doi.org/10.1007/978-3-030-19918-0_20
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-19917-3
Online ISBN: 978-3-030-19918-0
eBook Packages: Biomedical and Life SciencesBiomedical and Life Sciences (R0)