Skip to main content

Data Science: Transformation of Research and Scholarship

  • Chapter
  • First Online:
Big Data-Enabled Nursing

Abstract

The emergence of data science as a practice and discipline is revolutionizing research potential in all disciplines, but healthcare science has the potential to affect the health of individual lives. The use of existing data provides fertile ground for healthcare professionals to conduct research that will maximize quality outcomes, develop algorithms of care to increase efficiency and safety, and create predictive models that have the ability to prevent illness events and reduce healthcare expenditures. Data science can change practice through using existing and growing amounts of data to conduct research and build scholarship. Clinical trials, in some cases, may no longer be required to examine interventions. The pragmatic and efficient use of existing large cohort datasets has the ability to generate sample and control groups to determine efficacy. The collection of data from electronic medical records can provide substantial data to determine trends, construct algorithms, and consider disease and health behaviors modeling that alter patient care. Digital research incorporating vast amounts of data and new analytics has the ability to influence global healthcare.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 249.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    Over 200 variants associated with Type 2 diabetes are recorded in the database of Genotypes and Phenotypes ([dbGaP] found at http://www.ncbi.nlm.nih.gov/gap).

References

Download references

Author information

Authors and Affiliations

Authors

Corresponding authors

Correspondence to Lynda R. Hardy Ph.D., R.N., F.A.A.N. or Sandra Daack-Hirsch Ph.D., R.N. .

Editor information

Editors and Affiliations

Case Study 10.1: Complexity of Common Disease and Big Data

Case Study 10.1: Complexity of Common Disease and Big Data

Abstract

Human development, health, and disease processes are the culmination of complex interactions among DNA sequences, gene regulation—epigenetics, and the environment. To truly create individualized interventions to address prevention and treatment, complex data systems that are integrated are needed. As an exemplar, this case study will explore the complexity of information and the vast sources of big data and related analytics needed to better understand causes of Type 2 Diabetes (T2D), which in turn will drive individualized interventions to reduce risk and better treat individuals. Personalized healthcare (also called precision medicine) is becoming a reality. President Obama announced the Precision Medicine Initiative in February 2015. Conceptually, precision medicine has been defined as prevention and treatment strategies that take individual differences into account to generate knowledge applicable to the continuum of health and disease. To that end, this case study describes current initiatives to assemble and analyze the vast and complex phenotypic, genetic, epigenetic, and exposure information generated or that will be generated by researchers and clinicians on individuals, within a specific clinical example of T2D.

Keywords

Type 2 diabetes • Genetics/genomics • Exposome • Epigenetics • Omics • Big data • Personalized health care

1.1 10.1.1 Type 2 Diabetes (T2D) as a Significant Health Problem

Diabetes is a significant public health problem and its prevalence is increasing. As of 2012 an astounding 29.1 (9.3%) million Americans of all ages and racial/ethnic groups were affected by either Type 1 or Type 2 diabetes although, roughly 90–95% are Type 2. An estimated 8.1 million of those who have diabetes are undiagnosed (National Diabetes Information Clearinghouse [NDIC] 2014; Valdez et al. 2007) which potentiates the likelihood that they will also develop secondary health complications related to untreated diabetes (Klein Woolthuis et al. 2007). Diabetes is the leading cause of kidney failure, nontraumatic lower-limb amputation, heart disease, stroke, and new cases of blindness in the United States. In 2010 it was the seventh leading cause of death. As of 2012 medical expenses for people with diabetes (any form) were more than twice those for people without diabetes, costing Americans an estimated $245 billion in direct and indirect costs (NDIC 2014).

Type 2 diabetes (T2D) is a complex metabolic disease characterized by persistently elevated blood glucose caused by insulin resistance coupled with insulin deficiency. The substantial genetic component is thought to interact with environmental risk factors (plentiful diets and limited physical activity) to produce T2D. Several genetic variants have been associated with an increased risk to develop T2D (Wolfs et al. 2009), but the clinical validity of genetic variants alone to estimate diabetes risk remains limited, and genetic variants explain only a small portion of total risk variation (Burgio et al. 2015). While the staggering increase in prevalence of T2D is well documented, until recently the sharp increase in prevalence was largely thought to be driven by environmental factors experienced during adulthood (mainly an imbalance between energy intake and energy expenditure). However, there is also a sharp increase in T2D and obesity among persons under the age of twenty that challenges our understanding of risk factors (NDIC 2014). Just how genetic and environmental factors work in concert to produce the T2D phenotype remains unclear. Moreover, the environmental contribution to T2D is far more complex than an imbalance between energy intake and energy expenditure.

1.2 10.1.2 Factors Contributing to T2D

1.2.1 10.1.2.1 Genetics/Genomics

The genetic contribution to T2D has been established through research involving family history, twins studies, and genetic analysis. The lifetime risk for developing T2D in the Western world is reported to be between 7 and 10% (Burgio et al. 2015; Wolfs et al. 2009). Narayan et al. (2003) estimated that for individuals born in the United States in 2000, the lifetime risk for T2D is 1 in 3 for males and 2 in 5 for females. Family studies reveal that unaffected first-degree relatives of individuals with T2D have a two- to almost six-fold increase in risk to develop T2D over the course of a lifetime compared to people without a family history of T2D (Harrison et al. 2003; Valdez et al. 2007). The concordance rate for T2D among identical twins is high and is consistently reported to be greater than 50% in many populations (Medici et al. 1999; Newman et al. 1987; Poulsen et al. 2009). The fact that there are monogenetic (single gene) forms of diabetes (e.g., Maturity Onset Diabetes of the Young [MODY] and Permanent Neonatal Diabetes Mellitus [PNDM]) provides further evidence for a genetic role in the diabetes phenotype. Nevertheless, the majority of cases of diabetes are not one of the monogenetic forms; rather, T2D is a genetically complex disorder in which any number of genetic variants predispose an individual to develop the disease. Advances in genotyping technology have led to large-scale, population-based genetic studies to identify genetic variants (single nucleotide polymorphisms [SNPs]) associated with T2D. For example, the most recent studies have verified up to 65 SNPs to be associated with T2D (Morris et al. 2012; Talmud et al. 2015).

Discounting the rare monogenetic forms of diabetes and given the genetic and phenotypic heterogeneity of T2DFootnote 1 drawing firm conclusions about genotype—phenotype correlation using standard statistical analyses is difficult. To address statistical limitations, several efforts are underway to develop new algorithms to generate genetic risk scores (GRS) that predict T2D (Keating 2015; Talmud et al. 2015). As knowledge of the underlying genetic contribution grows, the prediction models improve. Mounting evidence shows that combining GRS and clinical risk factors (e.g., BMI, age, and sex) further improves the ability to detect incident cases (Keating 2015). However, the clinical utility of GRS remains problematic (Lyssenko and Laakso 2013). Developing risk prediction algorithms that combine GRS and phenotypic data for T2D is challenging in part due to the high heterogeneity in both the genetic factors and phenotypic elements of the disease. Moreover, genes and genetic variants at different locations in the genome (polygenic loci) that are associated with T2D are involved in multiple physiologic processes such as gluconeogenesis, glucose transport, and insulin homeostasis and many are also implicated in obesity (Burgio et al. 2015; Keating 2015; Slomko et al. 2012; Wolfs et al. 2009). To date, GRS use common genetic variants that have the strongest main effects. Other sources of genetic variance include rarer higher-penetrant variants, epigenetics, gene-gene and gene-environment interactions, and sex-specific genetic signals (Keating 2015; Lyssenko and Laakso 2013). The phenotype is also highly complex with patients presenting in various combinations of body type, age, family history, gestational diabetes, drug treatments, and comorbidities including obesity and metabolic syndrome. Detecting genetic differences is difficult when they are rare. Combining complex phenotype, interaction (gene x gene and or gene x environment), and gene variant information requires new data science approaches in order to leverage the complexity and create information that is clinically useful.

1.2.2 10.1.2.2 The Environment

In 2005 Wild coined the term “exposome” to describe the complementary environmental component of the gene-environment interaction indicative of complex traits and diseases (Wild 2005). As with the genetic component of T2D, the environmental component is also complex and plays a major role in the diabetes phenotype. Most of our knowledge of the T2D exposome is limited to the behavioral or modern living environment (Slomko et al. 2012). The modern living environment is characterized by increased access to low-cost, calorie-dense foods and increased sedentary lifestyle. The modern living environment is most amenable to intervention, and in fact interventions targeting diet and exercise are known to be effective in preventing or delaying the onset of T2D (Diabetes Prevention Program [DPP] Research Group 2002; Lindström et al. 2003; Venditti 2007). In the context of the modern living environment there is an emerging awareness of “unavoidable exposures” and their connection to T2D (Slomko et al. 2012). These are exposures to man-made chemicals through ambient particles, water, food, and use of consumer or personal care products—some are found in plastics and resins. These chemicals are ubiquitous in the everyday environment at levels below standards set by the Environmental Protection Agency and other regulatory agencies. While a single exposure is not likely to cause harm, little is known about chronic low-level exposure and risk for disease. Burgio et al. (2015) summarized the growing evidence that suggests endocrine-disrupting chemicals such as brominated flame retardants and organochlorine pesticides, heavy metals, and pharmaceuticals (e.g., corticosteroids, antipsychotics, beta-blockers, statins, thiazide diuretics) may interfere with β-cell function and induce insulin resistance (Burgio et al. 2015, p.809; Diabetes.co.uk 2015).

There is also emerging evidence that the gut microbiota composition could affect risk for T2D. Gut microbiota are important for intestinal permeability, host metabolism, host energy homeostasis, and human toxicodyamics (how chemicals affect the body). Changes in microbiota composition that interfere with these functions can lead to increased activation of inflammatory pathways which in turn interferes with insulin signaling, increase in energy harvesting and fat storage in adipose tissue, and potentiate the effect of chemical exposure—all potential pathways to increase risk for metabolic syndrome, obesity and or T2D. For a more in depth review of gut microbiota and T2D refer to Burgio et al. (2015) and Slomko et al. (2012).

1.3 10.1.3 Epigenetics

1.3.1 10.1.3.1 Overview of Epigenetics

Epigenetics may explain how genetic and environmental factors work in concert to produce T2D. “Epigenetics is the study of heritable changes (either mitotically or meiotically) that alter gene expression and phenotypes, but are independent from the underlying DNA sequence …”(Loi et al. 2013, p. 143). The epigenome is a series of chemical modifications (often referred to as tags or marks) that are superimposed on to the genome. In humans epigenetic modifications can either affect the proteins that are involved in the packaging of DNA into chromatin (known as chromatin modification), or directly attach to the DNA (e.g., DNA methylation). Epigenetic modification regulates gene expression by either activating (turning on) or deactiving (turning off) genes or segments of the DNA at given times (Genetics Learning Center 2014). Chromatin modification and DNA methylation are functionally linked to transcription and likely provide the mechanisms by which cells are programmed from one generation to the next. In other words, the epigenome activates genome in what is manifested as the phenotype.

During the pre-genomic era it was thought that disease and specific human traits were the direct result of variants in the DNA sequence (e.g., direct mutation of a single gene). However, very few diseases/traits are associated with only gene variants. To varying degrees, other factors such as poverty, nutrition, stress, and environmental toxin exposures can also contribute to health or lack thereof; yet none fully explain susceptibility to disease or variations in human traits. Environmental and social signals such as diet and stress can trigger changes in gene expression without changing the sequence of the DNA (Heijmans et al. 2008; McGowan et al. 2009; Mathers et al. 2010; Radtke et al. 2011; Weaver et al. 2004). Some of these epigenetic tags are cell specific and differentiate phenotype at a cellular level with respect to cell type and function. In a differentiated cell, only 10 to 20% of the genes are active (Genetic Science Learning Center 2014). Some epigenetic tags are acquired and lost over the life course of an individual, and some tags are passed on from generation to generation and may take several generations to change.

1.3.2 10.1.3.2 Examples of Epigenetic Modification and T2D

While it is widely known and accepted that maternal nutrition is of paramount importance to the health and development of the offspring, the precise biologic mechanisms linking maternal nutrition to offsprings’ wellbeing are just beginning to be understood. Epigenetic mechanisms may provide one such link. Evidence for epigenetic modification in the form of fetal programing can be found among individuals who were prenatally exposed to famine during the Dutch Hunger Winter in 1944–45. These individuals had less DNA methylation (hypomethylation) of the insulin-like growth factor 2 (IGF2) gene compared to their unexposed, same-sex siblings. IGF2 is a key factor in human growth and development. This epigenetic modification acquired in utero persisted throughout the children’s lifetime (Heijmans et al. 2008) and has been associated with higher rates of T2D, obesity, altered lipid profiles, and cardiovascular disease (Schulz 2010) among these children (Burgio et al. 2015).

A number of recent studies report changes in methylation patterns of specific genes associated with T2D (Rönn et al. 2013; Zhang et al. 2013; Ling et al. 2008; Yang et al. 2011; Kulkarni et al. 2012; Yang et al. 2012; Hall et al. 2013; Ribel-Madsen et al. 2012). Studies also reveal differential methylation patterns in genes associated with T2D among those affected by T2D (Zhang et al. 2013; Ling et al. 2008; Yang et al. 2011, 2012; Kulkarni et al. 2012; Hall et al. 2013) and in tissue specific samples (pancreases and mitochondria). These types of studies provide evidence that genotypes (DNA sequences) and their regulation (epigenetic modifications) are important factors contributing to T2D and that the epigenome is modifiable providing targets for interventions.

The environmental exposures described above (“unavoidable exposures”) could also lead to changes in gut microbiota composition. In fact, changes in gut microbiota composition have been shown to interfere with epigenetic regulation of FFAR3 gene in patients with T2D (Remely et al. 2014). FFAR3 is normally expressed in the pancreatic β-cells and mediates an inhibition of insulin secretion by coupling with other proteins (National Center for Biotechnology Information [NCBI] 2015). Interfering with the epigenetic regulation of FFAR3 would in turn lead to an inability to regulate insulin secretion appropriately.

1.3.3 10.1.3.3 Summary of Factors Contributing to T2D

T2D is the combination of biological contributing factors (genetics), environmental contributing factors (exposome), and the synthesis of biology and environment (epigenetics). Evolving epigenetic evidence suggests that epigenetic modifications could be important biomarkers for predicting risk, monitoring effectiveness of interventions, and targeting for therapy development to both prevent and treat T2D. Epigenetic patterns may serve as biomarkers connecting the exposome and genome (Fig. 10.1.1), thereby providing more comprehensive risk information for T2D. Unfavorable epigenetic modification may be reversed by lifestyle interventions, such as by modifying diet, increasing physical activity, and enriching the in utero environment. The rapid advances in genetic, exposome, and epigenetic sciences offer exciting possibilities for future discovery that will deepen our understanding of the complex balance between the environment and the genome, and how that balance influences health. Clearly an in-depth understanding T2D is largely dependent on big data and related advanced analytics.

Fig. 10.1.1
figure 4

Epigenetic modifications: the interaction of genetic and environmental risk factors over the life course

1.4 10.1.4 Current Initiatives to Leverage the Power of Big Data for Common Disease

1.4.1 10.1.4.1 Omics

Omics is the application of powerful high through-put molecular techniques to generate a comprehensive understanding of DNA, RNA, proteins, intermediary metabolites, micronutrients, and microbiota involved in biological pathways resulting in phenotypes. Scientists and informaticians are working on ways to integrate the layers of omic sciences and the exposome to better quantify an individual’s susceptibility to diseases such as T2D and to capitalize on his or her inherent protections against disease (Slomko et al. 2012). These techniques would allow for massive amounts of genomic, epigenomic, exposure, and phenotypic data to be analyzed in concert in order to build more powerful prediction models and provide targets for the development of prevention and treatment modalities.

1.4.2 10.1.4.2 Clinical Genomic Resources

Several initiatives are underway to assemble the vast and complex phenotypic, genetic, epigenetic and exposure information pertaining to wellness and disease states. These initiatives will leverage health information that is currently generated or will be generated by researchers and clinicians on individuals.

ClinGen. (http://clinicalgenome.org/) is a project to develop standard approaches for sharing genomic and phenotypic data provided by clinicians, researchers, and patients through centralized databases, such as ClinVar—a National Database of Clinically Relevant Genetic Variants (CRGV). ClinGen investigators are working to standardize the clinical annotation and interpretation of genomic variants. Goals of ClinGen include:

  • Share genomic and phenotypic data through centralized databases for clinical and research use

  • Standardize clinical annotation and interpretation of variants

  • Improve understanding of variation in diverse populations

  • Develop machine-learning algorithms to improve the throughput of variant interpretation

  • Implement evidence-based expert consensus for curation of clinical validity

  • Assess the ‘medical actionability’ of genes and variants to support their use in clinical care systems

  • Disseminate the collective knowledge/resources and ensure EHR interoperability (http://www.genome.gov/27558993)

Currently ClinGen efforts are focused on cardiovascular disease, pharmacogenomics, hereditary (germline) cancer, somatic cancer, and inborn errors of metabolism. However, knowledge generated on structure and process will serve as a template for approaching other diseases.

eMERGE. The Electronic Medical Records and Genomics (eMERGE) Network is a National Institutes of Health (NIH)-organized and funded consortium of U.S. medical research institutions. The eMERGE Network brings together researchers from leading medical research institutions across the country to conduct research in genomics, including discovery, clinical implementation and public resources. eMERGE was announced in September 2007 and began its third and final phase in September 2015 (http://www.genome.gov/27558993). The Network is comprised of six workgroups (see Table 10.1.1).

Table 10.1.1 eMerge workgroup summary. eMERGE https://emerge.mc.vanderbilt.edu/

The primary goal of the eMERGE Network is to develop, disseminate, and apply approaches to research that combine biorepositories with electronic medical record (EMR) systems for genomic discovery and genomic medicine implementation research. In addition, the consortium includes a focus on social and ethical issues such as privacy, confidentiality, and interactions with the broader community (eMERGE https://emerge.mc.vanderbilt.edu/).

PhenX. One of the limitations in being able to interpret findings from genome-wide association (GWA) studies is lack of uniform phenotypic descriptions and measures. For example, hundreds of associations between genetic variants and diabetes have been identified. However, most GWA studies have had relatively few phenotypic and exposure measures in common. Development and adoption of standard phenotypic and exposure measures could facilitate the creation of larger and more comprehensive datasets with a variety of phenotype and exposure data for cross-study analysis, thus increasing statistical power and the ability to detect associations of modest effect sizes and gene-gene and gene-environment interactions (http://www.genome.gov/27558993). PhenX was developed in recognition of the need for standard phenotypic and exposure measures, particularly as related to GWA studies. The National Human Genome Research Institute (NHGRI) initiated the PhenX Toolkit in 2006 with the goal of identifying and cataloguing 15 high-quality, well-established, and broadly applicable measures for each of 21 research domains (diabetes is one of these) for use in GWA studies and other large-scale genomic research (www.phenxtoolkit.org).

Roadmap Epigenetics Mapping Consortium. The National Institute of Health (NIH) Roadmap Epigenetics Mapping Consortium was created in an effort to understand epigenetic modifications and how these interact with underlying DNA sequences to contribute to health and disease. The project will provide publically available epigenetic maps on normal human tissues, support technology development, and provide funding in epigenetics research (National Institutes of Health 2015; Slomko et al. 2012).

Precision Medicine/Personalized Healthcare. Precision Medicine/personalized healthcare is a medical model that proposes to customize healthcare by incorporating medical decisions, practices, and products that are based on individual variability in genes, environment, and lifestyle. The potential ability of applying this concept more broadly has been dramatically improved by the recent development of large-scale biologic databases described above. The Precision Medicine Initiative Cohort Program proposes to:

  • Identify genomic variants that affect drug response

  • Assess clinical validity of genomic variants associated with disease

  • Identify biomarkers that are early indicators of disease

  • Understand chronic diseases and best management strategies

  • Understand genes/pathways/factors that protect from disease

  • Assess how well novel cellphone-based monitors of health work

  • Evaluate the ability of EHRs to integrate research data

  • Learn and apply new ways of engaging participants in research

  • Develop methodology for data mining and statistical analysis (https://www.nih.gov/precision-medicine-initiative-cohort-program)

1.5 10.1.5 Scope and Practice of Genetics/Genomics Nursing

The American Nurses Association in collaboration with the International Society of Nurses in Genetics provides an excellent resource for nurses interested in clinical genetics and nursing, the Genetics/Genomics Nursing: Scope and Standards of Practice, 2nd Edition (2016). This resource summarizes the role of nurses in genetics/genomics, which focuses on the actual and potential impact of genetic/genomic influences on health. Genetics/genomics nurses educate clients and families on genetic/genomic influences that might impact their health and intervene with the goals of optimizing health, reducing health risks, treating disease, and promoting wellness. This practice depends upon research and evidence-based practice, interprofessional collegiality and collaboration with genetics/genomics professionals and other healthcare professionals to provide quality patient care.

1.6 10.1.6 Conclusion

In conclusion, T2D is an increasingly common and complex disorder with genome, exposome, and epigenome factors contributing to the widely variable phenotype. Initiatives in precision medicine propose to customize healthcare by integrating data and information pertaining to individual variability in genes, environment, and lifestyle and interpreting this information to inform medical decisions, practices, and products that prevent, delay, and more effectively treat individuals who are at risk or have T2D. While many of our current initiatives build the evidence base needed to guide clinical practice for the individual, society also needs to be mindful of the social inequalities of opportunity including education, environmental quality, and access, not only to health care but to nutritious food, recreation, and community supports that contrite health and disease. These social determinants are part of the individual’s exposome, and yet are often beyond the control of the individual. Finally, motivating individuals at higher risk to engage in lifestyle changes to reduce their risk for T2D remains challenging. Communicating risk information about T2D is further complicated by how a person personalizes and rationalizes his or her risk to develop it (Shah et al. in press; Walter and Emory 2005). Knowing about genetic risk is not enough to motivate people to change behaviors (Grant et al. 2013). An important knowledge gap to fill is our understanding of how people at increased risk for T2D come to understand and manage behaviors to reduce their risk for disease. Understanding a person’s beliefs may facilitate effective collaboration with healthcare providers, and improve risk reduction education using a truly comprehensive personalized approach.

References

  • American Nurses Association. Genetics/genomics nursing: scope and standards of practice. 2nd ed. Washington, DC: ANA; 2016.

  • Bell G (2009). Forward in The Fourth Paradigm: Data-Intensive Scientific Discovery (p.xv). Redmond, WA, Microsoft.

  • Burgio E, Lopomo A, Migliore L. Obesity and diabetes: from genetics to epigenetics. Mol Biol Rep. 2015;42:799–818. doi:10.1007/s11033-014-3751-z.

  • Diabetes.co.uk the Global Diabetes Community. Drug induced diabetes. 2015. http://www.diabetes.co.uk/drug-induced-diabetes.html. Accessed 30 Jan 2016.

  • Knowler WC, Barrett-Connor E, Fowler SE, Hamman RF, Lachin JM, Walker EA, Nathan DM. Diabetes prevention program research group 2002. Reduction in the incidence of Type 2 diabetes with lifestyle intervention or Metformin. N Engl J Med. 2002;346(6):393–403. doi:10.1056/NEJMoa012512.

  • Diabetes Prevention Program [DPP] Research Group 2002. The diabetes prevention program (DPP): description of lifestyle intervention. Diabetes Care. 2002;25(12):2165–71. doi:10.2337/diacare.25.12.2165.

  • Genetic Science Learning Center. Epigenetics. 2014. http://learn.genetics.utah.edu/content/epigenetics/. Accessed 30 Jan 2016.

  • Grant RW, O’Brien KE, Waxler JL, Vassy JL, Delahanty LM, Bissett LG, Green RC, Stember KG, Guiducci C, Park ER, Florez JC, Meigs JB. Personalized genetic risk counseling to motivate diabetes prevention: a randomized trial. Diabetes Care. 2013;36(1):13–9. doi:10.2337/dc12-0884.

  • Hall E, Dayeh T, Kirkpatrick CL, Wollheim CB, Dekker Nitert M, Ling C. DNA methylation of the glucagon-like peptide 1 receptor (GLP1R) in human pancreatic islet. BMC Med Genet. 2013;14:76. doi:10.1186/1471-2350-14-76.

  • Harrison TA, Hindorff LA, Kim H, Wines RC, Bowen DJ, McGrath BB, Edwards KL. Family history of diabetes as a potential public health tool. Am J Prev Med. 2003;24(2):152–9. doi:10.1016/S0749-3797(02)00588-3.

  • Heijmans BT, Tobi EW, Stein AD, Putter H, Blauw GJ, Susser ES, Slagboom PE, Lumey LH. Persistent epigenetic differences associated with prenatal exposure to famine in humans. Proceedings from the National Academy of Science. 2008;105(44):17046–9. doi:10.1073/pnas.0806560105.

  • Jaenisch R, Bird A. Epigenetic regulation of gene expression: how the genome integrates intrinsic and environmental signals. Nat Genet. 2003;33(Suppl):245–54. doi:10.1038/ng1089.

  • Keating BJ. Advances in risk prediction of type 2 diabetes: integrating genetic scores with Framingham risk models. Diabetes. 2015;64(5):1495–7. doi:10.2337/db15-0033.

  • Klein Woolthuis EP, de Grauw WJ, van Gerwen WH, van den Hoogen HJ, van de Lisdonk EH, Metsemakers JF, van Weel C. Identifying people at risk for undiagnosed Type 2 diabetes using the GP’s electronic medical record. Fam Pract. 2007;24(3):230–6. doi:10.1093/fampra/cmm018.

  • Kulis M, Esteller M. DNA methylation and cancer. Adv Genet. 2010;70:27–56. doi:10.1016/B978-0-12-380866-0.60002-2.

  • Kulkarni SS, Salehzadeh F, Fritz T, Zierath JR, Krook A, Osler ME. Mitochondrial regulators of fatty acid metabolism reflect metabolic dysfunction in Type 2 diabetes mellitus. Metabolism. 2012;6(2):175–85. doi:10.1016/j.metabol.2011.06.014.

  • Lindström J, Louheranta A, Mannelin M, Rastas M, Salminen V, Eriksson J, Uusitupa M, Tuomilehto J. Finnish Diabetes Prevention Study group 2003. The Finnish Diabetes Prevention Study (DPS): lifestyle intervention and 3-year results on diet and physical activity. Diabetes Care. 2003;26(12):3230–6. doi:10.2337/diacare.26.12.3230.

  • Ling C, Del Guerra S, Lupi R, Rönn T, Granhall C, Luthman H, Masiello P, Marchetti P, Groop L, Del Prato S. Epigenetic regulation of PPARGC1A in human Type 2 diabetic islets and effect on insulin secretion. Diabetologia. 2008;51(2):615–22. doi:10.1007/s00125-007-0916-5.

  • Lyssenko V, Laakso M. Genetics screening and the risk for Type 2 diabetes: worthless or valuable? Diabetes Care. 2013;36(Suppl(2)):S120–6. doi:10.2337/dcS13-2009.

  • Loi M, Del Savio L, Stupka E. Social epigenetics and equality of opportunity. Public Health Ethics. 2013;6(2):142–53. doi:10.1093/phe/pht019.

  • McGowan PO, Sasaki A, D’Alessio AC, Dymov S, Labonté B, Szyf M, Turecki G, Meaney MJ. Epigenetic regulation of the glucocorticoid receptor in human brain associates with childhood abuse. Nat Neurosci. 2009;12(3):342–8. doi:10.1038/nn.2270.

  • Mathers JC, Strathdee G, Relton CL. Induction of epigenetic alterations by dietary and other environmental factors. Adv Genet. 2010;71:3–39. doi:10.1016/B978-0-12-380864-6.00001-8.

  • Medici F, Hawa M, Ianari A, Pyke DA, Leslie RD. Concordance rate for type II diabetes mellitus in monozygotic twins: actuarial analysis. Diabetologia. 1999;42(2):146–50. doi:10.1007/s001250051132.

  • Morris AP, Voight BF, Teslovich TM, et al. Wellcome Trust Case Control Consortium; Meta-Analyses of Glucose and Insulin-related traits Consortium (MAGIC) Investigators; Genetic Investigation of ANthropometric Traits (GIANT) Consortium; Asian Genetic Epidemiology Network–Type 2 Diabetes (AGEN-T2D) Consortium; South Asian Type 2 Diabetes (SAT2D) Consortium; DIAbetes Genetics Replication and Meta-analysis (DIAGRAM) Consortium 2012. Large-scale association analysis provides insights into the genetic architecture and pathophysiology of type 2 diabetes. Nat Genet. 2012;44:981–90. doi:10.1038/ng.2383.

  • National Diabetes Information Clearinghouse [NDIC]. National diabetes statistics. 2014. http://diabetes.niddk.nih.gov/dm/pubs/statistics/#Prevention. Accessed 27 Jan 2016.

  • National Center for Biotechnology Information (NCBI). FFAR3 free fatty acid receptor 3 [Homo sapiens (human)]. 2015. http://www.ncbi.nlm.nih.gov/gene/2865. Accessed 30 Jan 2016.

  • National Institutes of Health. The National Institutes of Health (NIH) roadmap epigenomics mapping consortium. 2015. http://www.roadmapepigenomics.org. Accessed 30 Jan 2016.

  • Narayan KM, Boyle JP, Thompson TJ, Sorenson SW, Williamson DF. Lifetime risk for diabetes mellitus in the United States. JAMA. 2003;290(14):1884–90. doi:10.1001/jama.290.14.1884.

  • Newman B, Selby JV, King MC, Slemenda C, Fabsitz R, Friedman GD. Concordance for type 2 (non-insulin-dependent) diabetes mellitus in male twins. Diabetologia. 1987;30(10):763–8. doi:10.1007/BF00275741.

  • Poulsen P, Grunnet LG, Pilgaard K, Storgaard H, Alibegovic A, Sonne MP, Carstensen B, Beck-Nielsen H, Vaag A. Increased risk of Type 2 diabetes in elderly twins. Diabetes. 2009;58(6):1350–5. doi:10.2337/db08-1714.

  • Puumala SE, Hoyme HE. Epigenetics in pediatrics. Pediatr Rev. 2015;36:14–21. doi:10.1542/pir.36-1-14.

  • Radtke KM, Ruf M, Gunter HM, Dohrmann K, Schauer M, Meyer A, Elbert T. Transgenerational impact of intimate partner violence on methylation in the promoter of the glucocorticoid receptor. Transl Psychiatry. 2011;1:21. doi:10.1038/tp.2011.21.

  • Remely M, Aumueller E, Merold C, Dworzak S, Hippe B, Zanner J, Pointner A, Brath H, Haslberger AG. Effects of short chain fatty acid producing bacteria on epigenetic regulation of FFAR3 in type 2 diabetes and obesity. Gene. 2014;537:85–92. doi:10.1016/j.gene.2013.11.081.

  • Ribel-Madsen R, Fraga MF, Jacobsen S, Bork-Jensen J, Lara E, Calvanese V, Fernandez AF, Friedrichsen M, Vind BF, Højlund K, Beck-Nielsen H, Esteller M, Vaag A, Poulsen P. Genome-wide analysis of DNA methylation differences in muscle and fat from monozygotic twins discordant for type 2 diabetes. PLoS One. 2012;7(12):e51302. doi:10.1371/journal.pone.0051302.

  • Rönn T, Volkov P, Davegårdh C, Dayeh T, Hall E, Olsson AH, Nilsson E, Tornberg A, Dekker Nitert M, Eriksson KF, Jones HA, Groop L, Ling C. A six months exercise intervention influences the genome-wide DNA methylation pattern in human adipose tissue. PLoS Genet. 2013;9(6):e31003572. doi:10.1371/journal.pgen.1003572.

  • Schulz LC. The Dutch hunger winter and the developmental origins of health and disease. Proceedings from the National Academy of Science. 2010;107(39):16757–8. doi:10.1073/pnas.1012911107.

  • Shah L, Perkhounkova Y, Daack-Hirsch S. Evaluation of the perception of risk factors for Type 2 diabetes instrument (PRF-T2DM) in an at-risk, non-diabetic population. J Nurs Meas. In press.

  • Slomko H, Heo HJ, Einstein FH. Minireview: epigenetics of obesity and diabetes in humans. Endocrinology. 2012;153(3):1025–30. doi:10.210/en.2011-1759.

  • Strahl BD, Allis CD. The language of covalent histone modifications. Nature. 2000;403(6765):41–5. doi:10.1038/47412.

  • Talmud PJ, Cooper JA, Morris RW, Dudbridge F, Shah T, Engmann J, Dale C, White J, McLachlan S, Zabaneh D, Wong A, Ong KK, Gaunt T, Holmes MV, Lawlor DA, Richards M, Hardy R, Kuh D, Wareham N, Langenberg C, Ben-Shlomo Y, Wannamethee SG, Strachan MW, Kumari M, Whittaker JC, Drenos F, Kivimaki M, Hingorani AD, Price JF, Humphries SE. Sixty-five common genetic variants and prediction of Type 2 diabetes. Diabetes. 2015;64(5):1830–40. doi:10.2337/db14-1504.

  • Valdez R, Yoon PW, Liu T, Khoury MJ. Family history and prevalence of diabetes in the U.S. population: the 6-year results from the National Health and Nutrition Examination Survey (1999–2004). Diabetes Care. 2007;30(10):2517–22. doi:10.2337/dc07-0720.

  • Venditti EM. Efficacy of lifestyle behavior change programs in diabetes. Curr Diab Rep. 2007;7(2):123–7. doi:10.1007/s11892-007-0021-7.

  • Walter FM, Emery J. Coming down the line—patients’ understanding of their family history of common chronic disease. Ann Fam Med. 2005;3(5):405–14. doi:10.1370/afm.368.

  • Weaver IC, Cervoni N, Champagne FA, D’Alessio AC, Sharma S, Seckl JR, Dymov S, Szyf M, Meaney MJ. Epigenetic programming by maternal behavior. Nat Neurosci. 2004;8:847–54. doi:10.1038/nn1276.

  • Wild CP. Complementing the genome with an ‘exposome’: the outstanding challenge of environmental exposure measurement in molecular epidemiology. Cancer Epidemiol Biomark Prev. 2005;14(8):1847–50. doi:10.1093/mutage/gen061.

  • Wolfs MG, Hofker MH, Wijmenga C, van Haefte TW. Type 2 diabetes mellitus: new genetic insights will lead to new therapeutics. Curr Genomics. 2009;10(2):110–8. doi:10.2174/138920209787847023.

  • Yang BT, Dayeh TA, Kirkpatrick CL, Taneera J, Kumar R, Groop L, Wollheim CB, Nitert MD, Ling C. Insulin promoter DNA methylation correlates negatively with insulin gene expression and positively with HbA(1c) levels in human pancreatic islets. Diabetologia. 2011;54(2):360–7. doi:10.1007/s00125-010-1967-6.

  • Yang BT, Dayeh TA, Volkov PA, Kirkpatrick CL, Malmgren S, Jing X, Renström E, Wollheim CB, Nitert MD, Ling C. Increased DNA methylation and decreased expression of PDX-1 in pancreatic islets from patients with type 2 diabetes. Mol Endocrinol. 2012;26(7):1203–12. doi:10.1210/me.2012-1004.

  • Zhang Y, Kent 2nd JW, Lee A, Cerjak D, Ali O, Diasio R, Olivier M, Blangero J, Carless MA, Kissebah AH. Fatty acid binding protein 3 (fabp3) is associated with insulin, lipids and cardiovascular phenotypes of the metabolic syndrome through epigenetic modifications in a Northern European family population. BMC Med Genet. 2013;6:9. doi:10.1186/1755-8794-6-9.

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing AG

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Hardy, L.R., Bourne, P.E. (2017). Data Science: Transformation of Research and Scholarship. In: Delaney, C., Weaver, C., Warren, J., Clancy, T., Simpson, R. (eds) Big Data-Enabled Nursing. Health Informatics. Springer, Cham. https://doi.org/10.1007/978-3-319-53300-1_10

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-53300-1_10

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-53299-8

  • Online ISBN: 978-3-319-53300-1

  • eBook Packages: MedicineMedicine (R0)

Publish with us

Policies and ethics