Chapter 11: Challenges in and Principles for Conducting Systematic Reviews of Genetic Tests used as Predictive Indicators

Jonas, Daniel E.; Wilt, Timothy J.; Taylor, Brent C.; Wilkins, Tania M.; Matchar, David B.

doi:10.1007/s11606-011-1898-z

Chapter 11: Challenges in and Principles for Conducting Systematic Reviews of Genetic Tests used as Predictive Indicators

Original Research
Open access
Published: 31 May 2012

Volume 27, pages 83–93, (2012)
Cite this article

Download PDF

You have full access to this open access article

Journal of General Internal Medicine Aims and scope Submit manuscript

Chapter 11: Challenges in and Principles for Conducting Systematic Reviews of Genetic Tests used as Predictive Indicators

Download PDF

Daniel E. Jonas MD, MPH¹,
Timothy J. Wilt MD, MPH^2,3,
Brent C. Taylor PhD, MPH^2,3,
Tania M. Wilkins MS⁴ &
…
David B. Matchar MD^5,6

1506 Accesses
6 Citations
Explore all metrics

Abstract

In this paper, we discuss common challenges in and principles for conducting systematic reviews of genetic tests. The types of genetic tests discussed are those used to 1). determine risk or susceptibility in asymptomatic individuals; 2). reveal prognostic information to guide clinical management in those with a condition; or 3). predict response to treatments or environmental factors. This paper is not intended to provide comprehensive guidance on evaluating all genetic tests. Rather, it focuses on issues that have been of particular concern to analysts and stakeholders and on areas that are of particular relevance for the evaluation of studies of genetic tests. The key points include:

The general principles that apply in evaluating genetic tests are similar to those for other prognostic or predictive tests, but there are differences in how the principles need to be applied or the degree to which certain issues are relevant.
A clear definition of the clinical scenario and an analytic framework is important when evaluating any test, including genetic tests.
Organizing frameworks and analytic frameworks are useful constructs for approaching the evaluation of genetic tests.
In constructing an analytic framework for evaluating a genetic test, analysts should consider preanalytic, analytic, and postanalytic factors; such factors are useful when assessing analytic validity.
Predictive genetic tests are generally characterized by a delayed time between testing and clinically important events.
Finding published information on the analytic validity of some genetic tests may be difficult. Web sites (FDA or diagnostic companies) and gray literature may be important sources.
In situations where clinical factors associated with risk are well characterized, comparative effectiveness reviews should assess the added value of using genetic testing along with known factors compared with using the known factors alone.
For genome-wide association studies, reviewers should determine whether the association has been validated in multiple studies to minimize both potential confounding and publication bias. In addition, reviewers should note whether appropriate adjustments for multiple comparisons were used.

Clinical Genetic Research 1: Bias

Contributions of the UK biobank high impact papers in the era of precision medicine

Article 28 January 2020

The clinical utility of polygenic risk scores in genomic medicine practices: a systematic review

Article Open access 30 April 2022

With recent advances in genotyping, it is expected that whole genome sequencing will soon be available for less than $1000. Consequently, the number of studies of genetic tests will likely increase substantially, as will the need to evaluate studies of genetic tests. The general principles for evaluating genetic tests are similar to those for interpreting other prognostic or predictive tests, but there are differences in how the principles need to be applied and the degree to which certain issues are relevant, particularly when considering genetic test results that provide predictive rather than diagnostic information.

This paper focuses on issues of particular concern to analysts and stakeholders and areas of particular relevance for the evaluation of studies of genetic tests. It is not intended to provide comprehensive guidance on evaluating all genetic tests. We reflect on genetic tests used to 1) determine risk or susceptibility in asymptomatic individuals (to identify individuals at risk for future health conditions, such as BRCA1 and BRCA2 for breast and ovarian cancer); 2) reveal prognostic information to guide clinical management and treatment in those with a condition (e.g., Oncotype Dx® for breast cancer recurrence, a test to evaluate the tumor genome of surgically excised tumors from patients with breast cancer); or 3) predict response to treatments or environmental factors including diet (nutrigenomics), drugs (pharmacogenomics, such as CYP2C9 and VKORC1 tests to inform warfarin dosing), infectious agents, chemicals, physical agents, and behavioral factors. We do not address genetic tests used for diagnostic purposes. We address issues related to both heritable mutations and somatic mutations (e.g., genetic tests for tumors).

Clinicians, geneticists, analysts, policymakers, and other stakeholders may have varying definitions of what is considered a “genetic test.” We have chosen to use a broad definition in agreement with that of the Centers for Disease Control and Prevention (CDC)-sponsored Evaluation of Genomic Applications in Practice and Prevention (EGAPP) and the Secretary’s Advisory Committee on Genetics, Health, and Society,1 namely: “A genetic test involves the analysis of chromosomes, deoxyribonucleic acid (DNA), ribonucleic acid (RNA), genes, or gene products (e.g., enzymes and other proteins) to detect heritable or somatic variations related to disease or health. Whether a laboratory method is considered a genetic test also depends on the intended use, claim, or purpose of a test.”1 The same technologies are used for diagnostic and predictive genetic tests; it is the intended use of the test result that determines whether it is a diagnostic or predictive test.

In this paper, we discuss principles for addressing challenges related to developing the topic and structuring a genetic test review (context and scoping), as well as performing the review. This paper is meant to complement the Methods Guide for Comparative Effectiveness Reviews.2 We do not attempt to reiterate the challenges and principles described in earlier sections of this Medical Test Methods Guide, but focus instead on issues of particular relevance for evaluating studies of genetic tests. Although we have written this paper to serve as guidance for the Agency for Healthcare Research and Quality (AHRQ) Evidence-based Practice Centers (EPCs), we also intend for this to be a useful resource for other investigators interested in conducting systematic reviews on genetic tests.

COMMON CHALLENGES

Genetic tests are different from other medical tests in their relationship to the outcomes measured. Reviewers need to take into account the penetrance of the disease, time lag to outcomes, variable expressivity, and pleiotropy (as defined below). These particular aspects of genetic tests result in specific actions at various stages of planning and performing the review. Both single-gene and polygenic disorders are known. Single gene disorders are the result of a single mutated gene and may be passed on to subsequent generations in various well-described ways (i.e., autosomal dominant, autosomal recessive, X-linked). Polygenic disorders are the result of the combined action of more than one gene and are not inherited by simple Mendelian patterns. Some examples include heart disease and diabetes. Some of the terms described below (penetrance, variable expressivity, and pleiotropy) are generally used to describe single-gene disorders.

Penetrance

Evaluations of predictive genetic tests should always consider penetrance, defined as “the proportion of people with a particular genetic change who exhibit signs and symptoms of a disorder.”3 Penetrance is a key factor in determining the future risk of developing disease and assessing the overall clinical utility of predictive genetic tests. Sufficient data to determine precise estimates of penetrance are sometimes lacking.4,5 This can be due to the lack of reliable prevalence data or a lack of long-term outcomes data. In such cases, determining the overall clinical utility of a genetic test is difficult. In some cases, modeling with sensitivity analyses can be helpful to develop estimates.4

Time Lag

The time lag between genetic testing and clinically important events should be assessed in critical appraisal of studies of such tests. Whether the duration of studies and follow-up are sufficient to characterize the relationship between positive tests and clinical outcomes are important considerations. In addition, it should be determined whether or not subjects have reached the age beyond which clinical expression would be likely.

Variable Expressivity

Variable expressivity refers to the range of severity of the signs and symptoms that can occur in different people with the same condition.3 For example, the features of hemochromatosis vary widely. Some individuals have mild symptoms, while others experience life-threatening complications such as liver failure. The degree of expressivity should be considered in the evaluation of genetic tests.

Pleiotropy

Pleiotropy occurs when a single gene influences multiple phenotypic traits. For example, the genetic mutation causing Marfan syndrome results in cardiovascular, skeletal, and ophthalmologic abnormalities. Similarly, BRCA mutations can increase the risk of a number of cancers, including breast, ovarian, prostate, and melanoma.

Other Common Challenges

Another common challenge in evaluating predictive genetic tests is that direct evidence for the impact of the test results on health outcomes is often lacking. The evidence base may often be too limited in scope to evaluate the clinical utility of the test. In addition, it is often difficult to find published information on various aspects of genetic tests, especially data related to analytic validity. For example, laboratory-developed tests (LDT) are regulated by the Centers for Medicare & Medicaid Services (CMS) Clinical Laboratory Improvement Act (CLIA) regulations for clinical laboratories. CLIA does not require clinical validation and many LDTs have had no clinical validation or clinical utility studies.

Genetic tests also have a number of technical issues that are particularly relevant to assessing their analytic validity. These technical issues may differ according to the type of genetic test and may influence the interpretation of a genetic test result. Technical issues may also differ depending on the specimen being tested. For example, there are different considerations when assessing tumor genomes as opposed to human genomes.

Common challenges arise when attempting to use genetic tests to determine susceptibility or risk in asymptomatic individuals. The utility of such tests may depend on the ability of respondents, such as the patient or their relative, to report and identify certain clinical factors. For instance, if patients cannot accurately recall the family history of a heritable disease, it can be difficult to assess their risk of developing the disease.

Finally, statistical issues must be taken into account when evaluating studies of genetic tests. For example, genetic test results are often derived from analytically complex studies that have undergone a very large number of statistical tests, creating a high risk of Type I error (i.e., a spurious association is deemed significant).

Principles for Addressing the Challenges (Box 11-1)

Principle 1: Use an Organizing Framework Appropriate for Genetic Tests

Organizing frameworks for evaluating genetic tests have been developed by the United States Preventive Services Task Force (USPSTF), the CDC, and EGAPP.1,6,7 The model endorsed by the EGAPP initiative1 was based on a previous Task Force report8 and developed through a CDC-sponsored project, which piloted an evidence evaluation framework that applied the following three criteria: 1) analytic validity (technical accuracy and reliability), 2) clinical validity (ability to detect or predict an outcome, disorder, or phenotype), and 3) clinical utility (whether use of the test to direct clinical management improves patient outcomes). A fourth criterion was added: 4) ethical, legal, and social implications.6 The ACCE model (Analytic validity, Clinical validity, Clinical utility, and Ethical, legal and social implications) includes a series of 44 questions that are useful for analysts in defining the scope of a review, as well as for critically appraising studies of genetic tests (Table 1). The initial seven questions help to guide an understanding of the disorder, the setting, and the type of testing. A detailed description of the methods of the EGAPP Working Group is published elsewhere.1

Principle 2: Develop Analytic Frameworks to Reflect the Predictive Nature of Genetic Tests and Incorporate Appropriate Outcomes

It is important to have a clear definition of the clinical scenario and analytic framework when evaluating any test, including a predictive genetic test. Prior to performing a review, analysts should develop clearly defined key questions and understand the needs of decision makers and the context in which the tests are used. They should consider whether this is a test used for determining future risk of disease in asymptomatic individuals, establishing prognostic information that will influence treatment decisions, or predicting response to treatments (either effectiveness or harms)—or used for some other purpose. They should clarify the type of specimens used for the genetic test under evaluation (i.e., patient genome or tumor genome). The PICOTS typology (Patient population, Intervention, Comparator, Outcomes, Timing, Setting) should be clearly described as it will inform the development of the analytic framework and vice versa.

In constructing an analytic framework, it may be useful for analysts to consider preanalytic, analytic, and postanalytic factors particularly applicable to genetic tests (described later in this paper), as well as the key outcomes of interest. Analytic frameworks should incorporate the factors and outcomes of greatest interest to decision makers. Figure 1 illustrates a generic analytic framework for evaluating predictive genetic tests that can be modified as necessary for various situations.

In addition to effects on family members, psychological distress and possible stigmatization or discrimination are potential harms that may result from predictive genetic tests, particularly those test results that predict probability of disease occurring with a high likelihood, especially if no proven preventive or ameliorative measures are available. For these potential harms, analysts should take into account whether the testing is for inherited or acquired genetic mutations since these factors influence the potential for harms. In addition, whether the condition related to the test is multifactorial or follows classic Mendelian inheritance will affect the potential for these harms.

Other important outcomes to consider when evaluating genetic tests include, but are not limited to, cost, quality of life, long-term morbidity, and indirect impact. Genetic tests may have an impact on decisions that are difficult to measure, yet very important, such as decisions regarding pregnancy.

Depending on the context, the impact of genetic testing on family members may be important, particularly in cases that involve testing for heritable conditions. One approach to including family members in the analytic framework is illustrated in Figure 2.

Principle 3: Search Databases Appropriate for Genetic Tests

The Human Genome Epidemiology Network (HuGE Net) Web site can provide a helpful supplement to searches, as it includes many meta-analyses of genetic association studies as well as a source called the HuGE Navigator that can identify all types of available studies related to a genetic test.9

When assessing the gray literature, U.S. Food and Drug Administration (FDA)-approved test package inserts contain summaries of the analytic validity data. Package inserts are available on the FDA and manufacturer Web sites. Laboratory-developed tests do not require FDA clearance, and there is no requirement for publicly available data on analytic validity. When there are no published data on analytic validity of a genetic test, the external proficiency testing program carried out jointly by the American College of Medical Genetics (ACMG) and the College of American Pathologists (CAP) can be useful in establishing the degree of laboratory-to-laboratory variability, as well as some sense of reproducibility.10–12 Other potentially useful sources of unpublished data include conference publications from professional societies (e.g., the College of American Pathologists), the GeneTests Web site (www.genetests.org), the Association for Molecular Pathology Web site (www.amp.org), CDC programs (e.g., the Genetic Testing Reference Materials Coordination Program and the Newborn Screening Quality Assurance Program), and international proficiency testing programs.13

An AHRQ “horizon scan” found two databases—the LexisNexis® database (www.lexisnexis.com) and Cambridge Healthtech Institute (CHI) (www.healthtech.com/)—that had high utility in identifying genetic tests in development for clinical cancer care. A number of others had low-to-moderate utility, and some were not useful.14

Principle 4: Consult with Experts to Determine which Technical Issues are Important to Address in Assessing Genetic Tests

There are a number of technical issues related to analytic validity that can influence the interpretation of a genetic test result, including preanalytic, analytic, and postanalytic factors.15,16 In general, preanalytic steps are those involved in obtaining, fixing or preserving, and storing samples prior to staining and analysis. Important analytic variables include the type of assay chosen and its reliability, types of samples, the specific analyte investigated, specific genotyping methods, timing of sample analysis, and complexity of performing the assay. Postanalytic variables relate to the complexity of interpreting the test result, variability from laboratory to laboratory, and quality control.15,16 Comparative effectiveness review teams should include or consult with molecular pathologists, geneticists, or others familiar with the issues related to the process of performing and reporting genetic tests to determine which of these technical issues are pertinent for a given review. Table 2 summarizes some of the preanalytic, analytic, and postanalytic questions that should be addressed.

For genetic testing of tumor specimens, it is important to understand that the tumor genome may be in a dynamic state, with mutations emerging over time (e.g., due to drug exposure or disruption of cellular repair). Tumor specimens will often contain normal cells from the patient as well as tumor cells. To accurately assess for somatic mutations using tumor specimens, particular strategies may be needed, such as enriching samples for tumor cells (e.g., by microscopic evaluation and dissection of tumor cells).

Principle 5: Distinguish Between Functional Assays and DNA-based Assays to Determine Important Technical Issues

Some studies may utilize DNA-based assays whereas others may utilize functional assays with different sensitivities and specificities. Functional assays, in which a substrate or product of a metabolic process affected by a particular genetic polymorphism is measured, may have the advantage of showing potentially more important information than the presence of the genetic polymorphism itself. However, they may be affected by a number of factors and do not necessarily reflect the polymorphism alone. Unmeasured environmental factors, other genetic polymorphisms, and various disease states may influence the results of functional assays. In addition, functional assays that measure enzyme activity are taken at a single point in time. Depending on the enzyme and polymorphism being evaluated, the variation in enzyme activity over time should be considered in critical appraisal. Inconsistent results between studies using DNA-based molecular methods and those using phenotypic assays have been reported.16–18

For DNA-based tests, a variety of sample sources are available (e.g., blood, cheek swab, hair) that should hypothetically result in identical genotype results.16,19–23 However, DNA may be more difficult to obtain and purify from some tissues than from blood, particularly if the tissues have been fixed in paraffin versus fresh samples (DNA extraction from formalin-fixed tissue is difficult, but sometimes possible).16 Some studies utilize different sources of DNA for cases and controls, introducing potential measurement bias from differences in the ease of technique and test accuracy. Extraction of DNA from tumors in oncology studies may raise additional issues that influence analytic validity, including the quantity of tissue, admixture of normal and cancerous tissue, amount of necrosis, timing of collection, and storage technique (e.g., fresh, frozen, paraffin, formalin).16

When evaluating DNA-based molecular tests, the complexity of the test method, laboratory-to-laboratory variability, and quality control should be assessed. A number of methods are available for genotyping single nucleotide polymorphisms that vary in complexity and potential for polymorphism misclassification.16,24–26 Considering laboratory reporting of internal controls and repetitive experiments can be useful in assessment of overall analytic validity. The method of interpreting test results may influence complexity as well. For example, some tests require visual inspection of electrophoresis gels. Inter-observer variability should be considered for such tests.16,27

Principle 6: Evaluate Case-Control Studies Carefully for Potential Selection Bias

In critical appraisal of any case-control study, it is important to determine whether cases and controls were selected from the same source population. In the case of genetic studies, the geographic location of the population does not suffice. Rather, having cases and controls matched for ethnicity/race or ancestry is important since the frequencies of DNA polymorphisms vary from population to population (i.e., population stratification). It has been noted that many case-control studies of gene-disease associations have selected controls from a population that does not represent the population from which the cases arose.16,17,28–30 In general, only nested case-control studies could have low enough potential for selection bias to provide reliable information.

Principle 7: Determine the Added Value of the Genetic Test over Existing Risk Assessment Approaches

For some scenarios, a number of clinical factors associated with risk assessment or susceptibility may already be well characterized. In such cases, comparative effectiveness reviews should determine the added value of using genetic testing along with known factors compared with using the known factors alone. For example, age, sex, smoking, hypertension, diabetes, and cholesterol are all well-established risk factors for cardiovascular disease. Risk stratification of individuals to determine cholesterol-lowering targets is based on these factors.31 Assessment of newly identified polymorphisms—such as those described on chromosome 9p2132—that may confer increased risk of cardiovascular disease and have potential implications for medical interventions should be evaluated in the context of these known risk factors. In this scenario, investigators should determine the added value of testing for polymorphisms of chromosome 9p21 in addition to known clinical risk factors.

Multiple polymorphisms may be associated with risk of disease, prognosis, or prediction of drug response. In such cases, the effect of multiple polymorphisms can be explored using a multiple regression model. Then, prospective studies would usually be needed to determine whether the model including the genetic tests has clinical utility. For example, VKORC1 and CYP2C9 genotypes have been associated with warfarin dose requirements in multiple regression models. In order to determine whether tests for VKORC1 and CYP2C9 have clinical utility, studies would need to compare the use of a prediction model that contains the genetic tests in combination with known clinical factors that affect warfarin dose (e.g., age, BMI) with the use of clinical factors alone.33–35

Principle 8: Understand Statistical Issues of Particular Relevance to Genetic Tests

Table 1 ACCE Model Questions for Reviews of Genetic Tests6

Full size table

Table 2 Questions for Assessing Preanalytic, Analytic, and Postanalytic Factors for Evaluating Predictive Genetic Tests*

Full size table

Hardy–Weinberg Equilibrium

In population genetics, most allele distributions follow a usual distribution, known as Hardy–Weinberg equilibrium (HWE). Genetic association studies should generally report whether the frequencies of the alleles being evaluated follow HWE. There are a number of reasons that distributions may deviate from HWE, including new mutations, selection, migration, genetic drift, and inbreeding.36 In addition, when numerous polymorphisms are tested for associations with diseases or outcomes, as in many genome-wide association studies, many of them (5%) will deviate from HWE based on chance alone (related to multiple testing).37 Although it is not specific and possibly not sensitive, deviation from HWE may be a clue to bias and genotyping error.37 Analysts should consider whether studies have tested for and reported HWE. A more detailed discussion of this topic as it relates to genetic association studies has been published elsewhere.36,37

Sample Size Calculations

When assessing internal validity of studies, it is important to assess whether sample size calculations appropriately accounted for the number of variant alleles and the prevalence of variants in the population of interest. This is particularly relevant for pharmacogenomic studies evaluating the functional relevance of genetic polymorphisms.38 Such studies often enroll an insufficient number of subjects to account for the number of variant alleles and the prevalence of variants in the population.38

Genetic Association Studies and Multiple Comparisons

Genetic test results are sometimes derived from analytically complex studies that have undergone a very large number of statistical tests. These may be in the form of genome-wide association studies searching for associations between a huge number of genetic polymorphisms and health conditions. Such association studies may launch further understanding of the importance of genetics in relation to a variety of health conditions but should generally be used to generate hypotheses rather than to test hypotheses or to confirm cause-effect relationships.16 Close scrutiny should be applied to ensure that the evidence for the association has been validated in multiple studies to minimize both potential confounding and potential publication bias issues. In addition, reviewers should note whether appropriate adjustments for multiple comparisons were used. Many recommend using a P value of less than 5 × 10⁻⁸ for the threshold of significance in large genome-wide studies.37,39,40 Other approaches include assessing the false positive report probability and controlling the false discovery rate.41–43

When a genetic mutation associated with increased risk is present, evaluating potential causality can be difficult as many other factors may influence associations. These include environmental exposures, behaviors, and other genes. Many genetic variants identified that are thought to influence susceptibility to diseases are associated with low relative and absolute risk.16,44 Thus, exclusion of non-causal explanations for associations and consideration of potential confounders are central to critical appraisal of such associations. It may also be important to explore biologic plausibility (e.g., from in vitro studies) to help support or oppose theories of causation.16

Overlapping Data Sets

Be cautious of publications that report prevalence estimates for genetic variants that have actually arisen from overlapping data sets.16 For example, genome-wide association studies or other large collaborative efforts, such as the International Warfarin Pharmacogenomics Consortium, may pool samples of patients that were previously included in other published studies.3 To the degree possible, investigators should identify overlapping data sets and avoid double-counting. It may be useful to organize evidence tables by study time period and geographic area to identify potential overlapping data sets.16

Assessing Tumor Genetics

As mentioned under Principle 4, it is important to understand that a tumor genome may be in a dynamic state. In addition, tumor specimens will often contain normal cells from the patient. The characteristics of the specimen will influence the sensitivity and operating characteristics of the test. Tests with greater sensitivity may be required when specimens contain both normal cells and tumor cells.

ILLUSTRATIONS

Since the completion of the Human Genome Project, the Hap Map project, and related works, there have been a great number of publications describing the clinical validity of genetic test results (e.g., gene-disease associations), but far fewer studies of the clinical utility. A review of genetic testing for cytochrome P450 polymorphisms in adults with depression treated with selective serotonin reuptake inhibitors (SSRIs) developed an analytic framework and five corresponding key questions which, taken together, provide an example of a well-defined predictive genetic test scenario that explores a potential chain of evidence relating to intermediate outcomes (Figure 3).45 The authors found no prospective studies with clinical outcomes that used genotyping to guide treatment. They constructed a chain of questions to assess whether sufficient indirect evidence could answer the overarching question by evaluating the links between genotype and metabolism of SSRIs (phenotype), metabolism and SSRI efficacy, and metabolism and adverse drug reactions to SSRIs.

An EPC report on HER2 testing to manage patients with breast cancer and other solid tumors provides a detailed assessment of challenges in conducting a definitive evaluation of preanalytic, analytic, and postanalytic factors when there is substantial heterogeneity or lack of available information related to the methods of testing.46 The authors noted that it had been only very recently that many aspects of HER2 assays were standardized, and that the effects of widely varying testing methods could not be isolated. Thus, they approached this challenge by providing a narrative review for their first key question (What is the evidence on concordance and discrepancy rates for methods [e.g., FISH, IHC, etc.] used to analyze HER2 status in breast tumor tissue?).

Additional considerations arise when evaluating genetic test results used to determine susceptibility or risk in asymptomatic individuals. The utility of such tests may depend on the ability of patients and providers to report and identify certain clinical factors. For example, a review of genetic risk assessment and BRCA mutation testing underscores the importance of accurately determining family history.4,47 The analytic framework begins by classifying asymptomatic women into high, moderate, or average risk categories. This is a good example of incorporating a key preanalytic factor (family history), that has an important influence on analytic validity. Tests for BRCA mutations may be used to predict the risk for breast and ovarian cancer in high-risk women (i.e., those with a family history suggesting increased risk). However, because we do not know all of the genes that contribute to hereditary breast and ovarian cancer and because analytic methods to detect mutations in the known genes are not perfect, population-based testing for hereditary susceptibility to breast and ovarian cancer is currently not an appropriate strategy. Rather, family history-based testing is the paradigm that is recommended to guide the use of BRCA testing.4, 47

Thus, family history is a genetic/genomics tool that is used to 1) identify people with possible inherited disease susceptibilities, 2) guide genetic testing strategies, 3) help interpret genetic test results, and 4) assess disease risk. The ability of providers to accurately determine a family history that confers increased risk is a key prerequisite to the utility of BRCA mutation and other predictive genetic testing. It is sometimes difficult for people to accurately recall the presence of a condition in their relatives. Sensitivity and specificity of self-reported family history are important in determining overall usefulness of predictive genetic testing.4

CONCLUSIONS

Analysts should understand common challenges, and apply the principles for addressing those challenges, when conducting systematic reviews of genetic tests used as predictive indicators. Key points include:

1)
The general principles that apply in evaluating genetic tests are similar to those for other prognostic or predictive tests, but there are differences in how the principles need to be applied or the degree to which certain issues are relevant.
2)
A clear definition of the clinical scenario and an analytic framework is important when evaluating any test, including genetic tests.
3)
Organizing frameworks and analytic frameworks are useful constructs for approaching the evaluation of genetic tests.
4)
In constructing an analytic framework for evaluating a genetic test, analysts should consider preanalytic, analytic, and postanalytic factors; such factors are useful when assessing analytic validity.
5)
Predictive genetic tests are generally characterized by a delayed time between testing and clinically important events.
6)
Finding published information on the analytic validity of some genetic tests may be difficult. Web sites (FDA or diagnostic companies) and gray literature may be important sources.
7)
In situations where clinical factors associated with risk are well characterized, comparative effectiveness reviews should assess the added value of using genetic testing along with known factors compared with using the known factors alone.
8)
For genome-wide association studies, reviewers should determine whether the association has been validated in multiple studies to minimize both potential confounding and publication bias. In addition, reviewers should note whether appropriate adjustments for multiple comparisons were used.

REFERENCES

Teutsch SM, Bradley LA, Palomaki GE, et al. The Evaluation of Genomic Applications in Practice and Prevention (EGAPP) initiative: methods of the EGAPP Working Group. Genet Med. 2008.
Methods Guide for Effectiveness and Comparative Effectiveness Reviews. AHRQ Publication No. 10(11)-EHC063-EF. Rockville, MD: Agency for Healthcare Research and Quality; March 2011; Available at: www.effectivehealthcare.ahrq.gov. Accessed August 22, 2011.
Lister Hill National Center for Biomedical Communications: Collections of the National Library of Medicine. What are reduced penetrance and variable expressivity? [electronic resource]; 2008; Available at: http://ghr.nlm.nih.gov/handbook/inheritance/penetranceexpressivity. Accessed August 22, 2011.
Nelson HD, Huffman LH, Fu R, Harris EL. Genetic risk assessment and BRCA mutation testing for breast and ovarian cancer susceptibility: systematic evidence review for the U.S. Preventive Services Task Force. Ann Intern Med. 2005;143(5):362–79.
PubMed CAS Google Scholar
Whitlock EP, Garlitz BA, Harris EL, Beil TL, Smith PR. Screening for hereditary hemochromatosis: a systematic review for the U.S. Preventive Services Task Force. Ann Intern Med. 2006;145(3):209–23.
PubMed Google Scholar
National Office of Public Health Genomics C. ACCE Model Process for Evaluating Genetic Tests. 2007; Available at: http://www.cdc.gov/genomics/gtesting/ACCE/index.htm. Accessed August 22, 2011.
Harris RP, Helfand M, Woolf SH, et al. Current methods of the US Preventive Services Task Force: a review of the process. Am J Prev Med. 2001;20(3 Suppl):21–35.
Article PubMed CAS Google Scholar
Task Force on Genetic Testing (NIH). Promoting Safe and Effective Genetic Testing in the United States. Final Report of the Task Force on Genetic Testing. 1997; Available at: http://www.genome.gov/10001733. Accessed August 22, 2011.
Khoury MJ, Dorman JS. The Human Genome Epidemiology Network. Am J Epidemiol. 1998;148(1):1–3.
Article PubMed CAS Google Scholar
Palomaki GE, Bradley LA, Richards CS, Haddow JE. Analytic validity of cystic fibrosis testing: a preliminary estimate. Genet Med. 2003;5(1):15–20.
Article PubMed Google Scholar
Palomaki GE, Haddow JE, Bradley LA, FitzSimmons SC. Updated assessment of cystic fibrosis mutation frequencies in non-Hispanic Caucasians. Genet Med. 2002;4(2):90–4.
Article PubMed Google Scholar
Palomaki GE, Haddow JE, Bradley LA, Richards CS, Stenzel TT, Grody WW. Estimated analytic validity of HFE C282Y mutation testing in population screening: the potential value of confirmatory testing. Genet Med. 2003;5(6):440–3.
Article PubMed Google Scholar
Sun F, Bruening W, Erinoff E, Schoelles KM. Evaluation Frameworks, Analytic Validity and Quality Rating of Genetic Tests. Evidence Report/Technology Assessment (Prepared by the ECRI Institute Evidence-based Practice Center under Contract No. HHSA 290-2007-10063-I.) AHRQ Pre-publication version. Rockville, MD: Agency for Healthcare Research and Quality; April 2001.
Agency for Healthcare Research and Quality. Genetic Tests for Cancer. Technology Assessment. Rockville, MD: Agency for Healthcare Research and Quality; 2006; Available at: http://archive.ahrq.gov/clinic/ta/gentests/. Accessed August 22, 2011.
Burke W, Atkins D, Gwinn M, et al. Genetic test evaluation: information needs of clinicians, policy makers, and the public. Am J Epidemiol. 2002;156(4):311–8.
Article PubMed Google Scholar
Little J, Bradley L, Bray MS, et al. Reporting, appraising, and integrating data on genotype prevalence and gene-disease associations. Am J Epidemiol. 2002;156(4):300–10.
PubMed Google Scholar
Brockton N, Little J, Sharp L, Cotton SC. N-acetyltransferase polymorphisms and colorectal cancer: a HuGE review. Am J Epidemiol. 2000;151(9):846–61.
Article PubMed CAS Google Scholar
d'Errico A, Malats N, Vineis P, Boffetta P. Review of studies of selected metabolic polymorphisms and cancer. IARC scientific publications. 1999(148):323–93.
Yang M, Hendrie HC, Hall KS, Oluwole OS, Hodes ME, Sahota A. Improved procedure for eluting DNA from dried blood spots. Clin Chem. 1996;42(7):1115–6.
PubMed CAS Google Scholar
Gale KB, Ford AM, Repp R, et al. Backtracking leukemia to birth: identification of clonotypic gene fusion sequences in neonatal blood spots. Proceedings of the National Academy of Sciences of the United States of America. 1997;94(25):13950–4.
Article PubMed CAS Google Scholar
Walker AH, Najarian D, White DL, Jaffe JF, Kanetsky PA, Rebbeck TR. Collection of genomic DNA by buccal swabs for polymerase chain reaction-based biomarker assays. Environ Health Perspect. 1999;107(7):517–20.
Article PubMed CAS Google Scholar
Harty LC, Shields PG, Winn DM, Caporaso NE, Hayes RB. Self-collection of oral epithelial cell DNA under instruction from epidemiologic interviewers. Am J Epidemiol. 2000;151(2):199–205.
Article PubMed CAS Google Scholar
Garcia-Closas M, Egan KM, Abruzzo J, et al. Collection of genomic DNA from adults in epidemiological studies by buccal cytobrush and mouthwash. Cancer Epidemiol Biomarkers Prev. 2001;10(6):687–96.
PubMed CAS Google Scholar
Hixson JE, Vernier DT. Restriction isotyping of human apolipoprotein E by gene amplification and cleavage with HhaI. J Lipid Res. 1990;31(3):545–8.
PubMed CAS Google Scholar
Tobe VO, Taylor SL, Nickerson DA. Single-well genotyping of diallelic sequence variations by a two-color ELISA-based oligonucleotide ligation assay. Nucleic Acids Res. 1996;24(19):3728–32.
Article PubMed CAS Google Scholar
Lee LG, Connell CR, Bloch W. Allelic discrimination by nick-translation PCR with fluorogenic probes. Nucleic Acids Res. 1993;21(16):3761–6.
Article PubMed CAS Google Scholar
Bogardus ST Jr, Concato J, Feinstein AR. Clinical epidemiological quality in molecular genetic research: the need for methodological standards. JAMA. 1999;281(20):1919–26.
Article PubMed Google Scholar
Botto LD, Yang Q. 5,10-Methylenetetrahydrofolate reductase gene variants and congenital anomalies: a HuGE review. Am J Epidemiol. 2000;151(9):862–77.
Article PubMed CAS Google Scholar
Dorman JS, Bunker CH. HLA-DQ locus of the human leukocyte antigen complex and type 1 diabetes mellitus: a HuGE review. Epidemiol Rev. 2000;22(2):218–27.
Article PubMed CAS Google Scholar
Cotton SC, Sharp L, Little J, Brockton N. Glutathione S-transferase polymorphisms and colorectal cancer: a HuGE review. Am J Epidemiol. 2000;151(1):7–32.
Article PubMed CAS Google Scholar
National Cholesterol Education Program. Third Report of the National Cholesterol Education Program (NCEP) Expert Panel on Detection, Evaluation, and Treatment of High Blood Cholesterol in Adults (Adult Treatment Panel III) final report; 2002 Dec 17. Report No.: 1524–4539 (Electronic).
Schunkert H, Gotz A, Braund P, et al. Repeated replication and a prospective meta-analysis of the association between chromosome 9p21.3 and coronary artery disease. Circulation. 2008;117(13):1675–84.
Article PubMed Google Scholar
Gage BF, Eby C, Milligan PE, Banet GA, Duncan JR, McLeod HL. Use of pharmacogenetics and clinical factors to predict the maintenance dose of warfarin. Thromb Haemost. 2004;91(1):87–94.
PubMed CAS Google Scholar
Gage BF, Lesko LJ. Pharmacogenetics of warfarin: regulatory, scientific, and clinical issues. J Thromb Thrombolysis. 2008;25(1):45–51.
Article PubMed CAS Google Scholar
Jonas DE, McLeod HL. Genetic and clinical factors relating to warfarin dosing. Trends Pharmacol Sci. 2009;30(7):375–86.
Article PubMed CAS Google Scholar
Attia J, Ioannidis JP, Thakkinstian A, et al. How to use an article about genetic association: A: Background concepts. JAMA. 2009;301(1):74–81.
Article PubMed CAS Google Scholar
Attia J, Ioannidis JP, Thakkinstian A, et al. How to use an article about genetic association: B: Are the results of the study valid? JAMA. 2009;301(2):191–7.
Article PubMed CAS Google Scholar
Williams JA, Johnson K, Paulauskis J, Cook J. So many studies, too few subjects: establishing functional relevance of genetic polymorphisms on pharmacokinetics. J Clin Pharmacol. 2006;46(3):258–64.
Article PubMed CAS Google Scholar
Hoggart CJ, Clark TG, De Iorio M, Whittaker JC, Balding DJ. Genome-wide significance for dense SNP and resequencing data. Genetic epidemiology. 2008;32(2):179–85.
Article PubMed Google Scholar
McCarthy MI, Abecasis GR, Cardon LR, et al. Genome-wide association studies for complex traits: consensus, uncertainty and challenges. Nature reviews. 2008;9(5):356–69.
Article PubMed CAS Google Scholar
Benjamini Y, Drai D, Elmer G, Kafkafi N, Golani I. Controlling the false discovery rate in behavior genetics research. Behav Brain Res. 2001;125(1–2):279–84.
Article PubMed CAS Google Scholar
Wacholder S, Chanock S, Garcia-Closas M, El Ghormli L, Rothman N. Assessing the probability that a positive report is false: an approach for molecular epidemiology studies. J Natl Cancer Inst. 2004;96(6):434–42.
Article PubMed Google Scholar
Ziegler A, Konig IR, Thompson JR. Biostatistical aspects of genome-wide association studies. Biom J. 2008;50(1):8–28.
Article PubMed Google Scholar
Caporaso N. Selection of candidate genes for population studies. In: Vineis P, Malats N, Lang M, et al, editors. Metabolic polymorphisms and susceptibility to cancer. Lyon, France: IARC Monogr Eval Carcinog Risks Hum; 1999. p. 23–36.
Matchar DB, Thakur ME, Grossman I, et al. Testing for cytochrome P450 polymorphisms in adults with non-psychotic depression treated with selective serotonin reuptake inhibitors (SSRIs). Evid Rep Technol Assess (Full Rep). 2007(146):1–77.
Seidenfeld J, Samson DJ, Rothenberg BM, Bonnell CJ, Ziegler KM, Aronson N. HER2 Testing to Manage Patients With Breast Cancer or Other Solid Tumors/Technology Assessment No. 172. Rockville, MD: (Prepared by Blue Cross and Blue Shield Association Technology Evaluation Center Evidence-based Practice Center, under Contract No. 290-02-0026.) 2008.
U.S. Preventive Services Task Force. Genetic risk assessment: recommendation statement. Genetic risk assessment and BRCA mutation testing for breast and ovarian cancer susceptibility: recommendation statement. Ann Intern Med. 2005;143(5):355–61.
Google Scholar

Download references

ACKNOWLEDGEMENTS

We would like to thank Halle R. Amick (University of North Carolina, Cecil G. Sheps Center for Health Services Research) and Crystal M. Riley (Duke-NUS Graduate Medical School Singapore) for their assistance with preparation of this manuscript, insightful editing, and outstanding attention to detail. We deeply appreciate the considerable support, commitment, and contributions of Stephanie Chang, MD, MPH, the AHRQ Task Order Officer for this project and the Evidence-based Practice Center Director.

This project was funded under contract HHSA290200710056I #1 from the Agency for Healthcare Research and Quality (AHRQ), U.S. Department of Health and Human Services.

The expressed views are the authors’ and do not necessarily represent the Agency for Healthcare Research and Quality, the U.S. Department of Health and Human Services, or the Veterans Health Administration.

Conflicts of interest

None disclosed.

Author information

Authors and Affiliations

Department of Medicine, Cecil G. Sheps Center for Health Services Research, and Institute for Pharmacogenomics and Individualized Therapy, University of North Carolina, 5034 Old Clinic Building, CB #7110, Chapel Hill, NC 27599, Chapel Hill, NC, USA
Daniel E. Jonas MD, MPH
Center for Chronic Disease Outcomes Research, Department of Veterans Affairs Health Care System, Minneapolis, MN, USA
Timothy J. Wilt MD, MPH & Brent C. Taylor PhD, MPH
Department of Medicine, University of Minnesota, Minneapolis, MN, USA
Timothy J. Wilt MD, MPH & Brent C. Taylor PhD, MPH
Cecil G. Sheps Center for Health Services Research, University of North Carolina, Chapel Hill, NC, USA
Tania M. Wilkins MS
Duke-NUS Graduate Medical School Singapore, Health Services and Systems Research, Singapore, Singapore
David B. Matchar MD
Department of Medicine, Duke University Medical Center, Durham, NC, USA
David B. Matchar MD

Authors

Daniel E. Jonas MD, MPH
View author publications
You can also search for this author in PubMed Google Scholar
Timothy J. Wilt MD, MPH
View author publications
You can also search for this author in PubMed Google Scholar
Brent C. Taylor PhD, MPH
View author publications
You can also search for this author in PubMed Google Scholar
Tania M. Wilkins MS
View author publications
You can also search for this author in PubMed Google Scholar
David B. Matchar MD
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Daniel E. Jonas MD, MPH.

Rights and permissions

Open Access This is an open access article distributed under the terms of the Creative Commons Attribution Noncommercial License ( https://creativecommons.org/licenses/by-nc/2.0 ), which permits any noncommercial use, distribution, and reproduction in any medium, provided the original author(s) and source are credited.

Reprints and permissions

About this article

Cite this article

Jonas, D.E., Wilt, T.J., Taylor, B.C. et al. Chapter 11: Challenges in and Principles for Conducting Systematic Reviews of Genetic Tests used as Predictive Indicators. J GEN INTERN MED 27 (Suppl 1), 83–93 (2012). https://doi.org/10.1007/s11606-011-1898-z

Download citation

Published: 31 May 2012
Issue Date: June 2012
DOI: https://doi.org/10.1007/s11606-011-1898-z

KEY WORDS

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Chapter 11: Challenges in and Principles for Conducting Systematic Reviews of Genetic Tests used as Predictive Indicators

Abstract