Defining Clinical Trial Estimands: A Practical Guide for Study Teams with Examples Based on a Psychiatric Disorder

Polverejan, Elena; O’Kelly, Michael; Hefting, Nanco; Norton, Jonathan D.; Lim, Pilar; Walton, Marc K.

doi:10.1007/s43441-023-00524-2

Defining Clinical Trial Estimands: A Practical Guide for Study Teams with Examples Based on a Psychiatric Disorder

Analytical Report
Open access
Published: 27 May 2023

Volume 57, pages 911–939, (2023)
Cite this article

Download PDF

You have full access to this open access article

Therapeutic Innovation & Regulatory Science Aims and scope Submit manuscript

Defining Clinical Trial Estimands: A Practical Guide for Study Teams with Examples Based on a Psychiatric Disorder

Download PDF

Elena Polverejan ORCID: orcid.org/0000-0002-2813-4364¹,
Michael O’Kelly²,
Nanco Hefting³,
Jonathan D. Norton⁴,
Pilar Lim¹ &
…
Marc K. Walton⁵

4621 Accesses
1 Altmetric
Explore all metrics

Abstract

While the ICH E9(R1) Addendum on “Estimands and Sensitivity Analysis in Clinical Trials” was released in late 2019, the widespread implementation of defining and reporting estimands across clinical trials is still in progress and the engagement of non-statistical functions in this process is also in progress. Case studies are sought after, especially those with documented clinical and regulatory feedback. This paper describes an interdisciplinary process for implementing the estimand framework, devised by the Estimands and Missing Data Working Group (a group with clinical, statistical, and regulatory representation) of the International Society for CNS Clinical Trials and Methodology. This process is illustrated by specific examples using various types of hypothetical trials evaluating a treatment for major depressive disorder. Each of the estimand examples follows the same template and features all steps of the proposed process, including identifying the trial stakeholder(s), the decisions they need to make about the investigated treatment in their specific role and the questions that would support their decision making. Each of the five strategies for handling intercurrent events are addressed in at least one example; the featured endpoints are also diverse, including continuous, binary and time to event. Several examples are presented that include specifications for a potential trial design, key trial implementation elements needed to address the estimand, and main and sensitivity estimator specifications. Ultimately this paper highlights the need to incorporate multi-disciplinary collaborations into implementing the ICH E9(R1) framework.

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

Clinical trials were traditionally planned as follows: a general trial objective was stated, then the trial design, analysis sets, and statistical methods determined how the treatment effect was estimated. This approach was not optimal, because the definition of what was being estimated by the trial was either not stated clearly or not stated at all.

The ICH E9(R1) Addendum [1] on “Estimands and Sensitivity Analysis in Clinical Trials”, released in 2019, (hereafter referred to as “the Addendum”) recommends a change in the process of planning, design, conduct and reporting of clinical trials. The Addendum emphasizes that to properly inform decision-making by various stakeholders and to provide clear descriptions of benefits and risks of a treatment, it is important to have precise descriptions of the treatment effects of interest reflecting clinical questions posed by trial objectives (i.e., the estimands) that are clearly understood and relevant to support the decision(s) to be made by the stakeholders. Estimands must be documented in the protocol; trial design and all aspects of trial conduct and the planned analyses flow from their specification. As pragmatic considerations may impinge on the feasibility of estimating a specified estimand, this process will, in practice, be iterative.

The Estimands and Missing Data Working Group of the International Society for CNS Clinical Trials and Methodology (ISCTM Estimand WG) includes members representing both clinical and statistical functions, with both trial and regulatory experience. This working group had the objective to develop an interdisciplinary process for implementing the estimand framework in the planning stage of a clinical trial. The current paper describes such a process, illustrated by specific examples using hypothetical trials evaluating a treatment for major depressive disorder (MDD). The description of this process and the examples are intended to be a practical aid to clinical trial teams in applying the recommendations of the Addendum to clinical trials across many disease areas.

Section "Process for Selecting and Constructing Estimands" of this paper describes the recommended process for selecting and constructing estimands and highlights key points regarding the estimand attributes. Section "Process for Selecting an Estimator Aligned with an Estimand" describes the process of selecting an estimator aligned with an estimand. Section "Estimand Examples for Major Depressive Disorder" presents multiple examples of estimands for MDD, some with examples of aligned estimators. Section "Discussion" includes discussion points and further thoughts on this topic.

Process for Selecting and Constructing Estimands

As noted in the Addendum, the purpose of a study is to support decision-making by one or more stakeholders who will use the study results. The precise question(s) each stakeholder needs to answer to support their decision-making can be different, and thus different estimands could be defined for each stakeholder identified for a trial.

The ISCTM Estimand WG recommends the following steps in applying the estimand framework:

Identify stakeholder(s)
State decision(s) to be made by each stakeholder
Define objective(s)
Under each objective supporting main decision making:
- Formulate the clinical question of interest:
  - Consider the clinical context
  - Consider potential intercurrent events (ICEs) and how they relate to the question
- Define the corresponding estimand
- Justify the utility of the selected question and corresponding estimand to the specific stakeholder(s).

This process may, in practice, be iterative. If an estimand is determined not to be estimable, a relevant alternative question of interest that is aligned with the selected objective should be sought.

Identify Stakeholder(s) and Decision(s) to be Made

There are often a variety of stakeholders who will make decisions based on the results of a clinical trial. Health authority agencies (HAAs, such as FDA, EMA, Health Canada, PMDA etc.) might for example need to decide whether a study contributes substantial evidence of short-term efficacy for a new treatment or that a new treatment is effective as maintenance treatment after an initial short-term response. A company developing a new drug might for example need to determine whether a study provides enough evidence of efficacy to decide on continuing its development. Payers might need to determine whether a study contributes substantial evidence of clinically meaningful patient-level benefit for a new drug or whether the decision to prescribe a new drug is more clinically effective over a long-term period than the decision to prescribe another well-established drug. Eventually payers make decisions on whether to include a drug in a formulary, and what level of payment to provide in relation to available products. Physicians and patients will need enough information to enable their individual decision-making on starting a treatment. This might include answering the questions: what benefit can be expected in patients who could adhere to treatment? How likely is it that the treatment would be adhered to?

Estimand examples in "Estimand Examples for Major Depressive Disorder" section highlight the variety of stakeholders for a study and the decisions they need to make. While these examples highlight decisions on the efficacy of a new treatment, such decisions are complemented in practice by those based on safety and risk–benefit evaluations.

Define an Objective(s)

Each objective should support the stakeholder’s decision making. For example, if the decision for a HAA is to determine if the study contributes substantial evidence of efficacy for a new monotherapy drug for MDD, the following objective supports this decision (see Estimand 1 example in "Estimand Examples for Major Depressive Disorder" section): To assess the superiority of new drug versus placebo in short-term symptom reduction when given as monotherapy treatment in MDD patients. The statistical hypotheses for an endpoint (e.g., superiority or non-inferiority) or the statistical decision rules (e.g., Go/No Go decision rules) relate to the chosen objectives. A trial objective should mention both the treatment conditions that are being compared and the target population for treatment, both being attributes of an estimand (as discussed in "Define the Estimand" section).

Multiple objectives typically inform each stakeholder’s decision making. Protocol templates [2, 3] require that the included objectives reference all endpoints selected for the trial. These objectives are usually prioritized for the trial as primary, key secondary, other secondary or exploratory to distinguish those used for main decisions (primary and key secondary), and those that have supportive or other roles. This distinction is especially important in the regulatory setting. Of note, it is possible for multiple objectives to reference the same endpoint (e.g., for different target populations).

Formulate the Clinical Question of Interest, Define the Corresponding Estimand, and Justify Their Utility to the Stakeholder

As mentioned above, an objective is a general statement of what supports a stakeholder’s decision. The clinical question of interest is a meaningful and concise definition of the treatment effect, best formulated using natural, non-technical language for easy comprehension; it is paired with a formal, detailed definition of the corresponding estimand. They must be relevant to the stakeholder and have their utility justified. All the estimand examples from "Estimand Examples for Major Depressive Disorder" section include these three components.

Formulate the Clinical Question of Interest

The formulation of the clinical question of interest must consider the clinical context of use. This involves consideration of:

Target population (including typical comorbidities and behaviors)
Treatment and comparators pertinent to that context and population (including the availability and effectiveness of alternative treatments in the target population)
Outcome of interest, reflecting the qualitative aspect of the treatment effect (e.g., achieving or avoiding a certain discrete outcome such as treatment success or failure, time to an outcome, change in a continuous score) as well as its temporal aspect (e.g., effect at a fixed time point, over a fixed period, at a variable point in time, over a variable period).

When these have been carefully specified, potential intercurrent events (ICEs) can be considered. ICEs [1] are defined as events occurring after treatment initiation that affect either the interpretation or the existence of the measurements associated with the clinical question of interest (e.g. treatment discontinuation, starting alternative treatments, death; see Sect. Identify ICEs). Once the ICEs pertinent to the clinical context are identified, a study team can formulate a precise clinical question of interest, for example “For a patient with MDD, what would be the expected effect of prescribing drug X on depression severity at Week 8, were no other antidepressant medications available?” While this target treatment effect will be formalized in the estimand definition, formulating the clinical question of interest is an important step as it allows a cross-disciplinary discussion in the study team.

The clinical question of interest formulation needs to capture a clear, specific treatment effect of interest relative to each group of identified ICEs. When the estimand is defined (see Sect. "Define the Estimand"), estimand attributes including the strategies selected for the identified ICEs (see Sect. ICE-Handling Strategies, Table 1) will be linked to the clinical question of interest. Examples of types of clinical question of interest formulations (implying different ICE strategies) are presented below:

Treatment effect under the assignment to either experimental treatment or placebo, regardless of ICE—Treatment policy strategy
Treatment effect under a counterfactual scenario (e.g., as if patients would continue treatment as assigned or as if patients would not start other pharmacological treatments for MDD as they were not available)—Hypothetical strategy
Treatment effect on the likelihood of a patient experiencing a treatment response, where the response definition incorporates the ICE (e.g., patient with ICE is considered as non-responder)—Composite Variable strategy
Treatment effect while treatment is being taken—While on treatment strategy
Treatment effect in a stratum of patients who would/would not experience the ICE (e.g., in MDD patients who would adhere to drug X as prescribed for Y weeks)—Principal Stratum strategy.

The examples above are not exhaustive; other language and formulations that link to different ICE strategies could also be used in the question of interest.

The question should be formulated concisely as possible to serve as a guide for the specification of the estimand. Therefore, when formulating the clinical question of interest, some attributes of the corresponding estimand need not be detailed (e.g., exact endpoint, such as the method/scale of capturing depression severity, or exact population-level summary) or may be implied by the description of the effect (e.g., “expected effect” may imply that the population-level summary will be a difference of means).

Define the Estimand

The estimand is a formal, operationalized expression of the clinical question of interest, constructed with the following attributes (see Section A.3.3 of the Addendum):

Treatment condition of interest and Alternative treatment condition The interventions being compared. Here, not only the experimental treatment (versus control, if applicable) should be specified but the planned treatment regimen as a whole, including (if applicable) the recommended use of additional or background treatment and/or the strategies for handling ICEs related to the treatment regimen.
Population The population targeted by the clinical question of interest. (It can also reflect a population defined by membership in a principal stratum—see Table 1 for definition of the Principal Stratum strategy). This differs from the analysis set (e.g., all randomized participants), referred to in the past as the analysis population, which should be described under the estimator specifications.
Variable (or endpoint) A value that can be measured in individual patients that is required to address the clinical question, e.g., change from baseline to time X in a measure, time to an event, a binary responder variable. It cannot be a proportion, for example, as this cannot be measured per patient. It can take into account ICEs if the Composite Variable strategy is used, or it can reflect the patient-dependent treatment duration if the While on Treatment strategy is used.
Population-level summary The population-level quantity (derived from the patient-level Variable) that provides a basis for comparisons between treatment conditions and quantifies the treatment effect.
ICEs and corresponding strategies Here, strictly speaking, only the ICEs not covered in the other attributes should be specified together with the strategies used to handle them. However, to improve clarity in this implementation phase, we prefer to list all ICEs and corresponding strategies, including those reflected in other estimand attributes. Patients could experience overlapping ICEs and, if these ICEs are addressed with different strategies, the priority order of applying these strategies must be specified. This will depend on the clinical context; for example, the composite variable strategy will most likely have a higher priority over strategies such as treatment policy or hypothetical (see Sect. ICE-Handling Strategies).

The Addendum recommends at a minimum that estimands for all trial objectives that are likely to support regulatory decisions (such as those related to primary and key secondary endpoints) be defined and specified explicitly. If the trial is to serve multiple stakeholders with different questions of interest, estimands for each stakeholder should be formulated in the protocol or in other prospectively written associated documents. A particular estimand might be of interest to multiple stakeholders, as reflected in some of the estimand examples from "Estimand Examples for Major Depressive Disorder" section.

The following sub-sections provide additional details on the identification of ICEs and on the types of available strategies for addressing ICEs.

Identify ICEs

All foreseeable ICEs that are likely to be relevant for a trial are to be identified when planning the trial (see Section A.3.1. of the Addendum). The applicable ICEs depend on the specific setting of the trial, but the following is a list of ICEs that are often encountered based on authors’ experience:

ICEs related to the study treatment:
- Treatment discontinuation (Tx DC)
- Change in planned dosage or frequency of administration
- Treatment non-adherence (i.e., intermittent or partial adherence)
ICEs related to initiation, adjustment or discontinuation of treatments that are concomitantly taken with the study treatment and may influence the outcome of interest
Changes in how the outcome of interest is measured (e.g., use of uncertified rater or scale, switching to remote assessment)
ICEs precluding the existence of values after the event, such as death.

Events could also occur that impact the validity or interpretability of the outcome measurement tool. For example, a cerebrovascular accident could reduce the reliability of assessment of psychomotor impairments attributable to a major depressive episode.

Disease specific regulatory guidance documents for Industry have started to recommend ICEs of interest and strategies to address them, such as the FDA guidance [4] for Chronic Rhinosinusitis with Nasal Polyps or the EMA Guideline [5] on the clinical investigation of medicines for the treatment of Alzheimer’s disease.

On rare occasions a major unforeseen source of ICEs may occur. For example, at the time of writing, clinical trials are being impacted by the COVID-19 pandemic and by the war in Ukraine, resulting in disruption to the provision of drugs, changes to methods of assessment, but also affecting the health of the study subjects, and leading to changes in circumstances (individual or societal) affecting the relationship between disease severity and impairment of function or the reliability or validity of measures designed for use under normal social conditions. In these situations, protocols and other study documents such as Statistical Analysis Plans (SAPs) must be amended to address these unforeseen, major, broadly occurring ICEs [6,7,8,9].

Each type of ICE could be considered as a unified event or could be further divided into sub-categories. For example, Tx DC due to different reasons (e.g., due to adverse events, lack of efficacy, or other reasons, such as site closures or other administrative reasons) could be considered as one or as different ICEs depending on reason for Tx DC; likewise different severities of the same event such as low/moderate versus severe treatment non-adherence could be considered separately. Different strategies could then be used if these different events are addressed differently in the clinical question of interest.

ICEs are not synonymous with missing data. Indeed, it is usually desirable to collect data after ICEs, and there are data that are missing without (known) occurrence of ICEs. Study withdrawal is not considered by the Addendum as an ICE. Rather, it is a study event leading to missing data (i.e., data that would be meaningful for the analysis of a given estimand but were not collected). Some ICEs might be immediately followed by missing data (which could also be intermittent), while others not. The ICE of death cannot lead to missing data as no measurements exist and can be collected after death.

ICE-Handling Strategies

ICEs can be addressed by several potential strategies that are described in Section A.3.2. of the Addendum. Table 1 describes each of the five strategies, points to consider on the use of each strategy, and additional considerations on estimation (see Sect. Process for Selecting an Estimator Aligned with an Estimand on the process for selecting an estimator aligned with an estimand). The formulation of the clinical question of interest should drive the selection of strategies addressing the identified ICEs. This requires a collaborative effort across disciplines and is not an exercise for statisticians only.

Table 1 ICH E9(R1) strategies of addressing an intercurrent event

Full size table

Process for Selecting an Estimator Aligned with an Estimand

For each of the estimands, an aligned method of analysis, or estimator [1], should be implemented that is able to provide an estimate on which reliable interpretation can be based.

Once an estimand is defined and the aligned estimator is selected with the chosen assumptions, the following elements are recommended to be included in the estimator specification:

Define the estimand and estimator aligned analysis set, specifying not only what trial participants are included (e.g., all randomized) but the selection of measurements to be used for each participant.

Here, specify what data are not used or missing or sometimes not existing, including:

• Data not used—Data that may be collected but are not used for the estimator chosen for this estimand, for example the endpoint values collected after an ICE and replaced by imputation;

• Missing data—Data that would have been useful but could not be collected (e.g., due to withdrawal from the study or intermittent missing)—considered the “true” missing data by the Addendum;

• Data not existing—such as data after death or, for Principal Stratum estimators, data on the occurrence of ICEs had the patient been assigned to other treatment instead.

Specify the main estimator for this estimand, including:

• Assumptions for data not used and missing data; these assumptions, whether the data is treated as missing due to an ICE or simply missing because not collected, inform the scenarios analyzed by the statistical model, and may for example lead to censoring, imputation or generation of a composite outcome.

• Statistical model and its assumptions (e.g. proportional hazard assumption for Cox regression).

Specify the sensitivity estimator(s) for this estimand, ensuring that the same estimand is targeted and stating how elements and assumptions differ from those of the main estimator.

Extensive details on selecting estimators aligned with an estimand are provided in Mallinckrodt et al. [20]. Of note, as this is a rapidly evolving field, it is likely that any recommendations beyond those of principle could be superseded. Mitroiu et al. [21] provided a summary of what analysis methods have been commonly used in short-term depression studies, mapping estimands to these methods.

The main estimator produces an estimate for the estimand population-level summary, a clinically understandable estimate of the amount of clinical benefit (or risk, for a safety variable) that was associated with the treatment. This is often loosely referred to as the ‘study result’. As mentioned in Section "Define an Objective(s)", an objective often includes the statistical hypotheses for an endpoint (e.g., superiority or non-inferiority) or the statistical decision rules. Ideally, the analysis used for decision making should be same as the main estimator or at least with similar assumptions. However, it is possible for the analysis used for decision making to be different than the main estimator, especially for the binary and time to event endpoints. As an example, the population-level summary of hazard ratio for a time to event endpoint can estimate the amount of benefit and be derived from the Cox proportional hazard model and the decision-making of superiority can be based on the p-value from the log-rank test. Further research [22,23,24] is currently being done on constructing time to event methods that could be used for both the main estimator and decision-making.

Section "Estimand Examples for Major Depressive Disorder" includes several examples of estimator specifications.

Estimand Examples for Major Depressive Disorder

The ISCTM Estimand WG chose MDD to exemplify the process to select and construct estimand, knowing that:

It is highly prevalent [25, 26] and extensively studied, with widely accepted endpoints.
Nevertheless, it is a complex indication to pursue, with many challenges, including high treatment dropout rates.
Many issues encountered in defining estimands in clinical trials of treatment for MDD can be generalized and applied to clinical trials in many other disease areas. These issues include a relatively high number of discontinuations from treatment, (partial) compliance, and starting other pharmacological treatments for MDD that could influence the trial outcomes.

MDD is defined in the Diagnostic and Statistical Manual of Mental Disorders, 5th Edition (DSM-5-TR) [27], by the occurrence of one or more major depressive episodes. Such episodes must be of at least 2 weeks duration, with at least five of nine specified symptoms co-occurring during that period, not attributable to other causes, and leading to impairment of function compared to a state prior to symptom onset. These episodes comprise a primary symptom of subjective or observed persistence and prevalence of either (1) depressed mood (i.e., sad, empty, or hopeless) or (2) markedly diminished interest or pleasure in almost all activities, and additional potential symptoms of (3) spontaneous loss of appetite or weight, (4) insomnia or hypersomnia, (5) fatigue, (6) observable psychomotor retardation or agitation, (7) impairment in ability to think, concentrate, or make decisions, (8) inappropriate feelings of worthlessness or guilt, and (9) recurrent thoughts of death, particularly suicide.

The symptomatic presentations and durations of episodes, and presence, frequency, and patterns of recurrence, as well as level of subsyndromal inter-episodic symptoms are all highly variable both between and within individuals. Thus, pertinent features of MDD as a clinical entity that may impact the choice of estimand in a clinical trial are:

No single common pathophysiology—samples may comprise pathophysiologic subpopulations that inform patient strata.
Episodes may be characterized by multiple symptom dimensions [28]—outcome measures must be appropriately responsive to differential treatment effects on symptom dimensions.
Typical symptoms may differ depending on patient age (e.g., more negative valence system symptoms in younger adults, more prominent positive valence system deficits in older adults) [28]—such differences may inform selection of outcome measures and characterization of patient strata.
Episodes can have gradual or abrupt onset and offset and duration ranges widely from a defined minimum of 2 weeks, to over a year [29]—consideration of such features is important for time-based elements of study endpoints.
Episode duration may also differ depending on patient age [30].
Episode recurrence rates are variable [29]—consideration of such features is important for time-based elements of study endpoints and relevant ICEs.

For the evaluation of monotherapy treatment, short-term, placebo-controlled trials with or without an active reference arm are the usual standard. The short-term, acute treatment trials are typically followed by long-term, randomized withdrawal trials. Drugs may also be developed to be used as adjunctive treatments to existing antidepressant therapy. The MDD estimand examples in this section are presented in the following type of context:

Short-term monotherapy MDD treatment
Maintenance monotherapy MDD treatment
Short-term adjunctive MDD treatment
Maintenance adjunctive treatment in patients with treatment resistant MDD (TRD).

The MDD examples included in this section follow the estimand framework steps recommended in Section "Process for Selecting and Constructing Estimands". Some of the examples include specifications for a potential trial design, key trial implementation elements needed to address the estimand, and main and sensitivity estimator specifications that include the elements recommended in Sect. Process for Selecting an Estimator Aligned with an Estimand. It is important to emphasize that the presented estimand and estimator examples are not to be taken as guidance; estimand attributes could be described differently and some of the included elements are subject to further research, especially in the field of aligning estimand and estimators. Each of the five strategies for handling ICEs is addressed in at least one example; all examples are considered to be applicable to MDD, based on the authors’ experience.

Estimands 7a and 7b:

The following estimand examples from the context of maintenance add-on/adjunctive treatment in MDD were inspired by the LQD study description from Marwood et al. [39]. They do not reflect exactly this trial original objectives and are provided as an example of estimands that complement each other. As a different example from same context, an estimand that could be aligned with the randomized withdrawal trial presented in Brunner et al. [40] could have common elements with Estimand 5 so it has not been used as an additional example for this manuscript.

Estimands 7a and 7b, defined in the following, could either be considered co-primary estimands (if the objective is to show superiority on both) or one could be considered primary and the other supplementary.

Discussion

This paper describes an interdisciplinary process for implementing the estimand framework proposed by the ISCTM Estimand WG, a group that represents both clinical and statistical functions. Building on Bell et al. [41] and Ratitch et al. [42, 43], we expand the “thinking process” outlined in the ICH E9(R1) official training material [44] by considering the trial stakeholder(s), the decisions they need to make and the questions that would support their decision making. Study teams are encouraged to justify how answering the proposed questions of interest would support stakeholder decision-making.

The thinking process proposed is reflected in multiple examples using hypothetical trials evaluating a treatment for MDD. While this process is relevant to any therapeutic setting, all examples have been chosen to be applicable to this disease state, based on the authors’ experience.

While multiple estimand examples have been included for a given context, such as short-term monotherapy treatment in MDD, each example followed the recommended process, with clarity on the stakeholder, the decision to be made and the corresponding objective and question of interest. This is different from the previous practice (that the Addendum aims to curtail) of running multiple “sensitivity analyses”, without thought to what they estimate and their usefulness and purpose. With regard to sensitivity analyses, the Addendum recommends instead a structured approach to stress-test the assumption of the main estimator. This has been reflected in the sensitivity analyses exemplified in this paper.

In this paper we focus on the process of defining the estimand itself and do not directly address in detail the implications for the study procedures. However, the defined estimands will be reflected in the design of a study, from consent form through duration and level of follow-up to final analysis. For example, we note that selecting the estimand will lead the study team to consider logistical elements of study including.

the burden of the study for participants (the duration of follow-up, the number of visits, complexity of data collection)
whether to continue follow-up after an ICE (e.g., possibility of subjects remaining in the study after ICEs such as discontinuation of study treatment)
flexibility to collect some but not all protocol assessments after treatment discontinuation or other ICE

Ultimately this paper highlights the need to incorporate multi-disciplinary collaborations into implementing the ICH E9(R1) framework and provides extensive examples on how this can be accomplished. The process described includes the element of estimand justification to foster alignment within study teams, to ensure that trials will provide answers to the most relevant clinical questions for key trial stakeholders.

References

ICH E9 (R1) addendum on estimands and sensitivity analysis in clinical trials to the guideline on statistical principles for clinical trials. International Council for Harmonisation of Technical Requirements for Pharmaceuticals for Human Use (ICH). Updated Nov 20 2019. https://database.ich.org/sites/default/files/E9-R1_Step4_Guideline_2019_1203.pdf. Accessed Sept 7 2022
Protocol template for phase 2 and 3 clinical trials that require FDA-IND or IDE application. National Institutes of Health (NIH). Updated Apr 7 2017. https://grants.nih.gov/policy/clinical-trials/protocol-template.htm. Accessed Sept 7 2022
Common Protocol Template (CPT). TransCelerate BioPharma INC, Clinical Content & Reuse Solutions. Updated 2021. https://www.transceleratebiopharmainc.com/assets/clinical-content-reuse-solutions/. Accessed Sept 7 2022
Chronic Rhinosinusitis with Nasal Polyps: Developing Drugs for Treatment, Guidance for Industry. U.S. Food and Drug Administration (FDA). Updated Dec 16 2021. https://www.fda.gov/regulatory-information/search-fda-guidance-documents/chronic-rhinosinusitis-nasal-polyps-developing-drugs-treatment. Accessed Feb 27 2023
Guideline on the clinical investigation of medicines for the treatment of Alzheimer’s disease. European Medicines Agency (EMA). Updated Sept 1 2018. https://www.ema.europa.eu/en/documents/scientific-guideline/guideline-clinical-investigation-medicines-treatment-alzheimers-disease-revision-2_en.pdf. Accessed Mar 7 2023
Guidance to Sponsors on How to Manage Clinical Trials During the COVID-19 Pandemic. European Medicines Agency Committee for Medicinal Products for Human Use (EMA/CHMP). Updated Mar 20 2020. https://www.ema.europa.eu/en/documents/press-release/guidance-sponsors-how-manage-clinical-trials-during-covid-19-pandemic_en.pdf. Accessed Sept 7 2022
Points to Consider on Implications of Coronavirus Disease (COVID-19) on Methodological Aspects of Ongoing Clinical Trials. European Medicines Agency Committee for Medicinal Products for Human Use (EMA/CHMP). Updated Jun 26 2020. https://www.ema.europa.eu/en/documents/scientific-guideline/points-consider-implications-coronavirus-disease-covid-19-methodological-aspects-ongoing-clinical_en-0.pdf. Accessed Sept 7 2022
Guidance on Conduct of Clinical Trials of Medical Products During COVID-19 Public Health Emergency. U.S. Food and Drug Administration (FDA). Updated Aug 30 2021. https://www.fda.gov/media/136238/download. Accessed Sept 7 2022
Points to consider on the impact of the war in Ukraine on methodological aspects of ongoing clinical trials. European Medicines Agency Committee for Medicinal Products for Human Use (EMA/CHMP). Updated Apr 13 2022. https://www.ema.europa.eu/en/documents/scientific-guideline/points-consider-impact-war-ukraine-methodological-aspects-ongoing-clinical-trials_en.pdf. Accessed Sept 7 2022
Fletcher C, Hefting N, Wright M, et al. Marking 2-years of new thinking in clinical trials: the estimand journey. Therap Innov Regul Sci. 2022;56(4):637–50. https://doi.org/10.1007/s43441-022-00402-3.
Article CAS Google Scholar
Guizzaro L, Pétavy F, Ristl R, Gallo C. The use of a variable representing compliance improves accuracy of estimation of the effect of treatment allocation regardless of discontinuation in trials with incomplete follow-up. Stat Biopharm Res. 2021;13(1):119–27. https://doi.org/10.1080/19466315.2020.1736141.
Article Google Scholar
Polverejan E, Dragalin V. Aligning treatment policy estimands and estimators—a simulation study in Alzheimer’s disease. Stat Biopharm Res. 2020;12(2):142–54. https://doi.org/10.1080/19466315.2019.1689845.
Article Google Scholar
Lasch F, Guizzaro L, Pétavy F, Gallo C. A simulation study on the estimation of the effect in the hypothetical scenario of no use of symptomatic treatment in trials for disease-modifying agents for Alzheimer’s disease. Stat Biopharm Res. 2022. https://doi.org/10.1080/19466315.2022.2055633.
Article Google Scholar
Olarte Parra C, Daniel RM, Bartlett JW. Hypothetical estimands in clinical trials: a unification of causal inference and missing data methods. Stat Biopharm Res. 2022. https://doi.org/10.1080/19466315.2022.2081599.
Article PubMed PubMed Central Google Scholar
Meininger V, Genge A, van den Berg LH, et al. Safety and efficacy of ozanezumab in patients with amyotrophic lateral sclerosis: a randomised, double-blind, placebo-controlled, phase 2 trial. Lancet Neurol. 2017;16(3):208–16. https://doi.org/10.1016/S1474-4422(16)30399-4.
Article CAS PubMed Google Scholar
Darken P, Nyberg J, Ballal S, Wright D. The attributable estimand: a new approach to account for intercurrent events. Pharm Stat. 2020;19(5):626–35. https://doi.org/10.1002/pst.2019.
Article PubMed Google Scholar
Ratitch B, O’Kelly M, Tosiello R. Missing data in clinical trials: from clinical assumptions to statistical analysis using pattern mixture models. Pharm Stat. 2013;12(6):337–47. https://doi.org/10.1002/pst.1549.
Article PubMed Google Scholar
Little R, Kang S. Intention-to-treat analysis with treatment discontinuation and missing data in clinical trials. Stat Med. 2015;34(16):2381–90. https://doi.org/10.1002/sim.6352.
Article PubMed Google Scholar
Akacha M, Bretz F, Ruberg S. Estimands in clinical trials—broadening the perspective. Stat Med. 2017;36(1):5–19. https://doi.org/10.1002/sim.7033.
Article PubMed Google Scholar
Mallinckrodt CH, Bell J, Liu G, et al. Aligning estimators with estimands in clinical trials: putting the ICH E9(R1) guidelines into practice. Ther Innov Regul Sci. 2020;54(2):353–64. https://doi.org/10.1007/s43441-019-00063-9.
Article CAS PubMed Google Scholar
Mitroiu M, Teerenstra S, Oude Rengerink K, Pétavy F, Roes KCB. Estimation of treatment effects in short-term depression studies. An evaluation based on the ICH E9(R1) estimands framework. Pharm Stat. 2022.https://doi.org/10.1002/pst.2214
Mehrotra DV, Marceau WR. Survival analysis using a 5-step stratified testing and amalgamation routine (5-STAR) in randomized clinical trials. Stat Med. 2021;40(19):4341–3. https://doi.org/10.1002/sim.9116.
Article PubMed Google Scholar
Royston P, Parmar MKB. Restricted mean survival time: an alternative to the hazard ratio for the design and analysis of randomized trials with a time-to-event outcome. BMC Med Res Methodol. 2013;13(1):152. https://doi.org/10.1186/1471-2288-13-152.
Article PubMed PubMed Central Google Scholar
Uno H, Claggett B, Tian L, et al. Moving beyond the hazard ratio in quantifying the between-group difference in survival analysis. J Clin Oncol. 2014;32(22):2380–5. https://doi.org/10.1200/JCO.2014.55.2208.
Article PubMed PubMed Central Google Scholar
Major Depressive Disorder: Developing Drugs for Treatment, Guidance for Industry. U.S. Food and Drug Administration (FDA). Updated June 2018. https://www.fda.gov/regulatory-information/search-fda-guidance-documents/major-depressive-disorder-developing-drugs-treatment. Accessed Mar 7 2023
Kessler RC, Bromet EJ. The epidemiology of depression across cultures. Annu Rev Public Health. 2013;34:119–38. https://doi.org/10.1146/annurev-publhealth-031912-114409.
Article PubMed PubMed Central Google Scholar
Diagnostic and Statistical Manual of Mental Disorders (DSM-5-TR). Updated 2022. https://psychiatry.org/psychiatrists/practice/dsm. Accessed Sept 7 2022
Medeiros GC, Rush AJ, Jha M, et al. Positive and negative valence systems in major depression have distinct clinical features, response to antidepressants, and relationships with immunomarkers. Depress Anxiety. 2020;37(8):771–83. https://doi.org/10.1002/da.23006.
Article CAS PubMed PubMed Central Google Scholar
Ten Have M, de Graaf R, van Dorsselaer S, Tuithof M, Kleinjan M, Penninx B. Recurrence and chronicity of major depressive disorder and their risk indicators in a population cohort. Acta Psychiatr Scand. 2018;137(6):503–15. https://doi.org/10.1111/acps.12874.
Article PubMed Google Scholar
Parker G, Roy K, Hadzi-Pavlovic D, Wilhelm K, Mitchell P. The differential impact of age on the phenomenology of melancholia. Psychol Med. 2001;31(7):1231–6. https://doi.org/10.1017/s0033291701004603.
Article CAS PubMed Google Scholar
Hamilton M. Development of a rating scale for primary depressive illness. Br J Soc Clin Psychol. 1967;6(4):278–96. https://doi.org/10.1111/j.2044-8260.1967.tb00530.x.
Article CAS PubMed Google Scholar
O’Kelly M, Ratitich B. Clinical trials with missing data. New York: Wiley; 2014.
Book Google Scholar
Bunouf P, Grouin JM, Molenberghs G. Analysis of an incomplete binary outcome derived from frequently recorded longitudinal continuous data: application to daily pain evaluation. Stat Med. 2012;31(15):1554–71. https://doi.org/10.1002/sim.4491.
Article CAS PubMed Google Scholar
Estimating Principal Strata. Drug Information Association Scientific Working Group on Estimands and Missing Data. Updated Sept 2 2021. https://www.lshtm.ac.uk/research/centres-projects-groups/missing-data#dia-working-group. Accessed Sept 7 2022
Bornkamp B, Rufibach K, Lin J, et al. Principal stratum strategy: potential role in drug development. Pharm Stat. 2021;20(4):737–51. https://doi.org/10.1002/pst.2104.
Article PubMed Google Scholar
Lipkovich I, Ratitch B, Qu Y, Zhang X, Shan M, Mallinckrodt C. Using principal stratification in analysis of clinical trials. Stat Med. 2022;41(19):3837–77. https://doi.org/10.1002/sim.9439.
Article PubMed Google Scholar
Lipkovich I, Ratitch B, O’Kelly M. Sensitivity to censored-at-random assumption in the analysis of time-to-event endpoints. Pharm Stat. 2016;15(3):216–29. https://doi.org/10.1002/pst.1738.
Article PubMed Google Scholar
Boyd AP, Kittelson JM, Gillen DL. Estimation of treatment effect under non-proportional hazards and conditionally independent censoring. Stat Med. 2012;31(28):3504–15. https://doi.org/10.1002/sim.5440.
Article PubMed Google Scholar
Marwood L, Taylor R, Goldsmith K, et al. Study protocol for a randomised pragmatic trial comparing the clinical and cost effectiveness of lithium and quetiapine augmentation in treatment resistant depression (the LQD study). BMC Psychiatry. 2017;17(1):231. https://doi.org/10.1186/s12888-017-1393-0.
Article CAS PubMed PubMed Central Google Scholar
Brunner E, Tohen M, Osuntokun O, Landry J, Thase ME. Efficacy and safety of olanzapine/fluoxetine combination vs fluoxetine monotherapy following successful combination therapy of treatment-resistant major depressive disorder. Neuropsychopharmacology. 2014;39(11):2549–59. https://doi.org/10.1038/npp.2014.101.
Article CAS PubMed PubMed Central Google Scholar
Bell J, Hamilton A, Sailer O, Voss F. The detailed clinical objectives approach to designing clinical trials and choosing estimands. Pharm Stat. 2021;20(6):1112–24. https://doi.org/10.1002/pst.2129.
Article PubMed Google Scholar
Ratitch B, Bell J, Mallinckrodt C, et al. Choosing estimands in clinical trials: putting the ICH E9(R1) into practice. Ther Innov Regul Sci. 2020;54(2):324–41. https://doi.org/10.1007/s43441-019-00061-x.
Article PubMed Google Scholar
Ratitch B, Goel N, Mallinckrodt C, et al. Defining efficacy estimands in clinical trials: examples illustrating ICH E9(R1) guidelines. Ther Innov Regul Sci. 2020;54(2):370–84. https://doi.org/10.1007/s43441-019-00065-7.
Article PubMed Google Scholar
E9(R1) Training Material - PDF_0.pdf. ich.org. Updated Dec 2021. https://database.ich.org/sites/default/files/E9%28R1%29%20Training%20Material%20-%20PDF_0.pdf. Accessed Sept 7 2022

Download references

Acknowledgements

The authors would like to thank Zimri S. Yaseen, MD (FDA) and Lorenzo Guizzaro, MD, PhD (EMA) for significant contributions to this paper.

Funding

No funding was received for this research.

Author information

Authors and Affiliations

Statistics and Decision Sciences, Janssen Pharmaceuticals - Johnson & Johnson, 1125 Trenton-Harbourton Rd, Titusville, NJ, 08560, USA
Elena Polverejan & Pilar Lim
Center for Statistics in Drug Development, IQVIA, Dublin 3, Ireland
Michael O’Kelly
Global Clinical Development, Therapeutic Area Psychiatry, H. Lundbeck A/S, Valby, Denmark
Nanco Hefting
Statistical & Quantitative Sciences, Takeda Pharmaceuticals U.S.A., Inc., Lexington, MA, USA
Jonathan D. Norton
Quantitative Sciences Consulting, Statistics and Decision Sciences, Janssen Pharmaceuticals - Johnson & Johnson, Titusville, NJ, USA
Marc K. Walton

Authors

Elena Polverejan
View author publications
You can also search for this author in PubMed Google Scholar
Michael O’Kelly
View author publications
You can also search for this author in PubMed Google Scholar
Nanco Hefting
View author publications
You can also search for this author in PubMed Google Scholar
Jonathan D. Norton
View author publications
You can also search for this author in PubMed Google Scholar
Pilar Lim
View author publications
You can also search for this author in PubMed Google Scholar
Marc K. Walton
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Elena Polverejan.

Ethics declarations

Conflict of interest

On behalf of all authors, the corresponding author states that there is no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Polverejan, E., O’Kelly, M., Hefting, N. et al. Defining Clinical Trial Estimands: A Practical Guide for Study Teams with Examples Based on a Psychiatric Disorder. Ther Innov Regul Sci 57, 911–939 (2023). https://doi.org/10.1007/s43441-023-00524-2

Download citation

Received: 16 December 2022
Accepted: 08 April 2023
Published: 27 May 2023
Issue Date: September 2023
DOI: https://doi.org/10.1007/s43441-023-00524-2

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Defining Clinical Trial Estimands: A Practical Guide for Study Teams with Examples Based on a Psychiatric Disorder

Abstract

Introduction

Process for Selecting and Constructing Estimands

Identify Stakeholder(s) and Decision(s) to be Made

Define an Objective(s)

Formulate the Clinical Question of Interest, Define the Corresponding Estimand, and Justify Their Utility to the Stakeholder

Formulate the Clinical Question of Interest

Define the Estimand

Identify ICEs

ICE-Handling Strategies

Process for Selecting an Estimator Aligned with an Estimand

Estimand Examples for Major Depressive Disorder

Discussion

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation