Estimand Framework Explained: ICH E9(R1) & Intercurrent Events

If you design or analyze clinical trials, the estimand framework introduced by the ICH E9(R1) addendum has become part of the everyday vocabulary of protocols, statistical analysis plans, and regulatory interactions. It changed the starting point of trial design: instead of asking “which analysis method should we use?”, teams now ask “what treatment effect are we actually trying to estimate?” — and only then choose the method.

This guide explains what an estimand is, the five attributes that define one, the five strategies for handling intercurrent events, how the choice plays out across therapeutic areas, and how each strategy maps to a concrete analysis in R. The intended reader is a biostatistician, clinical development scientist, or regulatory reviewer who knows trial statistics but wants a precise, working model of the framework.

記事の目次

What is an estimand?

An estimand is the target of estimation — a precise statement of the treatment effect that answers the clinical question of the trial. The ICH E9(R1) addendum defines it as:

> A precise description of the treatment effect reflecting the clinical question posed by a given clinical trial objective.

The crucial point is that an estimand is not an analysis method. It is useful to keep three terms separate:

The estimand is what you want to estimate (the true treatment effect of interest).
The estimator is the method used to estimate it (for example, an MMRM or multiple imputation).
The estimate is the number the estimator produces in your data.

The same estimator can target different estimands, and the same estimand can be approached by different estimators. Run an MMRM on two different definitions of the treatment effect and you get two different conclusions from the same data. That is exactly the ambiguity the framework removes.

Aspect	Traditional (ITT-centric) thinking	Estimand framework
Starting point	Choice of analysis set and method	Definition of the treatment effect of interest
Intercurrent events	Often handled implicitly or after the fact	Chosen explicitly from five strategies, up front
Missing data	Mixed up with treatment discontinuation	Separated cleanly from intercurrent events
Sensitivity analysis	An optional add-on	A required check tied to the estimand’s assumptions

The five attributes of an estimand

An estimand is specified by five attributes. Leave any one undefined and the “treatment effect” is no longer fully pinned down.

#	Attribute	What it specifies
1	Treatment	The treatment condition and the comparator — monotherapy or combination, dose, duration, and titration rules
2	Population	The patients the question is about, defined by inclusion and exclusion criteria
3	Variable (endpoint)	The measurement obtained from each subject — its definition, timing, and units
4	Intercurrent events	Post-randomization events that affect interpretation or existence of the endpoint — and the strategy chosen for each
5	Population-level summary	The summary measure for the between-group comparison (mean difference, odds ratio, hazard ratio, difference in response rates, …)

The fourth attribute is where most of the thinking happens. An intercurrent event (ICE) is a post-randomization event — treatment discontinuation, use of rescue medication, a switch to a different therapy, or death — that occurs after treatment starts and changes how (or whether) the endpoint can be interpreted.

⚠️ Caution: an intercurrent event is not the same as missing data
An intercurrent event is an event (a patient stops treatment, takes rescue medication). Missing data is a separate, downstream question of whether a measurement exists. A measurement can still be taken after an intercurrent event, and missing data can arise with no intercurrent event at all. Conflating the two leads to the wrong handling strategy and a mis-specified sensitivity analysis.

Five strategies for intercurrent events

ICH E9(R1) sets out five strategies for dealing with intercurrent events. The strategy you choose changes the clinical meaning of the estimated treatment effect — not just the numbers.

Strategy	How the event is handled	Clinical question it answers	Typical example
Treatment policy	The event is treated as irrelevant; the observed outcome is used regardless	What is the effect of being assigned the treatment regimen as it would be used in practice?	Overall survival in oncology
Hypothetical	The outcome is the one that would have been observed had the event not occurred	What would the effect have been if the event had not happened?	HbA1c in diabetes, ignoring rescue medication
Composite	The event itself is folded into the endpoint (e.g., counted as a failure)	What is the combined success of efficacy and avoiding the event?	“Responder and still on treatment” in psychiatry
While on treatment	Only data observed before the event are used	What is the effect during the time patients are actually on treatment?	Symptom or QoL scores in palliative care
Principal stratum	The effect is targeted in a latent subgroup defined by event behavior	What is the effect in the subgroup who would not experience the event?	Effect among those who would remain uninfected, in vaccine trials

💡 Key point
You can assign different strategies to different intercurrent events within a single estimand. It is entirely standard to handle “rescue medication” with a hypothetical strategy while handling “death” with a composite or treatment-policy strategy. The estimand is defined by the full set of choices, not a single one.

Choosing a strategy by therapeutic area

Strategy selection is hard to reason about in the abstract, so it helps to anchor it in concrete clinical questions.

Diabetes (HbA1c endpoint)

Patients whose glycemic control deteriorates may be given rescue therapy, and HbA1c measured after rescue no longer reflects the randomized drug. If the question is “what is the intrinsic effect of the study drug?”, a hypothetical strategy (the outcome had rescue not been given) is natural. If the question is “how effective is this as a real-world treatment policy?”, use treatment policy. If it is “what fraction maintained control without needing rescue?”, that is a composite endpoint.

Oncology (overall survival)

Overall survival has death as the event, so measurement continues even after treatment discontinuation or subsequent therapy. A treatment policy strategy is usually the first choice and is generally well accepted by regulators. For progression-free survival, the influence of subsequent therapy is more contentious, and hypothetical strategies are sometimes discussed.

Psychiatry (depression, schizophrenia)

Discontinuation rates are high, and discontinuation mixes “lack of efficacy” with “tolerability”. A naive hypothetical strategy can be misleading here, so composite strategies (“on treatment and improved” counts as success) and while-on-treatment strategies have become more common.

Vaccines and infectious disease

Patients who become infected and are then treated are a different latent group from those who never become infected. To estimate the effect within those who would remain uninfected, a principal stratum strategy is used — but it carries an identifiability problem and therefore demands strong assumptions and explicit sensitivity analysis.

From estimand to analysis in R

Once the estimand is fixed, the strategy for each intercurrent event implies how the data are prepared and which estimator is appropriate. The snippets below are schematic — they show the shape of the analysis for each strategy rather than a single packaged dataset. Assume a long-format data frame long_data with columns subject, arm, visit, change (change from baseline), base (baseline value), and ice_time (the time of the intercurrent event, or NA if none).

Treatment policy keeps everything, including measurements taken after the event, because the event is considered part of the regimen:

library(mmrm)

# Nothing is removed; post-event measurements are retained.
fit_tp <- mmrm(
  change ~ arm * visit + base + us(visit | subject),
  data = long_data
)

Hypothetical treats post-event measurements as missing and targets the outcome that would have occurred without the event. Under a missing-at-random (MAR) assumption, a likelihood-based MMRM estimates this without explicit imputation:

# Drop measurements taken at or after the intercurrent event.
hypo_data <- subset(long_data, is.na(ice_time) | visit < ice_time)

fit_hypo <- mmrm(
  change ~ arm * visit + base + us(visit | subject),
  data = hypo_data
)

This is the strategy most naturally aligned with a default MMRM under MAR; the mechanics of fitting and interpreting that model are covered in our companion guide.

MMRM Explained: Mixed Models for Repeated Measures in RA practical guide to MMRM for clinical trials: core concepts, covariance structures, missing data, and R implementation with the mmrm package....

Composite folds the event into the endpoint. Here a “success” requires both a clinical response and no intercurrent event, and the comparison is a difference in success rates:

# One row per subject: responded (0/1) and whether an event occurred.
subj$success <- as.integer(subj$responded == 1 & is.na(subj$ice_time))

tab <- table(subj$arm, subj$success)
prop.test(tab[, "1"], rowSums(tab))   # difference in success proportions

While on treatment restricts each subject to data observed strictly before the event:

on_trt <- subset(long_data, is.na(ice_time) | visit < ice_time)
# then fit the same MMRM on `on_trt`, interpreting the effect as "while on treatment"

Principal stratum cannot be read off the data by simple subsetting, because the latent stratum (for example, “would remain uninfected under either arm”) is not directly observed. It requires structural assumptions and dedicated methods, and should always be paired with a sensitivity analysis.

FAQ

Is the estimand the same as the statistical analysis method?
No. The estimand is the treatment effect you want to estimate; the analysis method (the estimator) is how you estimate it. Two trials can use the same MMRM yet target different estimands, and reach different conclusions. Fix the estimand first, then choose the estimator to match it.

What is the difference between an intercurrent event and missing data?
An intercurrent event is something that happens to the patient (stopping treatment, taking rescue medication). Missing data is whether a measurement was recorded. A measurement can exist after an intercurrent event, and data can be missing without any intercurrent event. The estimand’s intercurrent-event strategy is a design decision; missing data is then handled by the estimator.

Which strategy do regulators prefer?
There is no universally preferred strategy — it depends on the clinical question and the therapeutic area. Treatment policy is common and well accepted for survival endpoints; hypothetical strategies are common for pharmacodynamic endpoints like HbA1c. FDA, EMA, and PMDA can have different expectations for the same disease, so confirm the intended estimand in scientific advice or a pre-submission meeting.

Does the estimand framework make ITT obsolete?
No. The framework refines, rather than replaces, the thinking behind ITT. ITT addresses the analysis population and the principle of analyzing as randomized; the estimand framework makes the rest of the question — especially the handling of intercurrent events — explicit and pre-specified.

Do I need a main estimand and supplementary estimands?
Often, yes. A primary estimand aligned with the regulatory question, paired with one or more supplementary estimands that reflect a different clinical interpretation, tends to communicate the trial’s findings more completely. Each estimand carries its own primary estimator and its own sensitivity analyses.

The Estimand Framework Explained: ICH E9(R1), Five Attributes, and Intercurrent-Event Strategies

What is an estimand?

The five attributes of an estimand

Five strategies for intercurrent events

Choosing a strategy by therapeutic area

Diabetes (HbA1c endpoint)

Oncology (overall survival)

Psychiatry (depression, schizophrenia)

Vaccines and infectious disease

From estimand to analysis in R

FAQ

Further reading

The Estimand Framework Explained: ICH E9(R1), Five Attributes, and Intercurrent-Event Strategies

第I相がん臨床試験の用量探索デザイン ― 3+3・CRM・BOINをRで実装する ―

負の二項回帰と再発イベント率解析 ― 喘息・COPD増悪をRで実装する ―

ベイズ階層モデル ― バスケット試験での借用と縮約をRで実装 ―

ティッピングポイント解析（Tipping Point Analysis）とは ― MNAR欠測の感度分析をRで実装する ―

ICH E6（GCP）とは？改訂のポイントまで図解でわかりやすく解説

MMRM（反復測定混合モデル）とは― 臨床試験での柔軟な時系列解析手法 ―

p値を正しく理解する：統計学を勉強していく人のための基礎から応用まで

効果量（Effect Size）を理解すると統計が一気に実務的になる― p値の限界を超えて、“どれだけ効くか”を語れる統計へ ―

Cox比例ハザードモデル入門〜数式から実務応用まで〜