Biostatistics & Population Health
Sensitivity analysis in cost-effectiveness studies
— Inputs derived from small trials, expert opinion, or extrapolated long-term outcomes
— ICER falls near a decision threshold (e.g., $95k/QALY when WTP is $100k)
— Model relies on surrogate endpoints translated into QALYs
— Industry-sponsored analyses where parameter selection may favor a product
— Long time horizons (lifetime models) where discount rate choice dominates
Board pearl: A cost-effectiveness conclusion that does not survive a reasonable one-way or probabilistic sensitivity analysis should be considered hypothesis-generating only — analogous to a non-significant trend in a clinical trial. SA is the CEA equivalent of asking, "Would my decision change if my assumptions were wrong?"

— "The authors varied the drug cost from $40k to $120k" → one-way SA of parameter uncertainty
— "They re-ran the analysis assuming a 3-state vs 5-state Markov model" → structural/scenario analysis
— "They changed the perspective from healthcare sector to societal" → methodological analysis
— "10,000 Monte Carlo iterations sampling from input distributions" → probabilistic SA
Board pearl: If the stem mentions "Monte Carlo," "Dirichlet/beta/gamma distributions," or "cost-effectiveness acceptability curve," the technique is probabilistic sensitivity analysis (PSA). If it mentions "tornado diagram," it is deterministic one-way SA. These visual cues are the fastest identifiers on exam day.

Step 3 management: When shown a tornado diagram with drug cost as the widest bar, the appropriate next step is often price negotiation or a value-based contract, not additional clinical trials — because clinical efficacy is not the dominant uncertainty. Recognizing the driver of uncertainty directs the right downstream action.

— Transparent, easy to communicate to non-statisticians
— Identifies which parameters most influence the result
— Quickly reveals threshold values for negotiation
— Ignores joint uncertainty (correlations between parameters)
— Cannot produce probability statements ("80% chance of being cost-effective")
— Best-case/worst-case scenarios may be implausibly extreme
Board pearl: If a question asks which parameter the analyst should target with a future trial to reduce decision uncertainty, the answer comes from the tornado diagram's widest bar — that is the parameter where additional evidence has the most decision-relevant value (related to expected value of perfect information, EVPI).

— Beta distribution — probabilities and utilities (bounded 0–1)
— Gamma or log-normal distribution — costs and resource use (right-skewed, ≥0)
— Log-normal — relative risks, hazard ratios, odds ratios
— Dirichlet distribution — transition probabilities across multiple Markov states (multivariate extension of beta)
— Normal — log-transformed regression coefficients
— Mean and 95% credible interval for the ICER
— Probability cost-effective at multiple WTP thresholds
— CEAC for the full WTP range
Board pearl: PSA does not address structural uncertainty — if the underlying Markov model is wrong (e.g., wrong health states), PSA will produce precise-looking results that are still biased. PSA is precision around a possibly-misspecified mean.

— x-axis: willingness-to-pay threshold ($/QALY)
— y-axis: probability the intervention is cost-effective at that threshold
— At WTP = $0, the curve shows P(intervention is cost-saving)
— At very high WTP, the curve approaches the probability the intervention is simply more effective
— Strong: dominant strategy (less costly + more effective) or CEAC >95% at WTP
— Moderate: ICER well below WTP, CEAC 70–95%
— Fragile: ICER near WTP, CEAC 40–70%, wide credible interval
— Reject or revise: dominated strategy or CEAC <40%
Key distinction: A favorable point-estimate ICER with a flat, low CEAC = the result is statistically fragile. A less favorable ICER with a steep CEAC reaching 95% quickly = robust conclusion. Always favor robustness over point-estimate optimism — analogous to preferring a tight CI to a flashy effect size.

— Deterministic one-way SA on all key parameters → tornado diagram
— Scenario analyses for structural/methodological choices (perspective, time horizon, discount rate)
— Probabilistic SA with ≥1,000 iterations → CEAC + cost-effectiveness plane
— Threshold analyses for policy-relevant inputs (drug price, adherence)
— Tornado diagram (parameter influence)
— PSA with CEAC (probabilistic robustness)
— Scenario for societal perspective (policy relevance)
— Discount rate sensitivity (0%, 3%, 5%)
— Disclosure of funding source and model availability
Board pearl: A CEA that reports only a point-estimate ICER without PSA should be treated like a clinical trial reporting only point-estimate efficacy without confidence intervals — methodologically incomplete and not suitable for guideline-grade evidence.

— Step 1: List all model parameters with point estimate, plausible range, and distribution. Cost inputs → gamma; probabilities → beta; HRs → log-normal.
— Step 2: For one-way SA, vary each parameter across its 95% CI (or ±25–50% if no CI available), recompute ICER, record min/max.
— Step 3: Rank parameters by absolute change in ICER → plot horizontal bars sorted widest-to-narrowest → tornado diagram.
— Step 4: For threshold analysis on key parameters (e.g., drug cost), solve for the value at which ICER = WTP.
— Step 5: For PSA, sample once from every parameter's distribution per iteration; run 1,000–10,000 iterations. Preserve correlations where biologically/economically plausible.
— Step 6: Plot all (ΔCost, ΔQALY) pairs on the cost-effectiveness plane. Overlay WTP line.
— Step 7: Compute probability the intervention is cost-effective at each WTP → CEAC.
— Step 8: Calculate EVPI; if high, consider EVPPI to identify which parameters warrant further research.
— Convergence: do mean ICER and CEAC stabilize as iterations increase?
— Face validity: do sampled inputs produce clinically plausible outcomes?
— Internal consistency: do probabilities sum to 1 within each Markov cycle?
CCS pearl: If a question asks the next step after observing that the tornado diagram is dominated by one parameter, the answer is usually probabilistic SA with focused attention on that parameter's distribution, or value-of-information analysis (EVPPI) to decide whether a new study is warranted before adopting the intervention.

— An intervention may be cost-effective on average but not in low-risk subgroups (low baseline event rate → small absolute benefit → high ICER).
— Conversely, it may be highly cost-effective in high-risk subgroups even when the population average is borderline.
— Example: statins for primary prevention — cost-effective at 10-year ASCVD risk ≥7.5%, less so below.
Key distinction: Pooling subgroups into a single ICER can mask clinically and ethically important variation. A vignette in which a therapy is "cost-effective overall but ICER is $400k/QALY in patients ≥75" should trigger age-stratified policy, not blanket adoption — a recurring Step 3 systems-of-care theme.

— Lifetime horizons of 70+ years amplify the impact of the discount rate — differential discounting (e.g., 3% costs, 1.5% QALYs) is sometimes used and should be tested in SA.
— Pediatric utility weights are difficult to elicit; sensitivity around proxy-reported utilities is essential.
— Long-term effects of childhood interventions (vaccines, screening) compound — small per-cycle errors balloon over decades.
— Dual-patient framework: maternal and fetal/neonatal outcomes must both feed the QALY calculation. SA should include scenarios with maternal-only and combined outcomes.
— Short time horizons for acute interventions (e.g., antenatal corticosteroids) reduce discount-rate sensitivity but increase sensitivity to event probability inputs.
— Herd immunity assumptions (structural)
— Vaccine efficacy waning curve (parameter)
— Discount rate (methodological)
— Disease incidence (parameter, often from surveillance with wide CIs)
— Small trials → wide parameter CIs → PSA credible intervals span orders of magnitude.
— Some payers apply modified WTP thresholds for ultra-rare conditions (e.g., NICE's higher threshold for end-of-life and rare disease therapies); SA should test multiple thresholds.
Step 3 management: For an orphan drug CEA with a base-case ICER of $500,000/QALY but PSA showing 30% probability of being cost-effective at a $300,000/QALY rare-disease threshold, the appropriate framing is "decision under substantial uncertainty" — supporting conditional coverage, coverage with evidence development, or value-based pricing, rather than outright adoption or rejection.

Board pearl: When a CEA's conclusion changes under reasonable variation of a single non-clinical parameter (e.g., assumed drug price), the conclusion is not robust — treat it like a clinical trial whose result depends on excluding one outlier patient.

— If EVPI > cost of a definitive trial → trial is worth conducting.
— If EVPI < trial cost → adopt current best estimate; further research not value-generating.
— SA shows decision uncertainty → calculate EVPI
— EVPI > expected trial cost → calculate EVPPI to localize uncertainty
— EVPPI identifies key parameter(s) → calculate EVSI for proposed study
— Design and fund the study accordingly
Step 3 management: When asked the most appropriate next step after a CEA reveals a 50/50 probability of cost-effectiveness at the WTP threshold, the right answer is rarely "adopt" or "reject" — it is typically conduct a value-of-information analysis to determine whether additional research is justified, or conditional coverage with mandated data collection.

Board pearl: When a vignette describes Monte Carlo sampling and QALYs, think CUA with PSA. When it describes 5-year total spending projections with uptake scenarios, think BIA with scenario analysis. The SA tools differ accordingly.

— Calibration = tuning model parameters so predicted outcomes match observed data
— Validation = checking model predictions against an external dataset
— Neither is sensitivity analysis, but both are prerequisites for credible SA.
Key distinction: "Sensitivity" in "sensitivity and specificity" describes test performance; "sensitivity" in "sensitivity analysis" describes model robustness. On Step 3, the context (diagnostic vignette vs cost-effectiveness vignette) disambiguates — but the trap is real.

— Item 20: Characterizing heterogeneity (subgroup analyses)
— Item 21: Characterizing distributional effects (DCEA)
— Item 22: Characterizing uncertainty (deterministic and probabilistic SA)
— Item 23: Approach to engagement with patients, stakeholders, and funders
— Healthcare sector AND societal perspectives
— Lifetime time horizon for chronic conditions
— 3% discount rate (with 0% and 5% in SA)
— Impact inventory specifying included/excluded costs
— PSA with CEAC
— Full parameter table with distributions
— Model structure diagram
— Software and version
— Ideally, public code repository (GitHub, OSF)
— Conditional coverage / managed entry agreements
— Risk-sharing contracts (manufacturer refunds if real-world outcomes underperform)
— Outcome-based pricing
— Mandated post-marketing surveillance
— Periodic re-analysis as new evidence emerges
Step 3 management: For a value-based contract built around a CEA with substantial PSA uncertainty, the appropriate long-term plan includes pre-specified re-analysis triggers (e.g., re-run CEA after 3 years of registry data) — mirroring the post-marketing surveillance pathway for drug safety.

— Drug acquisition cost (often falls with biosimilars/generics — a 30–80% drop can flip a non-cost-effective therapy to cost-saving)
— Real-world adherence (typically lower than trial adherence → reduces real-world cost-effectiveness)
— Long-term effectiveness (durability of treatment effect beyond trial duration)
— Comparator landscape (a new competitor changes the relevant ICER)
— Quality-of-life evidence (more mature utility data may shift QALY estimates)
— Major drug class: every 3–5 years or upon biosimilar entry
— Rapidly evolving fields (oncology, gene therapy): every 1–2 years
— Stable interventions (vaccines, screening): every 5–10 years or upon major epidemiologic change
— Translate ICERs and CEACs into plain language for clinicians and policymakers ("at $100k/QALY, 78% likely to be cost-effective")
— Distinguish economic from clinical recommendations — a non-cost-effective therapy may still be clinically appropriate; the CEA informs resource allocation, not patient-level decisions in most US contexts.
CCS pearl: When biosimilar entry is announced for a high-cost biologic, the appropriate "next order" is to re-run the cost-effectiveness model with updated price inputs and full PSA — frequently flipping a borderline therapy into a cost-effective range and triggering coverage policy updates within months.

— Disabled patients (lower baseline utility → smaller absolute QALY gains)
— Elderly patients (shorter remaining life-years)
— Patients with severe chronic illness
— In response, the Affordable Care Act (Section 1182) prohibits CMS/PCORI from using QALYs to deny or limit Medicare coverage — a uniquely US legal constraint that distinguishes our system from NICE (UK) or CADTH (Canada). SA on alternative metrics (equal-value life-years, healthy-years equivalents) is increasingly required for US policy use.
Board pearl: A Step 3 stem describing a US Medicare coverage decision invoking a $/QALY threshold should trigger recognition that CMS cannot legally use QALYs as the sole basis for coverage denial — a frequent ethics/health-policy test point.

— Strong dominance: more effective AND less costly → adopt
— Weak/extended dominance: dominated by a linear combination of other strategies → eliminate
Key distinction: Cost-effective ≠ affordable ≠ clinically appropriate. Each axis requires its own analysis, and SA applies to each.

Board pearl: Always identify (1) which type of SA is described, (2) what uncertainty type it addresses, and (3) whether the conclusion is robust at the relevant US WTP threshold — three checks that solve most Step 3 SA stems.

Sensitivity analysis is the formal toolkit — one-way deterministic SA (tornado diagrams), scenario analysis, and probabilistic SA (Monte Carlo with CEACs) — used to test whether a cost-effectiveness analysis's ICER conclusion survives plausible variation in its inputs, model structure, and methodological choices, with value-of-information analysis as the escalation pathway when decision uncertainty remains substantial.
— Deterministic one-way SA answers "Which input most influences my ICER?" via the tornado diagram; PSA answers "What is the probability the intervention is cost-effective?" via the CEAC and cost-effectiveness plane. Both are required by modern reporting standards (CHEERS 2022, ISPOR, Second Panel).
— Use beta distributions for probabilities and utilities, gamma or log-normal for costs, log-normal for hazard/relative risks, and Dirichlet for multi-state transition probabilities; the US reference case uses a 3% discount rate with sensitivity at 0% and 5%, a lifetime horizon for chronic disease, and both healthcare-sector and societal perspectives.
— When PSA reveals substantial decision uncertainty at the relevant WTP threshold (commonly $50–150k/QALY in the US), the appropriate next step is value-of-information analysis (EVPI/EVPPI/EVSI) to determine whether further research is worth funding, often paired with coverage with evidence development as a policy bridge.
— Recognize the US legal context: ACA Section 1182 prohibits CMS from using QALY-based thresholds to deny Medicare coverage, making equity-informed analyses (DCEA, equal-value life-years) increasingly important; recognize that cost-effective ≠ affordable — budget impact analysis is a separate, complementary requirement, as the 2014 hepatitis C DAA experience demonstrated.
Board pearl: On Step 3, identify the SA type from visual/textual cues (tornado → one-way; Monte Carlo + CEAC → PSA), check robustness at the US WTP threshold, and escalate to VOI or conditional coverage when uncertainty remains — these three moves resolve nearly every sensitivity-analysis vignette you will encounter.

