top of page

Eduovisual

Biostatistics & Population Health

Absolute risk reduction vs relative risk reduction

Clinical Overview and When to Suspect Misleading Risk Reporting

ARR = control event rate (CER) − experimental event rate (EER); reported as a percentage-point difference

RRR = (CER − EER) / CER = ARR / CER; reported as a proportional reduction

Number needed to treat (NNT) = 1 / ARR; the clinical translator of ARR into "how many patients must I treat to prevent one event?"

— Drug ad or pharma rep claims "50% reduction in heart attacks" without baseline risk

— Headline trumpets a "doubling of risk" when baseline incidence is 1 in 10,000

— A question asks you to counsel a patient who saw a TV commercial quoting only RRR

— Stem provides a 2×2 table or two event rates and asks for the "most accurate" way to describe benefit to the patient

— Outpatient counseling: shared decision-making about statins, anticoagulation, cancer screening, and aspirin hinges on translating trial data into patient-understandable risk

— Value-based care: NNT/NNH frame cost-effectiveness conversations

— Patient safety and informed consent: presenting only RRR without ARR is considered a framing bias that undermines true informed consent

Board pearl: Whenever a stem quotes only a relative number ("reduces stroke by 40%"), the answer almost always involves recognizing that absolute risk reduction and NNT give the clinically meaningful picture — especially when baseline risk is low.

Core concept: Absolute risk reduction (ARR) and relative risk reduction (RRR) are two ways to express the same trial result, but they communicate dramatically different magnitudes of benefit to clinicians and patients.
When to suspect misleading reporting on a board stem:
Why Step 3 cares:
Conceptual anchor: RRR is constant across populations with very different baseline risks; ARR shrinks as baseline risk falls. A drug with 30% RRR helps a high-risk 70-year-old far more (large ARR, small NNT) than a low-risk 40-year-old (tiny ARR, huge NNT).
Solid White Background
Presentation Patterns and Key History

— "A new trial reports the drug reduces MI from 4% to 2% over 5 years." → ARR = 2%, RRR = 50%, NNT = 50

— "The pharmaceutical representative tells the physician the medication produces a 30% reduction in cardiovascular events." → Recognize this as RRR; demand baseline rates before counseling

— "Compared with placebo, the intervention reduced the event rate from 1.0% to 0.5%." → RRR is still 50%, but ARR is only 0.5% and NNT is 200 — clinically marginal

— Stem provides a 2×2 contingency table → compute CER, EER, then ARR and RRR

— Patient cites a magazine article, online ad, or family member's anecdote with a percentage

— Patient asks, "How much will this really help me?" — the answer is ARR/NNT applied to their baseline risk

— Patient declines therapy because of a scary "doubling" of side-effect risk → ask for the absolute side-effect rate (relative risk increase vs. absolute risk increase)

— Identify the control event rate (e.g., placebo arm 8%)

— Identify the experimental event rate (e.g., drug arm 6%)

— ARR = 8% − 6% = 2 percentage points

— RRR = 2/8 = 25%

— NNT = 1/0.02 = 50 patients treated for the trial duration to prevent one event

— Use ARR and NNT when counseling individual patients (transparent, scaled to baseline risk)

— Recognize RRR as the metric most often used in marketing and headlines because it appears larger

Key distinction: RRR is dimensionless and baseline-independent; ARR is anchored to baseline risk and shrinks proportionally as untreated risk drops. A trial in low-risk primary prevention can have impressive RRR but trivial ARR — this is the single most tested concept on Step 3 biostatistics items.

Typical question stem framings you must recognize instantly:
History clues in patient-facing vignettes:
Practical reading of a trial abstract:
Communication framing — pick the right metric:
Solid White Background
Physical Exam Findings (and Hemodynamic Assessment when relevant)

Absolute event rates in each arm (often buried in the results, not the abstract conclusion)

— Whether the headline result is RRR, ARR, hazard ratio, or odds ratio

Confidence intervals around the ARR — a CI crossing zero means non-significant

— Duration of follow-up — a "50% reduction" over 10 years is not the same as over 1 year

— Composite endpoints — an impressive RRR may be driven by a soft component (hospitalization) rather than mortality

— Apply validated calculators (ASCVD pooled cohort, CHA₂DS₂-VASc, FRAX, Gail) to derive this patient's baseline risk

— Multiply baseline risk × RRR to estimate personalized ARR

— Example: ASCVD 10-year risk 20%, statin RRR ~25% → personalized ARR ~5%, NNT ~20

— Example: ASCVD 10-year risk 5%, same RRR → personalized ARR 1.25%, NNT 80

— A large RRR with a tiny ARR is like a "high gradient, low flow" finding — looks dramatic, delivers little

— A modest RRR applied to high baseline risk produces large ARR and meaningful NNT

— Always check both numbers; one without the other is incomplete

— Only RRR reported, no event counts

— Subgroup-driven RRR after a null primary endpoint

— Surrogate endpoint masquerading as clinical benefit

— Short follow-up extrapolated to lifetime risk

Step 3 management: When a patient brings in a drug ad, your first move is to identify the metric used (almost always RRR), then translate it to ARR/NNT using the patient's individualized baseline risk before recommending or deferring therapy.

No physical exam applies to a statistical concept — instead, "examine" the trial and the way data are presented. Treat the trial abstract and the patient encounter as your exam.
Inspecting the trial — what to look for:
Inspecting the patient — baseline risk assessment:
Hemodynamic analogy — "pressure of evidence":
Red flags during your "exam" of the data:
Solid White Background
Diagnostic Workup — Initial Labs / Imaging / ECG / Biomarkers

```

Event (+) No Event (−)

Treatment (Exp) a b

Control c d

```

EER (experimental event rate) = a / (a + b)

CER (control event rate) = c / (c + d)

ARR = CER − EER

RRR = (CER − EER) / CER = 1 − RR

Relative Risk (RR) = EER / CER

NNT = 1 / ARR (round up to the next whole patient)

— Trial: secondary prevention statin

— CER = 20% (200/1000), EER = 15% (150/1000)

— ARR = 5% → NNT = 20

— RRR = 5/20 = 25%

— Clinically meaningful: treat 20 patients for trial duration, prevent 1 event

— Trial: primary prevention in low-risk adults

— CER = 2%, EER = 1.5%

— ARR = 0.5% → NNT = 200

— RRR = 25% (identical!)

— Marketing headline screams "25% reduction"; reality is one event prevented per 200 treated

Absolute Risk Increase (ARI) = adverse event rate in treated − in control

Relative Risk Increase (RRI) = ARI / control adverse event rate

NNH (number needed to harm) = 1 / ARI

— Compare NNT vs NNH to weigh net benefit

— Confusing RR (a ratio) with RRR (1 − RR)

— Forgetting to convert percentages to decimals before computing NNT

— Using odds ratio when relative risk is requested (OR ≈ RR only when outcome is rare, <10%)

Board pearl: If you remember only one formula, remember NNT = 1/ARR — Step 3 examiners love asking you to compute NNT directly from a 2×2 table or a pair of event rates.

"Labs" for ARR vs RRR = the 2×2 table. Master this and you can compute every measure asked on Step 3.
Core calculations:
Worked example — high baseline risk:
Worked example — low baseline risk (same RRR):
Harm side — mirror calculations:
Common arithmetic traps on Step 3:
Solid White Background
Diagnostic Workup — Advanced or Confirmatory Studies

Hazard ratio (HR): from time-to-event (Cox) analyses; interpreted similarly to RR but accounts for follow-up duration and censoring. HR of 0.75 ≈ 25% RRR over the study period.

Odds ratio (OR): from case-control studies and logistic regression; approximates RR only when the outcome is rare. Overstates effect size when outcomes are common.

Confidence intervals on ARR: if the 95% CI for ARR crosses 0, the result is not statistically significant. For RR/OR/HR, the no-effect value is 1, not 0.

— NNT is time-bound — always state "NNT = 50 over 5 years"

— NNT is population-specific — derived NNTs do not transfer cleanly to populations with different baseline risk

— NNT confidence intervals can be unstable when ARR is near zero (CI may cross infinity)

— When ARR is negative (treatment harms), 1/ARR becomes the NNH

— Patient's baseline 10-year risk = 15%

— Trial HR = 0.70 (30% RRR)

— Estimated treated risk ≈ 15% × 0.70 = 10.5%

— Personalized ARR ≈ 4.5%, NNT ≈ 22

— A 40% RRR in LDL lowering is not the same as a 40% RRR in MI

— Always confirm that ARR/RRR figures refer to patient-important outcomes (death, MI, stroke, hospitalization, quality of life) — not surrogates

— Pooled RRRs are reasonably transportable across populations

— Pooled ARRs are not — you must recompute ARR using your patient's baseline risk

Key distinction: RR, OR, and HR are all relative measures; only ARR (and its inverse, NNT) anchor a result to absolute, patient-level benefit. On Step 3, the "best answer" for patient counseling is almost always the absolute measure.

Beyond the basic 2×2 — measures that show up alongside ARR/RRR:
NNT pitfalls and refinements:
Translating a hazard ratio into ARR for a patient:
Surrogate vs. clinical endpoints:
Meta-analysis nuance:
Solid White Background
Risk Stratification or First-Line Management Logic

1. Identify baseline risk (CER). Is this primary or secondary prevention? High- or low-risk cohort?

2. Identify the relative effect (RR, RRR, HR). This is usually transportable across populations.

3. Compute the absolute effect (ARR, NNT) for the trial population.

4. Reapply to your patient. Use the patient's individualized baseline risk × RRR to estimate personalized ARR and NNT.

5. Weigh harms. Compute NNH from adverse event rates.

6. Net benefit decision. Is NNT < NNH for outcomes of comparable severity? Discuss with patient.

Statins for primary prevention: USPSTF recommends statins when ASCVD 10-year risk ≥ 10% and ≥ 1 risk factor — precisely because ARR becomes meaningful at this threshold despite stable RRR

Aspirin for primary prevention: USPSTF now recommends against routine initiation ≥ 60 years; low baseline risk in healthy adults makes ARR for MI smaller than ARI for bleeding (NNT > NNH)

Anticoagulation in AFib: CHA₂DS₂-VASc ≥ 2 (men) or ≥ 3 (women) ensures stroke ARR exceeds bleeding ARI from DOACs

Cancer screening: RRRs for mammography are stable, but ARRs are much larger in women 50–74 than in women 40–49 — driving USPSTF age thresholds

— Low baseline risk → huge NNT

— Comparable NNH for serious adverse events

— Limited life expectancy (patient won't live long enough to realize ARR)

— Patient preference after informed disclosure of ARR/NNT

Step 3 management: When asked whether to start a preventive medication, the correct logic is to multiply the patient's baseline risk by the trial RRR to estimate personalized ARR, then compare NNT to NNH — not to act on RRR alone.

Framework for evaluating any trial result on Step 3:
High-yield clinical applications of this framework:
When to defer therapy despite a "good" RRR:
Solid White Background
Pharmacotherapy — First-Line Drug Regimen

— Trial-level RRR for major vascular events: ~22% per 1 mmol/L LDL reduction (CTT meta-analysis)

— Secondary prevention (10-yr ASCVD ~30%): ARR ~6–7%, NNT ~15

— Primary prevention, ASCVD 10% (JUPITER-like): ARR ~1.5–2%, NNT ~50–60

— Primary prevention, ASCVD 5%: ARR ~1%, NNT ~100

— Same drug, same RRR, dramatically different ARR/NNT depending on whom you treat

— RRR for stroke vs placebo ~60–70%

— CHA₂DS₂-VASc 0: annual stroke risk ~0.2% → ARR negligible, NNT > 500 — anticoagulation not recommended

— CHA₂DS₂-VASc 4: annual risk ~4% → ARR ~2.5%, NNT ~40 — strongly favored

— NNH for major bleeding on DOAC ~100–200 — net benefit favorable at higher CHA₂DS₂-VASc

— RRR for nonfatal MI ~20–25%; RRR for major bleeding similar magnitude

— In low-risk adults, ARR for MI ≈ ARI for bleeding → net neutral or harmful

— USPSTF 2022: individualized decision ages 40–59 with ASCVD ≥10%; avoid initiation ≥ 60

— RRR for CV death or HF hospitalization ~25–30%

— Even in moderate baseline risk, ARR ~3–5% over 2–3 years → NNT 20–30 — clinically meaningful

— "Out of 100 people like you treated for 5 years, about [ARR×100] will avoid an event; the rest see no change."

— Avoid: "This drug cuts your risk in half" — accurate only relatively, misleading absolutely

Board pearl: The same drug can be strongly indicated (NNT 20) in a high-risk patient and net-harmful (NNT > NNH) in a low-risk patient — RRR is constant, ARR drives the decision.

Statins (atorvastatin, rosuvastatin) — canonical Step 3 ARR/RRR example:
Anticoagulants for AFib (apixaban, rivaroxaban, warfarin):
Aspirin for primary prevention:
SGLT2 inhibitors in HFrEF / CKD:
Counseling scripts (Step 3 favorite):
Solid White Background
Procedures / Revascularization / Invasive Management (or expanded pharmacology if non-procedural)

— Symptomatic ≥ 70% stenosis (NASCET): 5-yr stroke risk 26% medical vs 9% surgical → ARR 17%, NNT ~6

— Asymptomatic ≥ 60% stenosis (ACAS): 5-yr stroke risk 11% medical vs 5% surgical → ARR 6%, NNT ~17

— RRR similar (~50%), but symptomatic patients gain far more in absolute terms — driving stronger guideline endorsement

— No mortality RRR vs optimal medical therapy; modest symptom benefit

— ARR for death/MI essentially zero → PCI reserved for refractory symptoms or high-risk anatomy

— Classic Step 3 trap: a stem describing stable CAD with controlled symptoms — medical therapy is correct because invasive benefit ARR is null

— EF ≤ 35%, NYHA II–III: RRR for all-cause mortality ~23%; ARR ~7% over 5 yr; NNT ~14

— In patients with limited life expectancy < 1 year, ARR collapses → ICD not indicated

— RRR for diabetes progression vs medical therapy ~70%; ARR depends on baseline severity

— NNH (perioperative complications, nutritional deficiency) must be weighed

— Colonoscopy CRC mortality RRR ~30%; ARR ~0.3% over 10 yr → NNT ~300 — still cost-effective at population level, but transparency matters

— LDCT lung cancer mortality RRR ~20%; ARR ~0.3% in eligible heavy smokers → NNT ~320

CCS pearl: When a CCS-style case offers both invasive and medical options, choose the strategy with a published ARR/NNT advantage in patients matching the vignette's risk profile — not the one with the flashiest RRR headline.

Procedures are evaluated with the same ARR/RRR lens — and Step 3 loves contrasting medical vs invasive therapy using NNT.
Carotid endarterectomy (CEA) for asymptomatic vs symptomatic stenosis:
PCI for stable angina (COURAGE/ISCHEMIA):
ICD for primary prevention (MADIT-II, SCD-HeFT):
Bariatric surgery for diabetes remission:
Screening procedures (colonoscopy, low-dose CT lung):
Solid White Background
Special Populations — Elderly and Renal/Hepatic Impairment

— Baseline event risk is higher → ARR for effective therapies often larger (favoring treatment)

— BUT competing mortality from non-target diseases shortens the window to realize benefit → effective ARR shrinks

Lag-time-to-benefit concept: therapies with delayed benefit (e.g., statins, intensive glycemic control, cancer screening) require life expectancy > lag time for ARR to materialize

Statins in adults > 75 for primary prevention: USPSTF says insufficient evidence; ARR uncertain because trials underrepresent this group

Tight glycemic control (HbA1c < 7) in elderly with comorbidities: ARR for microvascular benefit takes 8–10 years; ARI for hypoglycemia is immediate → relax target to 7.5–8%

Mammography ≥ 75: USPSTF says insufficient evidence; ARR depends on life expectancy

Anticoagulation in AFib ≥ 80: ARR for stroke still favorable; bleeding ARI rises but DOACs preferred (lower NNH than warfarin)

— Higher baseline CV risk → larger ARR for many cardioprotective therapies

— BUT NNH for drug-specific harms also rises (e.g., metformin lactic acidosis at low GFR, dabigatran bleeding when CrCl < 30)

— SGLT2 inhibitors: renal and CV ARR substantial down to eGFR 20–25; do not stop based solely on declining eGFR

— Affects NNH more than NNT (drug accumulation, bleeding from coagulopathy)

— Statin NNT preserved; NNH for hepatotoxicity rises modestly — avoid in decompensated cirrhosis

— Each additional drug raises cumulative ARI for falls, delirium, bleeding

— Deprescribing logic: if individual NNT > life expectancy in years × 10, benefit is unlikely

Step 3 management: In elderly patients, estimate life expectancy and lag-time-to-benefit before initiating preventive therapy — if life expectancy is shorter than the lag, ARR approaches zero regardless of RRR.

Elderly patients — competing risks shrink ARR:
Specific elderly considerations:
Renal impairment:
Hepatic impairment:
Polypharmacy and additive NNH:
Solid White Background
Special Populations — Pregnancy, Pediatrics, or Other Demographic Subgroups

Aspirin 81 mg for preeclampsia prevention in high-risk women: RRR ~15–20%; baseline risk 15–25% → ARR 3–5%, NNT ~25 — USPSTF Grade A

In low-risk pregnancies: baseline preeclampsia risk ~3% → ARR < 1%, NNT > 100 → not recommended

— Same drug, same RRR, opposite recommendation based on ARR

— RRR for neonatal RDS ~50%; baseline RDS risk varies sharply with gestational age

— At 24–28 weeks: ARR very large, NNT < 10

— At 34–36 weeks: ARR small but still meaningful given outcome severity

— Most preventive trials use adult cohorts; extrapolation to children requires explicit baseline-risk recalculation

HPV vaccination: RRR for HPV-related cancers > 90%; ARR is lifetime, computed against population cancer incidence

Childhood obesity intervention: RRR for metabolic disease modest; ARR depends on family/genetic baseline risk

— Pooled cohort ASCVD calculator may overestimate risk in some populations and underestimate in others → personalized ARR estimates can be off

— Step 3 ethics overlap: present ARR using validated population-specific risk tools when available

— Curative-intent therapy ARR may be matched or exceeded by NNH for treatment-related morbidity

— Goals-of-care conversations should incorporate ARR for quality-of-life outcomes, not just survival

Key distinction: In pregnancy and pediatrics, outcome severity (neonatal death, lifelong disability) can justify intervention even at modest ARR — meaning a small NNT-equivalent benefit on a catastrophic outcome outweighs a larger NNT on a milder one. Always weigh ARR × outcome severity, not ARR alone.

Pregnancy — ARR/RRR shifts because baseline risks and outcome severity differ:
Antenatal corticosteroids for preterm delivery:
Pediatric considerations:
Sex-based and racial/ethnic subgroups:
Frailty and palliative populations:
Solid White Background
Complications and Adverse Outcomes

— Patients started on statins, aspirin, or PPI prophylaxis based on impressive RRR despite trivial ARR

— Cumulative polypharmacy, adverse drug events, cost, pill burden

— Classic case: aspirin for primary prevention in a low-risk 65-year-old → GI bleed (NNH realized) without preventing MI (NNT unreached)

— Patient or clinician sees "doubled risk of bleeding" on a DOAC and declines therapy

— In reality, baseline bleeding risk 1% → 2%: ARI 1%, NNH 100

— Stroke ARR from same DOAC may be 3% → net benefit favors treatment

— Studies show patients are far more likely to accept therapy when told RRR ("50% reduction") than when told ARR ("2% reduction") or NNT ("50 to prevent 1 event")

— Presenting only RRR is considered a violation of balanced informed consent

— High false-positive rates → biopsies, anxiety, overdiagnosis (NNH side)

— If screening ARR for mortality is 0.3% but overdiagnosis rate is 10%, net population harm may exceed benefit

— Multiple subgroup analyses inflate type I error; an impressive RRR in one subgroup may be spurious

— Always demand pre-specified subgroups and adjustment for multiple comparisons

— Composites can produce eye-catching RRR driven by the least important component (hospitalization)

— Examine ARR for each component separately

Board pearl: The single most tested "complication" on Step 3 biostat items is a patient harmed by being counseled with RRR alone — either by accepting a marginally beneficial therapy or refusing a strongly beneficial one. Always re-frame in ARR and NNT.

"Complications" of misusing ARR/RRR are clinical and ethical:
Overtreatment driven by RRR-only thinking:
Undertreatment driven by misreading ARI:
Framing bias and informed consent harms:
Screening harms when ARR is small:
Subgroup chasing — statistical "complication":
Composite endpoint inflation:
Solid White Background
When to Escalate Care — ICU, Consult, or Inpatient Triage

— Patient is on the fence about a preventive therapy

— Drug has significant cost, monitoring burden, or known adverse effects

— Baseline risk is borderline (e.g., ASCVD 7.5–10%, CHA₂DS₂-VASc 1)

— Trial reports only relative measures and a vendor/colleague is pushing adoption

— Trial uses a non-standard endpoint (composite, surrogate)

— Subgroup-driven conclusion after null primary endpoint

— Effect size depends heavily on per-protocol vs intention-to-treat analysis

— Number of events is small (wide CI on ARR, unstable NNT)

— Anticoagulation in high-bleeding-risk AFib patient → cardiology + pharmacy

— Statin intolerance in high-ASCVD-risk patient → lipid clinic

— Genetic testing for cancer risk → genetic counselor (translate RRR/ARR for variant-carriers)

— Formulary decisions and quality metrics should be benchmarked to NNT and cost-per-event-prevented, not RRR

— Value-based care contracts reward absolute outcome improvements — alignment with ARR

— Document that ARR and NNT were discussed, not just RRR

— Use decision aids (e.g., Mayo Clinic statin choice tool) that display ARR pictographically

— Pictographs (100-person icon arrays) reduce framing bias more than numerical RRR

— Therapies with overwhelming ARR (e.g., antibiotics for sepsis, anticoagulation for PE) where individualized NNT calculations are unnecessary

— Standard-of-care interventions with established population benefit

CCS pearl: On a CCS case, document shared decision-making with ARR/NNT when starting a preventive medication — this mirrors real-world quality measures and is the "right answer" framing for ambulatory Step 3 vignettes.

"Escalation" in the biostatistics frame = when to escalate the rigor of evidence appraisal:
Escalate to formal NNT/NNH analysis when:
Escalate to a biostatistician or guideline review when:
Consult specialty/pharmacy for individualized weighing:
System-level escalation:
Documentation and shared decision-making:
When NOT to escalate:
Solid White Background
Key Differentials — Same-Category Causes

— Ratio of risks: EER / CER

— From cohort studies and RCTs (incidence data available)

— RR = 1: no effect; RR < 1: protective; RR > 1: harmful

— RRR = 1 − RR (when RR < 1)

— Ratio of odds: (a/b) / (c/d) = ad/bc

— From case-control studies (incidence unknown) and logistic regression

— Approximates RR only when outcome is rare (<10%)

— Overstates effect magnitude when outcome is common — classic Step 3 trap

— From Cox proportional hazards models; ratio of instantaneous event rates

— Accounts for time-to-event and censoring

— Interpreted like RR but technically distinct; reported in survival/time-to-event trials (CV outcomes, cancer)

— Ratio of incidence rates (events per person-time)

— Used when follow-up varies between patients

— Case-control study → OR

— Cohort/RCT with binary outcome → RR or RRR

— Survival analysis with Kaplan-Meier → HR

— Vaccine trial → vaccine efficacy = RRR

— All relative measures require the baseline (control) risk to be converted to ARR

— ARR ≈ CER × (1 − RR) = CER × RRR

— Example: CER 10%, HR 0.80 → ARR ≈ 10% × 0.20 = 2%, NNT = 50

— Outcome incidence 40%, OR 0.5 → actual RR ≈ 0.67 (not 0.5)

— Reporting OR as if it were RR exaggerates RRR

Key distinction: RR, RRR, OR, and HR are all relative — none answers "how much does this help my patient?" Only ARR and NNT do. Step 3 questions asking for "the most appropriate measure to counsel a patient" almost always want ARR or NNT.

Differentiating among relative effect measures (the "same category"):
Relative Risk (RR):
Odds Ratio (OR):
Hazard Ratio (HR):
Incidence Rate Ratio (IRR):
Common stem distinguishers:
Converting to ARR:
Key trap: OR vs RR with common outcomes:
Solid White Background
Key Differentials — Other-Category Causes

— Synonyms in many textbooks; sometimes called risk difference

— "Attributable risk" sometimes refers to the proportion of disease in the exposed group attributable to the exposure (attributable risk percent = RRR in exposed)

— Excess risk in the whole population attributable to an exposure

— PAR = (overall risk in population) − (risk in unexposed)

— Used for public health priority-setting, not individual counseling

— High prevalence × modest RR can yield large PAR (e.g., HTN and stroke)

— Similar to NNT but for screening tests; usually much larger than NNT for therapies

— Mammography NNS ~1300 over 10 yr (ages 50–59) to prevent one breast cancer death

— Mirror of NNT for adverse outcomes; 1/ARI

— Critical for net-benefit calculations

Diagnostic, not therapeutic — different question entirely

— Confused with relative risk measures on hurried reads — they describe test performance, not treatment effect

— Diagnostic test metrics — not interchangeable with ARR/RRR

— Step 3 may pivot mid-question from diagnostic to therapeutic statistics; recognize the switch

— Build on ARR — incremental cost / incremental QALY gained

— Drugs with NNT 20 and modest cost dominate drugs with NNT 200 even at the same RRR

— US informal threshold ~$50,000–$150,000/QALY

— A "highly significant" p < 0.001 with ARR of 0.1% is clinically trivial

— Always separate statistical significance from clinical significance

Board pearl: When a stem says "the result was highly statistically significant," look for the effect size (ARR/NNT) — Step 3 examiners love questions where p is impressive but ARR is clinically negligible.

Other measures and concepts often confused with ARR/RRR — know them cold:
Attributable Risk (AR) = ARR:
Population Attributable Risk (PAR):
Number Needed to Screen (NNS):
Number Needed to Harm (NNH):
Likelihood Ratios (LR+, LR−):
Sensitivity/Specificity/PPV/NPV:
Cost-effectiveness ratios (ICER, $/QALY):
p-value vs effect size:
Solid White Background
Secondary Prevention / Discharge Medications / Long-Term Plan

High-intensity statin: RRR ~25%, secondary prevention ARR ~6%, NNT ~16 over 5 yr

Aspirin 81 mg: secondary prevention ARR ~2–3%, NNT ~30–50; benefit clearly outweighs bleeding NNH

Beta-blocker post-MI: ARR for mortality ~4% in first year, NNT ~25

ACEi/ARB if EF reduced or HTN/DM: ARR for HF hospitalization ~3–5%, NNT ~25

SGLT2 inhibitor in HF or CKD: ARR ~3–5% over 2–3 yr, NNT ~20–30

— Combine all four pillars → cumulative ARR far exceeds primary prevention

— Provoked VTE: 3-month course; extended therapy ARR for recurrence small after provocation resolves

— Unprovoked VTE: recurrence risk ~10%/yr off therapy; extended anticoagulation ARR ~8%, NNT ~12 over 1 yr

— Tight glycemic control (HbA1c < 7) ARR for microvascular complications ~3–4% over 10 yr in newly diagnosed T2DM (NNT ~25–35)

— In older patients with comorbidity, ARR shrinks; NNH for hypoglycemia rises

— Translate annual ARR to a memorable framing: "If you take this every day for the next 5 years, your chance of another heart attack drops from about 1 in 4 to about 1 in 6"

— Pictographs and risk calculators reduce non-adherence

— If a patient's life expectancy < lag-time-to-benefit, deprescribe

— Examples: bisphosphonates after 5 yr (drug holiday), statins in end-stage disease, intensive glycemic control in dementia

Step 3 management: At every transition of care, explicitly link each discharge medication to its ARR-justified indication — this prevents both over- and under-prescribing and is the implicit standard for ambulatory follow-up vignettes.

Applying ARR/RRR to discharge and long-term planning is the bread-and-butter of Step 3 ambulatory questions.
Post-MI / post-stroke discharge bundles — high baseline risk = large ARR:
Post-VTE anticoagulation:
Diabetes long-term plan:
Counseling for medication adherence:
Deprescribing logic at follow-up:
Solid White Background
Follow-Up, Monitoring Parameters, and Rehab/Counseling

— Statins: lipid panel at 6–8 weeks (titration); annual thereafter; ALT only if symptomatic

— Anticoagulation: CBC/renal function annually for DOACs (every 6 mo if CrCl 30–60); INR for warfarin

— Antihypertensives: BP recheck 4 weeks after initiation; sustained ARR for stroke realized over years

— SGLT2 inhibitors: renal function at 2–4 weeks (transient dip expected); monitor for UTI/euglycemic DKA

100-person icon arrays — visual pictographs showing absolute benefit

Mayo Clinic statin/anticoagulation decision aids

Personalized risk calculators (ASCVD, CHA₂DS₂-VASc, HAS-BLED, FRAX)

— Teach-back: ask patient to restate the absolute benefit in their own words

— Individual: did patient remain event-free, tolerate therapy, achieve surrogate targets?

— Population (quality metrics): MIPS/HEDIS measures use absolute outcome rates, not RRR

— Cardiac rehab post-MI: mortality RRR ~20%, ARR ~3%, NNT ~30 — comparable to many drugs

— Smoking cessation: 10-yr CV mortality halved (ARR depends on baseline); often the single highest-ARR intervention available

— Mediterranean diet (PREDIMED): RRR for major CV events ~30%, ARR ~1.6% over 5 yr in high-risk adults

— Adherence assessment before escalation

— Reassess baseline risk — has it changed (new diagnoses, new labs)?

— Recompute personalized ARR; deprescribe if no longer favorable

— Note "ARR/NNT discussed" in problem-based notes for preventive therapies

— Reassess at annual visits; preventive ARR estimates change as patient ages

CCS pearl: Order follow-up labs and visits timed to the therapy's expected benefit horizon — premature outcome assessment misses realized ARR, and delayed follow-up risks unmonitored NNH (e.g., bleeding, electrolyte disturbance).

Monitoring for realized benefit and harm — turning ARR/RRR into clinical follow-up:
Follow-up cadence anchored to expected benefit timeline:
Counseling tools that improve ARR/NNT communication:
Tracking outcomes — population vs individual:
Rehab and lifestyle ARR contributions:
When monitoring reveals therapy is not working:
Documentation pearls:
Solid White Background
Ethical, Legal, and Patient Safety Considerations

— Presenting only RRR ("50% reduction") without ARR has been argued to constitute incomplete disclosure — patients consent to therapy under inflated expectations

— Best practice: disclose baseline risk, RRR, ARR, and NNT alongside NNH

— Use decision aids/pictographs for therapies where the choice is preference-sensitive (anticoagulation, statins, cancer screening, prostate biopsy)

— Aspirin for primary prevention in low-risk elderly (ARRIVE, ASPREE): ARI for major bleeding ≥ ARR for MI → net harm

— Polypharmacy from RRR-driven prescribing increases falls, drug interactions, hospitalization

— Deprescribing as a safety intervention when NNT > life expectancy

— Drug ads, DTC marketing, and rep visits overwhelmingly cite RRR

— Physicians have an ethical obligation to translate to ARR before prescribing

— Sunshine Act (Open Payments) requires industry-physician payment disclosure

— Discharge medication lists frequently omit ARR rationale; receiving providers may discontinue effective therapy or continue therapy whose NNT/NNH balance has shifted

— Medication reconciliation should include indication and expected benefit horizon

— Public health surveillance uses PAR to prioritize interventions — large PAR with modest individual ARR can still justify population campaigns (e.g., sodium reduction, vaccination)

— Quality metrics (HEDIS, MIPS) reward absolute outcome improvements — aligned with ARR thinking

— Trials require clinical equipoise; once a large ARR is demonstrated in interim analyses, DSMBs may halt for efficacy or futility

— Reporting trials must include absolute event rates (CONSORT guidelines), not RRR alone

— Risk calculators derived from predominantly white cohorts may misestimate baseline risk in other populations → personalized ARR misestimated → inequitable prescribing

— Use validated, population-specific tools where available

Board pearl: A Step 3 ethics question may ask which counseling approach is most appropriate when a patient cites a drug ad — the correct answer always includes presenting ARR/NNT in addition to (or instead of) RRR to honor informed consent.

Informed consent and framing bias:
Patient safety — overtreatment harms:
Pharmaceutical marketing and conflicts of interest:
Transition-of-care risk:
Mandatory reporting and population-level statistics:
Research ethics — equipoise and ARR:
Health equity dimension:
Solid White Background
High-Yield Associations and Rapid-Fire Clinical Facts

Key distinction: RRR sells; ARR/NNT decide. Whenever a Step 3 stem quotes only a relative number, the right counseling answer involves computing or requesting the absolute number.

ARR = CER − EER. RRR = ARR / CER = 1 − RR. NNT = 1/ARR (round up).
RRR is baseline-independent; ARR shrinks as baseline risk falls. Same drug, same RRR, vastly different NNT across populations.
ARR and NNT are time-bound — always state the follow-up duration ("NNT 50 over 5 years").
NNH = 1/ARI. Net benefit favors therapy when NNT < NNH for outcomes of similar severity.
Aspirin primary prevention: ARR ≈ ARI; USPSTF recommends against routine initiation ≥ 60 yr.
Statins primary prevention: USPSTF Grade B at ASCVD 10-yr ≥ 10% with ≥ 1 risk factor — threshold where ARR becomes meaningful.
Anticoagulation in AFib: treat when CHA₂DS₂-VASc ≥ 2 (men) / ≥ 3 (women); below this, ARR < NNH.
Aspirin in pregnancy for preeclampsia: ARR 3–5% in high-risk women, NNT ~25 — USPSTF Grade A.
Lag-time-to-benefit: if patient's life expectancy < lag time, ARR ≈ 0 — deprescribe.
CEA symptomatic ≥ 70%: NNT 6 over 5 yr; asymptomatic ≥ 60%: NNT 17.
OR ≈ RR only when outcome <10%. Common outcomes → OR exaggerates effect.
HR ≈ RR for short follow-up; converted to ARR by multiplying baseline risk × (1−HR).
Statistical significance ≠ clinical significance. A p < 0.001 with ARR 0.1% is trivial.
Composite endpoints inflate RRR — examine ARR for each component.
Subgroup analyses inflate type I error — beware "impressive" RRR in unplanned subgroups.
Pictographs (100-person icon arrays) reduce framing bias better than numerical RRR.
Cardiac rehab NNT post-MI ~30 — comparable to many drugs; under-prescribed.
PAR (population attributable risk) drives public health priorities, not individual decisions.
Quality metrics (HEDIS/MIPS) reward absolute outcomes — aligned with ARR.
Solid White Background
Board Question Stem Patterns

— Stem: "In a trial of drug X, MI occurred in 4% of treated vs 8% of placebo. What is the NNT?"

— Approach: ARR = 8% − 4% = 4%; NNT = 1/0.04 = 25

— Distractors: 4 (forgot decimal), 50 (used wrong rate), 12.5 (used RRR)

— Stem: A patient sees a TV ad: "Drug Y reduces stroke by 60%!" What is the most appropriate next step in counseling?

— Best answer: Discuss ARR and NNT specific to her baseline risk; avoid RRR alone

— Distractors: Prescribe immediately; Refuse to discuss; Quote the RRR

— Stem: Two trials show identical 25% RRR; trial A in secondary prevention (CER 20%), trial B in primary prevention (CER 2%). Which population benefits more?

— Answer: Trial A (ARR 5%, NNT 20) >> Trial B (ARR 0.5%, NNT 200)

— Stem: Outcome occurs in 40% of unexposed, OR = 0.5. Estimate RR.

— Approach: OR overstates protection when outcome is common; RR ≈ 0.67, not 0.5

— Stem: New anticoagulant — stroke ARR 2%, major bleed ARI 1%. NNT vs NNH?

— NNT 50, NNH 100 → net benefit favors treatment (stroke and bleed of similar severity)

— Stem: 82-year-old with metastatic cancer and life expectancy 6 months — start statin for primary prevention?

— Answer: No — lag time exceeds life expectancy; ARR ≈ 0

— Stem: Drug Z lowers LDL by 30%. Should it be prescribed for CV prevention?

— Answer: Surrogate endpoint; require clinical-outcome ARR before adoption

— Stem: Pharma rep emphasizes 50% RRR; physician's obligation?

— Answer: Translate to ARR/NNT before patient counseling

Board pearl: Compute, don't intuit. Step 3 biostat items reward arithmetic and disciplined frame-translation — write out CER, EER, ARR, RRR, NNT each time rather than guessing.

Pattern 1 — Direct calculation from 2×2 or event rates:
Pattern 2 — Patient counseling and framing:
Pattern 3 — Comparing two trials with same RRR but different baseline risks:
Pattern 4 — OR vs RR with common outcome:
Pattern 5 — Net benefit weighing NNT vs NNH:
Pattern 6 — Lag-time-to-benefit / elderly:
Pattern 7 — Surrogate vs clinical endpoint:
Pattern 8 — Ethical framing / informed consent:
Solid White Background
One-Line Recap

Absolute risk reduction (CER − EER) and its inverse, NNT, are the only measures that tell a patient how much a therapy will actually help them, while relative risk reduction (1 − RR) stays constant across populations and inflates apparent benefit when baseline risk is low.

Board pearl: When in doubt on any Step 3 prevention or counseling question, anchor your answer to ARR and NNT applied to this patient's baseline risk — that single habit converts trial statistics into the correct clinical decision and is the discriminating move between average and high-performing test-takers.

Master formulas: ARR = CER − EER; RRR = ARR/CER = 1 − RR; NNT = 1/ARR (round up); NNH = 1/ARI.
Translate every RRR to a personalized ARR by multiplying the patient's baseline risk by the trial RRR — this is the single most-tested operation on Step 3 biostatistics.
Net benefit decision = compare NNT to NNH for outcomes of comparable severity, then layer life expectancy, lag-time-to-benefit, patient preference, and cost.
Ethical mandate: present ARR and NNT alongside RRR during informed consent; pictographs and decision aids reduce framing bias and align with USPSTF, AHA/ACC, and shared decision-making standards.
Solid White Background
bottom of page