Biostatistics & Population Health
Allocation concealment and randomization methods
— Suspect selection bias when allocation concealment is broken (e.g., envelopes that can be held up to the light, alternating assignment by day of week, assignment by birth date).
— Suspect confounding by indication in observational studies that lack randomization altogether.
— Suspect performance or detection bias when blinding (a separate concept) is absent after randomization.
— Baseline tables showing significant differences between arms in a small trial (chance imbalance, fixable with stratified or block randomization).
— Investigators who screened patients differently once they knew the next slot was "treatment."
— Trials using quasi-randomization (medical record number, alternating assignment, day of admission).

— Simple randomization: Coin flip, random number generator, or table of random numbers. Each patient has an independent probability of assignment. Risk: in small trials, groups may end up unequal in size or imbalanced on prognostic factors by chance.
— Block randomization: Patients assigned in small "blocks" (e.g., blocks of 4 or 6) to guarantee approximately equal group sizes throughout enrollment. Useful when interim analyses are planned or enrollment may stop early.
— Stratified randomization: Patients first divided into strata based on key prognostic variables (age, stage, center), then randomized within each stratum. Used to prevent baseline imbalance on known important confounders.
— Cluster randomization: Groups (clinics, hospitals, villages) rather than individuals are randomized. Used when the intervention is delivered at a group level (e.g., a hand-hygiene protocol across ICUs).
— "Randomized within each tumor stage" → stratified.
— "After every 4 patients, 2 had received drug and 2 placebo" → block of 4.
— "Each participating hospital was randomly assigned" → cluster.
— "Computer-generated sequence accessed only after consent" → adequate allocation concealment.

— Central randomization by telephone, web portal, or interactive voice response system (IVRS) controlled by an independent coordinating center.
— Sequentially numbered, opaque, sealed envelopes (SNOSE) opened only after the patient is enrolled and consented.
— Pharmacy-controlled randomization, in which the investigational pharmacy dispenses identical-appearing study drug per a sequence unknown to the clinician.
— Transparent or unsealed envelopes; envelopes opened before consent.
— Assignment by alternation, date of birth, medical record number, day of the week, or order of arrival.
— A list posted in a clinic where staff can see upcoming assignments.
— Investigator-held randomization list accessible at the time of enrollment.

— Step 1: Was a random sequence generated? Look for "computer-generated," "random number table," or "permuted blocks." If the stem says "alternating" or "by date," the sequence is not random.
— Step 2: Was the sequence concealed until allocation? Look for central/IVRS systems, pharmacy control, or SNOSE.
— Step 3: Was the baseline table balanced? Examine age, sex, disease severity, comorbidities across arms. Substantial imbalance in a large trial suggests a randomization failure or fraud; in a small trial, it may reflect chance.
— Step 4: Was the analysis intention-to-treat (ITT)? ITT preserves the benefit of randomization by analyzing patients in their originally assigned group regardless of crossover or non-adherence.
— Numbers screened, randomized, allocated, lost to follow-up, and analyzed. Asymmetric loss to follow-up between arms suggests attrition bias, which can undo the protection randomization provided.
— P-values on baseline characteristics that are not systematically small (some imbalance is expected).
— Reported method of sequence generation and concealment (Cochrane requires both for low risk).
— Pre-registered protocol (ClinicalTrials.gov) matching the published methods.

— Variable (mixed) block sizes (randomly choosing blocks of 2, 4, or 6).
— Maintaining blinding of block size to investigators.
— Requires adjustment for the intraclass correlation coefficient (ICC).
— Design effect = 1 + (m − 1) × ICC, where m is average cluster size. Multiply standard sample size by design effect.
— Risk of identification/recruitment bias if clusters are randomized first and patients recruited afterward by personnel who know cluster assignment.
— Response-adaptive randomization alters allocation ratios based on interim outcomes — efficient but raises ethical questions about equipoise and statistical issues with time-trend confounding.
— Platform trials (e.g., RECOVERY) randomize across multiple interventions simultaneously with shared controls.

— Large multicenter Phase III trial with thousands of patients → simple or block randomization is sufficient; chance imbalance is minimal at scale.
— Small trial (<200 patients) where age and stage strongly predict outcome → stratified block randomization by age and stage.
— Intervention delivered at the clinic or hospital level (e.g., a checklist, an EMR alert) → cluster randomization.
— Stable chronic condition with within-patient outcome assessment (e.g., crossover migraine prophylaxis) → crossover design with washout.
— Rare disease or limited eligible patients → consider n-of-1 trials, adaptive designs, or registry-based randomization.
— Switch from envelopes held by investigators to central web-based randomization.
— Add stratification by center in multicenter studies.
— Use variable block sizes to prevent prediction.
— Pre-register the protocol and analysis plan to prevent outcome switching.

— Validated statistical software (R, SAS, Stata) using a reproducible seed.
— Online randomization services (e.g., Sealed Envelope, Randomizer.org) with audit trails.
— Avoid: spreadsheets with manually typed sequences, dice/coin flips for large trials (not reproducible).
— IVRS/IWRS (Interactive Voice/Web Response Systems): Most rigorous for industry trials. Coordinator calls/clicks after enrollment, receives the assignment.
— Pharmacy-controlled dispensing: Drug arrives pre-labeled by kit number, blinding investigator to contents. Works well for drug trials but not procedural trials.
— SNOSE (Sequentially Numbered Opaque Sealed Envelopes): Acceptable for low-resource settings; must be opaque (often with carbon paper or aluminum foil lining), tamper-evident, and opened only after the patient signs consent.
— Gain more safety data on the new agent.
— Improve recruitment when patients prefer the experimental arm.
— Note: unequal ratios reduce statistical power slightly for the primary comparison.
— Maintain a randomization log with date/time stamps.
— Record any emergency unblinding events with rationale.
— DSMB reviews maintain integrity during the trial.

— The surgeon cannot be blinded to the procedure performed.
— Allocation concealment is still feasible by opening sealed envelopes or contacting central randomization after the patient is anesthetized and prepped, preventing pre-operative selection bias.
— Sham/placebo surgery (e.g., sham arthroscopy trials for knee OA) provides blinding of patient and assessors but raises ethical concerns about risk without benefit.
— Operator learning curves as confounders — early procedures may have worse outcomes regardless of randomization. Mitigated by requiring a minimum operator experience threshold.
— Sponsor blinding of outcome adjudicators via independent clinical events committees (CEC).
— Often unblindable; rely on objective outcomes and blinded outcome assessors.
— Cluster designs prevent contamination between treatment and control patients in the same clinic.
— Test interventions in real-world settings with broad eligibility, often using simple randomization and ITT analysis. Trade internal validity for external validity (generalizability).

— Stratified randomization on the 1–2 most prognostic baseline variables.
— Minimization as an alternative for balancing multiple covariates simultaneously.
— Pre-specified covariate adjustment in the primary analysis (ANCOVA), which improves power and addresses residual imbalance.
— Stratify by major effect modifiers (age category, eGFR category).
— Pre-specify subgroup analyses to assess effect heterogeneity; never let subgroup findings drive the primary conclusion.
— Test for interaction, don't just compare within-subgroup p-values.
— Frequently underrepresented in pivotal trials, limiting external validity.
— When randomized, stratify by age ≥ 75 or by frailty index to ensure balance.
— Often excluded from Phase III trials, so guideline extrapolation requires caution.
— When included, stratify by CKD stage to balance baseline risk.
— Post-marketing studies (Phase IV) and registry-based RCTs increasingly fill these evidence gaps.
— Acknowledge limited evidence.
— Use shared decision-making.
— Consider lower starting doses and closer monitoring.
— Consult specialty guidelines that may have already adjudicated extrapolation.

— When trials are conducted (e.g., COVID-19 vaccine trials, antihypertensive trials in pregnancy), they require enhanced ethics review and often use adaptive designs to minimize fetal exposure to inferior arms.
— Stratification by gestational age or trimester is common.
— Assent (child) + parental consent required.
— Often use age-stratified randomization to ensure developmental balance.
— Extrapolation studies (modeling adult data with pediatric PK confirmation) may substitute for full RCTs when ethical or feasibility constraints exist.
— Require additional IRB protections under 45 CFR 46 (Common Rule) subparts B, C, D.
— Randomization itself is not unethical, but inducement and coercion risks demand careful enrollment procedures.
— Patients must understand that treatment will be assigned by chance, not selected by their physician.
— A patient who refuses randomization may still receive standard care outside the trial.
— Some patients have a "preference effect" — performing better in the arm they prefer — which threatens trial validity. Zelen's design (randomize, then consent) addresses this but is controversial and rarely IRB-approved in the US.
— Verify the trial's IRB-approved inclusion of pregnant patients.
— Provide enhanced consent disclosing fetal risk uncertainty.
— Document shared decision-making.

— Empirical evidence: trials with inadequate concealment overestimate effects by 30–40% on average (Schulz et al., JAMA 1995).
— Adjustment can mitigate measured confounders but never unmeasured ones — this is precisely why randomization is so powerful when it works.

— Efficacy stopping: Crossing a stringent boundary (e.g., O'Brien-Fleming) at interim analysis suggests overwhelming benefit. Halt and offer the experimental arm to controls.
— Futility stopping: Conditional power below threshold suggests the trial cannot achieve a positive result.
— Safety stopping: Unacceptable adverse events in the experimental arm.
— Overestimation of effect size because trials that stop early often catch a random high point of the effect estimate.
— Reduced power for secondary outcomes and subgroups.
— Differential dropout exceeding 20% → escalate to DSMB review.
— Suspected fraud (e.g., implausibly balanced baseline tables, identical patient records) → notify IRB, sponsor, and potentially Office for Research Integrity (ORI).
— Major protocol deviations → amend protocol with IRB approval before continuing.
— Adaptive randomization changes (allocation ratio shifts) must be pre-specified.
— Sample size re-estimation must use blinded data when possible.

— Quasi-randomization (alternation, date of birth, MRN): Predictable next assignment → selection bias. Not acceptable as evidence equivalent to RCT.
— Historical controls: Compare current treated patients to past untreated patients. Confounded by temporal trends in care, diagnosis, and outcome measurement.
— Concurrent non-randomized controls (cohort study): Susceptible to confounding by indication — sicker patients may receive (or avoid) the treatment.
— Parallel-group RCT: Standard design; patients randomized once and followed in their assigned arm.
— Crossover RCT: Each patient receives both interventions in randomized order, separated by washout. Reduces between-patient variance but requires stable conditions.
— Factorial RCT: Two or more interventions randomized independently (e.g., 2×2 design testing aspirin and a statin simultaneously). Efficient but requires no significant interaction between interventions.
— Cluster RCT: Groups, not individuals, randomized.
— Stepped-wedge cluster RCT: All clusters eventually receive the intervention, with timing randomized. Useful when withholding the intervention from some clusters is unacceptable.

— Randomization: Generates the assignment sequence (a process producing groups balanced by chance).
— Allocation concealment: Hides the upcoming assignment from those enrolling patients (a process protecting the integrity of randomization).
— Blinding (masking): Hides the actual assignment from patients, providers, assessors, and analysts after enrollment (a process protecting against performance and detection bias).
— Single-blind: patient unaware.
— Double-blind: patient and provider unaware.
— Triple-blind: adds outcome assessors or data analysts.
— Modern reporting (CONSORT) prefers specifying who is blinded rather than counting blinds.
— ITT: Analyzes patients in originally assigned groups regardless of adherence. Preserves randomization. Tends toward null in efficacy trials and is the primary analysis for superiority trials.
— PP: Analyzes only adherent patients. May overestimate effect. Preferred as a sensitivity analysis or in non-inferiority trials alongside ITT.
— All require randomization; design differs in the hypothesis tested.
— Non-inferiority trials are particularly vulnerable to "bias toward equivalence" when adherence is poor or measurement is imprecise — ITT alone is insufficient; PP is co-primary.

— Use central randomization via IVRS/IWRS or pharmacy control.
— Stratify by center in multicenter trials and by 1–2 key prognostic variables.
— Use variable block sizes to prevent prediction.
— Pre-register the protocol on ClinicalTrials.gov with primary outcome and analysis plan.
— Plan ITT analysis as primary, with PP as sensitivity.
— Establish DSMB with pre-specified stopping rules.
— Follow CONSORT 2010 reporting standards, including the flow diagram and a statement on randomization method, concealment, and blinding.
— Disclose all funding sources and conflicts.
— Make individual patient-level data available where possible.
— Systematic reviewers use Cochrane RoB 2 or similar tools to grade randomization and concealment quality.
— Trials with high risk of bias from inadequate concealment are downgraded in GRADE assessments, reducing confidence in the pooled estimate.
— When choosing therapy, prefer interventions supported by multiple well-randomized trials with low risk of bias over those supported by observational data alone.
— Recognize that guideline strength correlates with evidence quality; a Class I, Level A recommendation rests on multiple high-quality RCTs.
— Treat single-trial findings with caution until replicated, especially if effect size is large.

— Read the Methods section first, not the abstract. Confirm randomization and concealment described before believing the result.
— Examine the CONSORT flow diagram for differential dropout.
— Check the baseline characteristics table for plausibility (not perfect balance — some variation is expected by chance).
— Verify primary outcome matches pre-registration; outcome switching is a red flag.
— Note whether subgroup findings are pre-specified or post hoc.
— Single trials can be overturned by replication or larger trials (e.g., the early hormone replacement therapy observational data overturned by WHI RCT).
— Network meta-analyses and living systematic reviews offer updated synthesis.
— Guidelines update cycles typically run 3–5 years; major trials may prompt rapid interim updates.
— Translate effect sizes into absolute risk reductions and numbers needed to treat — more intuitive than relative risks.
— Acknowledge uncertainty; discuss confidence intervals.
— Match the evidence's population to the patient's situation; flag extrapolation.
— Use tools like UpToDate, DynaMed, or guideline aggregators that grade evidence.
— Cross-check key claims against the primary trial when feasible.
— Assess the underlying study design (RCT vs observational).
— Evaluate randomization, concealment, and blinding.
— Consider replication status.
— Reframe in terms of absolute benefit vs harm for that patient.
— Avoid promising results not supported by rigorous evidence.

— That treatment will be assigned by chance.
— All foreseeable risks of both arms.
— The right to withdraw without penalty.
— Alternative treatments available outside the trial.
— Whether placebo will be used and what its implications are.
— Step 3 favorite: a patient who says "I'll join only if I get the new drug" cannot be enrolled — true randomization requires acceptance of either assignment.
— Serious adverse events to IRB and FDA per timelines (typically 7–15 days).
— Unanticipated problems posing risk to subjects to IRB promptly.
— Suspected research misconduct to institutional research integrity office and potentially federal ORI.
— Clear documentation of which arm they received once unblinded.
— Communication to the primary care physician about any ongoing monitoring needs.
— Continuation of effective therapy if assigned to the experimental arm and benefit is established post-trial.
— Safety monitoring for delayed adverse effects of the investigational agent.


— Answer: Selection bias from inadequate allocation concealment (this is quasi-randomization, with predictable next assignment).
— Answer: Adjust for the imbalanced covariate in the analysis (chance imbalance in a small trial is expected; the primary analysis remains valid, with covariate adjustment as planned sensitivity).
— Answer: Adjustment for intraclass correlation / use of design effect in sample size and analysis (cluster RCT requirement).
— Answer: Selection bias from inadequate allocation concealment — the open-label design is acceptable for surgery, but the posted list breaks concealment.
— Answer: Effect size is likely overestimated (truth inflation); the early stop may not reflect the true effect.
— Answer: Central randomization via web-based or telephone system (IWRS/IVRS).
— Answer: To ensure approximately equal group sizes throughout enrollment within each stratum.
— Answer: Explain that random assignment ensures groups are comparable so that the trial can determine which treatment truly works; patient choice would bias results.

Randomization balances confounders by chance, allocation concealment protects that randomization from selection bias at enrollment, and blinding protects against performance and detection bias after enrollment — together they form the methodologic backbone of a credible RCT.
— Randomization (simple, block, stratified, cluster, adaptive) generates the assignment sequence — its job is to balance both measured and unmeasured confounders across arms, a feat no observational method can match.
— Allocation concealment (central IVRS/IWRS, pharmacy control, SNOSE) hides the upcoming assignment from enrollers — its failure (alternation, posted lists, transparent envelopes, quasi-randomization by date/MRN) causes selection bias and inflates treatment effects by 30–40%.
— Blinding is a separate, post-randomization safeguard against performance and detection bias; an open-label trial can still have flawless allocation concealment, and a concealed trial can still fail to blind.
— ITT analysis preserves the benefit of randomization by analyzing patients in their assigned groups regardless of adherence — it is the primary analysis for superiority trials, with per-protocol as sensitivity.

