Patient Safety & Systems-Based Practice
Root cause analysis: methodology and application
— Mandated by The Joint Commission for all sentinel events; results submitted within 45 days of event recognition along with a corrective action plan ("RCA²").
— CMS Conditions of Participation also require hospitals to track adverse events and implement quality improvement (QAPI).
— Wrong-site/wrong-patient/wrong-procedure surgery
— Retained foreign object
— Suicide of an admitted/recently discharged patient (≤7 days)
— Hemolytic transfusion reaction from ABO mismatch
— Infant abduction or discharge to wrong family
— Severe maternal morbidity, unanticipated death of a full-term infant
— Fall with serious injury, medication error causing death
— "Unexpected outcome" reaching the patient
— "Near-miss with high potential for harm" (e.g., wrong unit of blood caught at bedside check)
— Cluster of similar errors signaling a systemic issue

— "The hospital quality committee convenes…"
— "A 45-day review is initiated…"
— "The risk management team requests a multidisciplinary meeting…"
— "What is the most appropriate next step in addressing this event?"
— Medication error: heparin overdose in NICU (look-alike vials), insulin/U-100 vs U-500 mix-up, methotrexate daily instead of weekly
— Wrong-site surgery: timeout skipped, site not marked, two patients with same last name
— Diagnostic error: critical lab not communicated, incidental nodule on CT lost to follow-up
— Handoff failure: post-op patient's anticoagulation not resumed after transfer; ED-to-floor handoff missed sepsis criteria
— Equipment failure: infusion pump free-flow, alarm fatigue leading to missed bradycardia
— Reconstruct a detailed timeline of the event ("what happened, when, in what order")
— Interview all involved staff individually, in a non-punitive, confidential setting
— Review the chart, MAR, EHR audit logs, device logs, policies in effect at the time
— Identify contributing factors across categories: people, environment, equipment, policies, communication, training, staffing
— Peer review evaluates individual practitioner competence (protected under state statutes).
— M&M is educational, case-based, often clinical reasoning–focused.
— RCA is systems-focused, multidisciplinary, and action-oriented — its deliverable is a corrective action plan with measurable outcomes, not a judgment of a person.

— Each slice of cheese = a layer of defense (timeout, double-check, barcode scan, pharmacist verification, alarm).
— Each hole = a weakness (fatigue, distraction, missing policy, poor UI design).
— Harm occurs when holes align across all layers.
— Active error = unsafe act at the sharp end by a frontline worker (nurse, resident, surgeon). Visible, immediate.
— Latent condition = blunt-end organizational decision (understaffing, poor scheduling, confusing label design, inadequate training) that lay dormant until triggered.
— RCA targets latent conditions because fixing them prevents recurrence; punishing an active error rarely does.
— Slip = right plan, wrong execution (grabbed wrong vial despite knowing correct one) — attention failure
— Lapse = forgot a step (skipped timeout) — memory failure
— Mistake = wrong plan based on wrong knowledge/rule (treated PE as MI) — knowledge failure
— Violation = deliberate deviation from protocol (workaround of barcode scanner)
— Cognitive load, fatigue, interruptions, alarm fatigue
— Poor ergonomics, look-alike/sound-alike (LASA) drugs
— Hierarchical culture suppressing speak-up
— Production pressure (OR turnover, ED boarding)

— Example: Patient received 10× heparin dose.
— Why? Nurse drew from wrong vial. Why? 10,000 U/mL and 1,000 U/mL stored adjacently. Why? Pharmacy stocks both concentrations on the unit. Why? No policy restricting high-concentration heparin to pharmacy only. Why? Formulary review never flagged pediatric LASA risk.
— Root cause = formulary/storage policy gap → fix by removing high-concentration heparin from unit stock (forcing function).
— People (training, fatigue, communication)
— Process (policy, workflow, handoffs)
— Equipment (devices, IT, alarms)
— Environment (lighting, noise, layout)
— Materials (drugs, supplies, labels)
— Management (leadership, staffing, culture)

— Team leader (often patient safety officer)
— Subject matter experts (physician, nurse, pharmacist relevant to event)
— Frontline staff representatives
— Patient/family representative when appropriate (increasingly emphasized)
— Human factors expert if available
— Exclude individuals directly involved in the event from the deliberative team (they are interviewed, not adjudicators)
— Initial event report: within 24–72 hours
— Full RCA and action plan: within 45 days of event awareness
— Strong (preferred): forcing functions (oxygen-only connectors), physical changes, removing the hazard, automation, simplification, standardization
— Intermediate: checklists, double-checks, redundancy, enhanced communication tools (SBAR), read-back
— Weak: education, training, new policy, warnings, signage, "be more careful"
— Specific intervention
— Responsible owner
— Deadline
— Measurable outcome metric
— Follow-up date to assess effectiveness

— Human error (inadvertent slip/lapse) → console the individual; fix the system
— At-risk behavior (drift from protocol, taking shortcuts perceived as low risk) → coach the individual; address why the workaround felt justified
— Reckless behavior (conscious disregard of substantial unjustifiable risk) → discipline the individual
— Sentinel event / Serious Safety Event (SSE): reached patient, caused serious harm or death → full RCA²
— Precursor safety event: reached patient, minimal/no harm → focused review
— Near-miss (close call): did not reach patient → aggregate review, still high-value learning opportunity
— Unsafe condition: latent hazard identified before any event (e.g., expired crash cart meds) → proactive correction

— Required ethically (AMA Code) and by many state laws and Joint Commission standards
— Content: the facts of what happened, expression of empathy/apology, what is being done to investigate, commitment to share findings, no speculation about cause or blame
— Who: attending physician of record, often with risk management and a patient relations representative
— CANDOR (Communication and Optimal Resolution) program — AHRQ-endorsed framework integrating disclosure, support, and early resolution
— Frontline staff → electronic safety event reporting system (anonymous if desired) → patient safety officer → risk management → quality committee → RCA team activation
— Non-punitive reporting culture is essential; punishing reporters drives errors underground
— Joint Commission: voluntary sentinel event reporting, but mandatory RCA submission if reviewed
— State health department: many states mandate reporting of specific adverse events (NQF "never events")
— CMS: hospital-acquired conditions, never events tied to nonpayment
— FDA MedWatch: device/drug-related events
— State medical/nursing boards: only if individual practitioner competence/impairment is the issue — not routine for system errors

— Specific (e.g., "remove 10,000 U/mL heparin vials from all non-pharmacy locations")
— Measurable (audit vial inventory monthly)
— Assignable (Director of Pharmacy owns)
— Realistic (achievable within current resources)
— Time-bound (complete within 30 days; reassess at 6 months)
— Eliminate the hazard: remove the dangerous drug concentration from stock
— Engineering controls / forcing functions: oxygen-specific connectors (DISS), one-way valves, hard stops in CPOE
— Substitution / simplification: premixed IV bags instead of bedside compounding
— Standardization: single-concentration heparin protocol institution-wide
— Automation with appropriate alerts: barcode medication administration (BCMA), smart pumps with dose limits
— Checklists and cognitive aids: WHO Surgical Safety Checklist, sepsis bundles
— Communication tools: SBAR, I-PASS handoff, read-back of verbal orders
— Education and policy (weakest — only as adjunct, never sole intervention)
— Universal Protocol / timeout (response to wrong-site surgery)
— Color-coded and dose-banded pediatric resuscitation (Broselow tape)
— Removal of concentrated electrolytes from nursing units
— Tall-man lettering (e.g., hydrOXYzine vs hydrALAZINE)
— Mandatory two-RN independent verification for high-alert medications (insulin, heparin, chemo)

— Fewer dedicated quality staff
— Diagnostic errors (missed/delayed diagnosis) dominate over procedural errors
— Communication failures across specialists, labs, and patients (results not conveyed)
— Medication reconciliation gaps at transitions
— Designate a safety champion (often a partner physician or office manager)
— Use abbreviated RCA (mini-RCA) for lower-severity events: timeline, Five Whys, action item, follow-up
— Leverage Patient Safety Organizations (PSOs) for shared learning and legal protection of analyses
— Participate in PCMH and MIPS quality reporting — both incentivize systematic error tracking
— Limited specialty backup → transfer-related errors common
— Telemedicine handoffs require explicit documentation protocols
— RCAs often regional/networked across systems
— Failure to order indicated test
— Failure to follow up abnormal result ("test result loop closure")
— Cognitive biases: anchoring, premature closure, availability bias
— Mitigation: closed-loop result reporting, automated EHR flags for unacknowledged critical results, structured diagnostic time-outs
— Disaggregate safety data by race, language, insurance status
— Adverse events disproportionately affect patients with limited English proficiency (LEP) — solution: mandatory professional interpreter use, not family members or ad hoc bilingual staff
— Health literacy gaps contribute to medication errors at discharge

— 10× errors from decimal point misplacement
— LASA concentrations (heparin, morphine, epinephrine)
— Corrective interventions: standardized concentrations, Broselow tape for emergency dosing, smart pumps with pediatric profiles, mandatory pharmacist verification of all pediatric chemo and high-alert drugs, weight in kilograms only in the EHR (banning pounds eliminates conversion errors)
— Maternal hemorrhage and severe preeclampsia are leading causes of preventable maternal death; bundles (AIM safety bundles) are RCA-derived
— Shoulder dystocia, fetal distress recognition, magnesium dosing errors
— TeamSTEPPS and simulation drills are evidence-based interventions following obstetric RCAs
— Maternal Mortality Review Committees perform state-level RCAs on every maternal death
— Preprocedure verification of patient, procedure, site, consent
— Site marking by the operating surgeon with patient awake when possible
— Time-out immediately before incision involving the entire team, with active verbal confirmation
— Wrong-site surgery is a "never event" — recurrence triggers mandatory RCA and CMS nonpayment

— Hindsight bias: in retrospect, the error looks "obvious," leading reviewers to blame the individual rather than appreciate the situational difficulty
— Premature closure on a single root cause: most events are multifactorial; stopping the Five Whys too early misses contributing factors
— Weak corrective actions: defaulting to "reeducate" or "create a policy" without forcing functions
— No follow-up measurement: action plan filed and forgotten; recurrence rate not tracked
— Blame culture leakage: even with stated Just Culture, frontline staff perceive RCA as punitive → underreporting
— Tokenism: patient/family representative invited but not heard; frontline staff outranked by administrators
— Recurrence of the same event within 12 months → signals weak or unsustained action
— Drop in safety event reporting volume after a punitive response → not improvement, but suppression
— Staff turnover following scapegoating → loss of institutional knowledge
— Programs: peer support (e.g., "RISE" at Johns Hopkins, "forYOU" at Missouri), EAP, leave of absence, removal from immediate clinical duties as needed
— Third victim = the institution's reputation and trust

— The Joint Commission Sentinel Event Database: voluntary submission of de-identified RCA findings contributes to national learning and Sentinel Event Alerts
— CMS: reports certain hospital-acquired conditions (HACs) and never events; nonpayment policy
— State health departments: many require mandatory reporting of NQF-defined serious reportable events ("never events") — e.g., wrong-site surgery, retained foreign object, ABO-incompatible transfusion
— State medical/nursing licensing boards: only when individual practitioner impairment, incompetence, or unprofessional conduct is identified (substance use, sexual misconduct, repeated reckless behavior)
— DEA: controlled substance diversion
— FDA MedWatch / ECRI: device or drug failures with broader population implications
— OSHA: occupational injury to staff (sharps, workplace violence)
— CDC NHSN: healthcare-associated infections
— Suspected criminal act (intentional harm, diversion, assault)
— Child or elder abuse identified during investigation (mandatory reporting)
— Patient abduction
— Not for ordinary medication errors or system failures — criminalizing error destroys safety culture
— Communicable diseases
— Gunshot/stab wounds
— Suspected child/elder/dependent adult abuse
— Impaired drivers (some states)
— Certain birth injuries and maternal deaths
— Events suggesting systemic, recurring patterns → board of trustees, C-suite involvement
— Events involving multiple departments → enterprise-wide RCA
— Events with media or public attention → coordinated communications, but never compromise patient confidentiality (HIPAA)

— Prospective — applied before implementing a new process or technology
— Systematically identifies how a process could fail, severity × probability × detectability = Risk Priority Number (RPN)
— Example: before rolling out a new chemotherapy ordering system, FMEA identifies failure modes (wrong drug, wrong dose, wrong patient) and builds in safeguards
— Mnemonic: FMEA = Future; RCA = Rear-view
— Iterative improvement methodology — small, rapid tests of change
— Used to implement corrective actions identified by RCA
— Plan the change → Do (small pilot) → Study (measure outcomes) → Act (adopt, adapt, or abandon)
— Step 3: PDSA is the execution engine that follows RCA's diagnosis
— Eliminates waste (muda) in healthcare processes
— Concepts: value stream mapping, kaizen (continuous improvement), gemba walks (go to where the work happens), 5S workplace organization
— Tackles inefficiency rather than discrete safety events
— Define, Measure, Analyze, Improve, Control
— Statistical, defect-reduction focus — aims for 3.4 defects per million opportunities
— Used for high-volume, measurable processes (lab turnaround, CLABSI rates)

— Evaluates individual practitioner competence and behavior
— Protected under state peer-review privilege statutes (variable)
— Outcomes: credentialing decisions, focused professional practice evaluation (FPPE), corrective action against the individual
— Not a systems analysis; not equivalent to RCA
— Educational, case-based review by clinical service
— Focuses on clinical decision-making and learning
— May identify cases that warrant escalation to formal RCA but is itself less structured and less systems-focused
— Traditionally has been blame-tinged; modern M&M increasingly adopts Just Culture framing
— Often used interchangeably with RCA, but technically the reporting/oversight process; RCA is the analytic methodology that fulfills the requirement
— QA: retrospective, threshold-based, "Is care meeting the standard?" — often punitive in tone
— QI: prospective, continuous, "How can we improve?" — non-punitive, systems-focused — RCA lives here
— Institutional function focused on liability and financial exposure
— Overlaps with patient safety but distinct goal — risk management may guide disclosure language and litigation strategy, while RCA team focuses on system fixes
— Both functions ideally collaborate but should not be conflated
— Used during active emergencies (mass casualty, disaster, IT outage)
— Different from RCA, which occurs after the event
— Separate track for reckless behavior or repeated violations
— Just Culture algorithm determines whether HR involvement is appropriate; the RCA itself is not a disciplinary venue
— Tool used within RCA and QA, not a methodology unto itself

— Preoccupation with failure: treat every near-miss as a teacher
— Reluctance to simplify: resist single-cause explanations
— Sensitivity to operations: leaders walk the front line (gemba)
— Commitment to resilience: ability to detect, contain, and recover from errors
— Deference to expertise: decisions made by those closest to the problem, regardless of hierarchy
— AHRQ Surveys on Patient Safety Culture (SOPS) — periodic staff survey assessing perceptions of teamwork, communication openness, non-punitive response
— Track reporting rates — rising rates reflect improving culture, not worsening safety
— Track recurrence of sentinel event categories
— Universal Protocol, time-outs, two-patient identifiers
— Barcode medication administration (BCMA)
— CPOE with clinical decision support
— Standardized handoff tools (I-PASS, SBAR)
— Medication reconciliation at every transition
— Daily safety huddles, leadership safety rounds
— TeamSTEPPS communication training
— Simulation training for low-frequency/high-risk events (massive transfusion, malignant hyperthermia, code blue)
— Discharge medication reconciliation
— Follow-up appointment scheduled before discharge
— Test result tracking with assigned ownership
— PCP notified of hospitalization within 48 hours
— Pending labs/imaging communicated explicitly
— Joint Commission National Patient Safety Goals (updated annually) — many derive from aggregated RCA findings
— CMS Hospital-Acquired Condition Reduction Program — financial penalties for high HAC rates
— IHI Triple Aim → Quadruple Aim (adding clinician well-being)

— Define a specific outcome metric for each corrective action (e.g., "100% pharmacist verification of pediatric high-alert orders within 30 days")
— Reassess at defined intervals: typically 30, 90, 180 days and 12 months
— If metric not met → reopen RCA; modify or strengthen intervention
— Report results to quality committee and board
— Real-time safety event reporting trends
— Trigger tools (IHI Global Trigger Tool) — sample chart review using predefined triggers (e.g., naloxone administered, abrupt medication stop, transfer to higher level of care) to detect unreported adverse events
— Patient experience and complaint data — often early signal of safety issues
— Aggregate RCAs by category (medication, falls, handoffs, diagnostic)
— Identify systemic themes warranting institution-wide initiatives
— Compare to national benchmarks (Leapfrog, NHSN)
— New-hire orientation includes Just Culture, error reporting, safety event system use
— Annual competency for high-alert medication administration
— Simulation training for rare emergencies
— Interprofessional team training (TeamSTEPPS)
— CRM (Crew Resource Management) principles from aviation — communication, leadership, situational awareness
— "Speak Up" campaigns (Joint Commission)
— Patient access to records (OpenNotes, Cures Act information blocking rule)
— Family on rounds
— Patient advisors on safety committees
— Burnout surveys (Maslach), turnover rates, near-miss reporter follow-up
— EAP utilization data (de-identified)

— Patients/families have a right to be informed of harmful errors affecting their care
— Disclosure should occur within 24 hours when feasible, performed by attending of record with risk management support
— Content: facts known, expression of empathy/apology, commitment to investigate and share findings, support resources
— Withholding disclosure to avoid litigation is unethical and, paradoxically, increases legal risk
— A patient sustains harm from a complication that was not discussed in the informed consent process. RCA may reveal the consent process is inadequate. Both disclosure of the event and revision of the institution's consent procedure are required. Failure to disclose because "it was a known complication" is incorrect — the patient still has a right to know what happened.
— Created Patient Safety Organizations (PSOs) — federally listed entities to which providers can submit safety data
— Provides federal privilege and confidentiality for "patient safety work product"
— Encourages candid analysis without fear of discovery in litigation
— Does not shield underlying medical records or facts of the event
— RCA discussions involve PHI — limit to minimum necessary, secure documents, de-identify when sharing externally (e.g., Joint Commission submission)
— Public/media communications must not disclose PHI even after a publicized event
— Suspected child, elder, or dependent adult abuse
— Gunshot/stab wounds
— Certain communicable diseases
— Impaired physician → state PHP and medical board where required
— Never events to state health department in mandatory-reporting states
— Discharge without medication reconciliation, follow-up appointment, or pending result tracking is a leading medicolegal vulnerability and frequent RCA finding. Closing the loop at every transition is both a safety and legal imperative.


— Cue: wrong-site surgery, fatal med error, retained foreign object
— Best answer: convene multidisciplinary root cause analysis and disclose to family
— Distractors: terminate involved clinician; file lawsuit; do nothing if "no harm intended"
— Cue: error involving look-alike vials, ambiguous labels, identical patient names
— Best answer: system/design issue (LASA storage, lack of barcode scanning, identical room numbering)
— Distractor: "individual carelessness"
— Best answer: forcing function or physical/process redesign (remove dangerous concentration from stock; install oxygen-specific connectors)
— Distractors: "in-service education," "create a policy," "remind staff to be careful"
— "Before launching a new infusion pump" → FMEA
— "After a fatal pump programming error" → RCA
— Honest error by competent nurse following protocol → console; analyze system, do not discipline
— Repeated, knowing bypass of barcode scanner → discipline appropriate
— Family asks what happened after an error → honest, empathic disclosure within 24 hours, including apology, regardless of litigation concerns
— System error → internal RCA, possibly state never-event reporting
— Impaired physician → state medical board / PHP
— Suspected child abuse identified during care → mandatory CPS report
— Caught before reaching patient → still report and analyze — do not dismiss
— Two options offered; pick the one higher on the intervention hierarchy (physical/forcing > checklist > education)
— Missed follow-up, dropped test result → answer involves closed-loop result tracking, medication reconciliation, scheduled follow-up before discharge
— Clinician distressed after event → peer support / EAP referral, not discipline, not "carry on as usual"

Root cause analysis is a structured, retrospective, non-punitive, multidisciplinary methodology that uncovers latent system failures behind sentinel events and translates them into strong, measurable corrective actions — embodying the principle that safe care is engineered, not exhorted.

