Evidence-Based Medicine: Core Concepts and Study Designs

.title[
# Evidence-Based Medicine: Core Concepts and Study Designs
]
.author[
### Austin Meyer, MD, PhD, MS, MPH, MS, FAAP, etc.
]
.date[
### June 13, 2025
]

---

# Evidence-Based Medicine: Core Concepts and Study Designs

### A Lecture for Pediatric Residents

---

# Agenda: Full Lecture Overview

.pull-left[
**Part 1: Statistics & Measures (Approx. 30 minutes)**
1.  **Introduction to EBM** - 2 minutes
2.  **Basic Statistical Terms** - 5 minutes
3.  **Accuracy versus Precision** - 2 minutes
4.  **Statistical Inference & Distributions** - 6 minutes
5.  **Hypothesis Testing** - 5 minutes
6.  **Common Clinical/Epidemiological Metrics** - 8 minutes
7.  **Important Caveats** - 2 minutes
]

.pull-right[
**Part 2: Study Designs & Biases (Approx. 30 minutes)**
1.  **Review of Key Concepts** - 2 minutes
2.  **Types of Biases** - 8 minutes
3.  **Overview of Study Types** - 12 minutes
4.  **Limitations of Each Study Type** - 6 minutes
5.  **Conclusion & Q&A** - 2 minutes
]

---

# Basic Statistical Terms

---

# P-value: What It Actually Means

.pull-left[
**Definition:** Probability of observing results this extreme (or more extreme) **IF the null hypothesis is true**

**What p < 0.05 means:**
- IF there's truly no effect
- THEN there's < 5% chance of seeing this result
- We reject "no effect" hypothesis
]

.pull-right[
**Common Misconceptions:**
- ❌ 5% chance null hypothesis is true
- ❌ 95% chance alternative is true  
- ❌ Probability result was due to chance

**Correct Interpretation:**
- ✅ Evidence against null hypothesis
- ✅ Statistical significance at α = 0.05
]

---

# Confidence Interval (CI)

**95% CI Interpretation:**
- If we repeated the study many times
- 95% of calculated CIs would contain the true value
- Provides precision estimate
]

.pull-right[
**Relationship to P-value:**
- For differences: CI excluding 0 → p < 0.05
- For ratios: CI excluding 1 → p < 0.05
- Wider CI = less precision
- Narrower CI = more precision
]

---

# Efficacy vs Effectiveness

.pull-left[
### Efficacy
**"Can it work?"**
- Ideal conditions
- Controlled settings
- High adherence
- Selected patients
- RCT environment
]

.pull-right[
### Effectiveness  
**"Does it work in practice?"**
- Real-world conditions
- Routine clinical practice
- Variable adherence
- Diverse patients
- Practical constraints
]

---

# Accuracy and Precision

---

# Visual Reminder: Accuracy vs Precision

---

# Accuracy vs Precision Definitions

.pull-left[
**Accuracy**
- How close to the TRUE value
- Think: hitting the bullseye
- Threatened by **bias** (systematic error)
- Results close to reality
]

.pull-right[
**Precision**
- How consistent/reproducible
- Think: tight grouping of shots
- Threatened by **random error** (small sample)
- Narrow confidence intervals
]

---

# Statistical Inference & Distributions

---

# Distribution Properties Matter

.pull-left[
<img src="Ex_files/figure-html/normal-dist-1.png" width="100%" style="display: block; margin: auto;" />
]

.pull-right[
<img src="Ex_files/figure-html/t-dist-1.png" width="100%" style="display: block; margin: auto;" />
]

---

# Real Data Can Be Complex

.pull-left[
<img src="Ex_files/figure-html/unimodal-1.png" width="100%" style="display: block; margin: auto;" />

**Single peak** - homogeneous population
]

.pull-right[
<img src="Ex_files/figure-html/bimodal-1.png" width="100%" style="display: block; margin: auto;" />

**Two peaks** - distinct subgroups
]

---

# Skewed Distributions: Order Matters

.pull-left[
<img src="Ex_files/figure-html/right-skew-1.png" width="100%" style="display: block; margin: auto;" />
]

.pull-right[
<img src="Ex_files/figure-html/left-skew-1.png" width="100%" style="display: block; margin: auto;" />
]

---

# Quick Quiz: Skewed Distribution

A. mean, median, mode  
B. mode, mean, median  
C. median, mode, mean  
D. mode, median, mean  
E. mean, mode, median

**Remember:** Tail direction = skew direction
]

.pull-right[
<img src="Ex_files/figure-html/quiz-plot1-1.png" width="100%" style="display: block; margin: auto;" />
]

---

# Quick Quiz: Skewed Distribution

A. mean, median, mode  
B. mode, mean, median  
C. median, mode, mean  
D. **mode, median, mean**  
E. mean, mode, median

**Remember:** Tail direction = skew direction
]

.pull-right[
<img src="Ex_files/figure-html/quiz-plot2-1.png" width="100%" style="display: block; margin: auto;" />
]

---

# Hypothesis Testing

---

# Statistical Tests by Data Type

.pull-left[
**Categorical Data (Counts)**
- **Chi-square test:** 2+ categorical variables
- **Fisher exact test:** 2×2 tables, small cells
- *Example:* Vaccination status vs disease outcome

**Quantitative + Groups**
- **T-test:** Compare 2 group means
  - *Paired:* Before/after same subjects
  - *Independent:* Different groups
- **Mann-Whitney U:** Non-parametric alternative
- **ANOVA:** Compare 3+ group means
]

.pull-right[
**Ordinal Data (Ranked)**
- **Spearman correlation:** Monotonic relationships
- *Example:* Disease severity (mild/moderate/severe) vs pain score

**Two Quantitative Variables**
- **Pearson correlation:** Linear relationships
- *Example:* Age vs height in children
]

---

# Question

| | Clot | No Clot | Total |
|---|---:|---:|---:|
| **OCP Use** | 500 | 400 | 900 |
| **No OCP Use** | 80 | 20 | 100 |
| **Total** | 580 | 420 | 1000 |
]

A. Two sample T-test  
B. Analysis of variance  
C. Pearson correlation  
D. Chi-square test    
E. Spearman correlation
]

---

# Question

| | Clot | No Clot | Total |
|---|---:|---:|---:|
| **OCP Use** | 500 | 400 | 900 |
| **No OCP Use** | 80 | 20 | 100 |
| **Total** | 580 | 420 | 1000 |
]

A. Two sample T-test  
B. Analysis of variance  
C. Pearson correlation  
D. **Chi-square test** ✓  
E. Spearman correlation
]

**Why Chi-square?** Two categorical variables (OCP: Yes/No, Clot: Yes/No)

---

# Hypothesis Testing Framework

.pull-left[
**Null Hypothesis (H₀)**
- Default assumption
- "No effect," "no difference," "no relationship"
- What we test against

**Alternative Hypothesis (H₁)**
- What we want to demonstrate
- "There IS an effect/difference/relationship"
- Never "proven," only supported
]

.pull-right[
**Decision Rules**
- **Reject H₀ if p < 0.05** (typical)
- For ratios: 95% CI not including 1
- For differences: 95% CI not including 0

**Board Exam Tip:** Assume H₀ rejected when p < 0.05
]

---

# Four Possible Outcomes

.center[
| | H₀ True | H₀ False |
|---|---|---|
| **Reject H₀** | Type I Error (α) | Correct ✓ |
| **Fail to Reject H₀** | Correct ✓ | Type II Error (β) |
]

.pull-left[
**Type I Error (α = False Positive)**
- Conclude effect when none exists
- Usually set at 5%

**Power (1-β)**
- Probability of detecting true effect
- Increases with sample size
]

.pull-right[
**Type II Error (β = False Negative)**  
- Miss a real effect
- Often due to small sample

**Key Point:** Bigger sample = More power
]

---

# Clinical/Epidemiological Metrics

---

# Essential Metrics You Must Know

.pull-left[
### The Big Four
1. **Odds Ratio (OR)**
2. **Risk Ratio (RR)**  
3. **Risk Difference (RD)**
4. **Number Needed to Treat (NNT)**
]

.pull-right[
### When to Use Each
- **OR:** Case-control studies, rare diseases
- **RR:** Cohort studies, RCTs  
- **RD:** When absolute risk matters
- **NNT:** Clinical decision-making
]

---

# Sample Data for Calculations

---

# Standard 2×2 Table Format

| | Disease | No Disease | Total |
|---|---:|---:|---:|
| **Exposed** | 30 (a) | 70 (b) | 100 (a+b) |
| **Not Exposed** | 20 (c) | 80 (d) | 100 (c+d) |
| **Total** | 50 (a+c) | 150 (b+d) | 200 (N) |

**Cell Labels:** Use a, b, c, d for formulas

---

# Understanding Odds

**Exposed Group:**
- P(Disease) = 30/100 = 0.3
- P(No Disease) = 70/100 = 0.7  
- **Odds = 30/70 = 0.43**
]

.pull-right[
**Non-Exposed Group:**
- P(Disease) = 20/100 = 0.2
- P(No Disease) = 80/100 = 0.8
- **Odds = 20/80 = 0.25**

**Shortcut:** Odds = a/b and c/d
]

---

# Odds Ratio (OR) Calculation

.pull-left[
**Formula:**
`$$OR = \frac{\text{Odds}_{\text{exposed}}}{\text{Odds}_{\text{unexposed}}} = \frac{a \times d}{b \times c}$$`

**Our Data:**
`$$OR = \frac{30 \times 80}{70 \times 20} = \frac{2400}{1400} = 1.71$$`
]

.pull-right[
**95% Confidence Interval:**
`$$e^{\ln(OR) \pm 1.96\sqrt{\frac{1}{a}+\frac{1}{b}+\frac{1}{c}+\frac{1}{d}}}$$`

**Result:** (0.89, 3.29)

**Interpretation:** Exposed group has 1.71 times the odds of disease
]

---

# Risk Ratio (RR) Calculation

.pull-left[
**Formula:**
`$$RR = \frac{\text{Risk}_{\text{exposed}}}{\text{Risk}_{\text{unexposed}}}$$`

**Calculations:**
- Risk_exposed = 30/100 = 0.3
- Risk_unexposed = 20/100 = 0.2  
- **RR = 0.3/0.2 = 1.5**
]

**Interpretation:** Exposed group has 1.5 times the risk

**Note:** Only use with cohort studies/RCTs!
]

---

# Risk Difference & NNT

.pull-left[
**Risk Difference (RD):**
`$$RD = \text{Risk}_{\text{exposed}} - \text{Risk}_{\text{unexposed}}$$`
`$$RD = 0.3 - 0.2 = 0.1$$`

**Interpretation:** 10% absolute increase in risk
]

**Interpretation:** For every 10 people exposed, 1 additional person gets disease

**Clinical Relevance:** Practical impact measure
]

---

# Interpretation Guide

.pull-left[
**Odds Ratio (OR)**
- OR = 1: No association
- OR > 1: Increased odds 
- OR < 1: Decreased odds (protective)

**Risk Ratio (RR)** 
- RR = 1: No association
- RR > 1: Increased risk
- RR < 1: Decreased risk (protective)
]

.pull-right[
**Risk Difference (RD)**
- RD = 0: No difference
- RD > 0: Increased absolute risk
- RD < 0: Decreased absolute risk

**Statistical Significance**
- **For ratios:** 95% CI excludes 1
- **For differences:** 95% CI excludes 0
]

---

# Important Caveats

---

# Critical Points to Remember

.pull-left[
**When CI Includes 1 (for ratios):**
- Result NOT statistically significant
- "No effect" is plausible
- Be cautious interpreting

**Risk Ratios vs Odds Ratios:**
- RR only from population samples
- OR from case-control studies
- OR approximates RR for rare diseases
]

.pull-right[
**Subgroup Analysis Warning:**
- Seems clinically relevant
- **Major source of false positives**
- Better for hypothesis generation
- Avoid definitive conclusions

**Multiple Testing Problem:**
- 20 tests → expect 1 false positive
- α = 0.05 means 5% false positive rate
]

---

# Multiple Testing Reality Check

Table: Example: Multiple Comparisons Problem

|Variable  |P_value |Significant |
|:---------|:-------|:-----------|
|Age       |0.042*  |Yes         |
|Gender    |0.156   |No          |
|BMI       |0.678   |No          |
|Smoking   |0.023*  |Yes         |
|Exercise  |0.891   |No          |
|Diet      |0.034*  |Yes         |
|Sleep     |0.456   |No          |
|Stress    |0.012*  |Yes         |
|Education |0.789   |No          |
|Income    |0.067   |No          |
|Location  |0.445   |No          |
|Season    |0.028*  |Yes         |
]
]

.pull-right[
**The Problem:**
- 12 tests performed
- 5 "significant" results  
- Expected by chance: 12 × 0.05 ~ 1

**Reality Check:**
- Most are likely false positives due to nearness to 0.05
- Need correction for multiple testing
- Better for hypothesis generation
]

---

# Part 2: Study Designs & Biases

---

# Key Concepts Review

.pull-left[
**Internal Validity**
- Results true for study participants
- Free from bias and confounding  
- Cause-and-effect relationships
- High control = high internal validity
]

.pull-right[
**External Validity (Generalizability)**
- Results apply to other populations
- Relevant to YOUR patients
- Real-world applicability
- Often trade-off with internal validity
]

---

# Types of Biases

---

# Design & Hidden Variable Biases

.pull-left[
**Selection Bias**
- Non-random group assignment
- Systematic baseline differences
- *Example:* Healthier patients get new treatment
- *Solution:* Randomization

**Observer-Expectancy Bias**  
- Researcher expectations influence observations
- *Example:* Doctor expects drug to work
- *Solution:* Blinding
]

.pull-right[
**Effect Modification**
- Effect varies by third variable
- *Example:* Drug works in boys, not girls
- *Solution:* Stratified analysis

**Confounding**
- Third variable causes apparent association
- *Example:* Coffee → cancer (but smoking is real cause)
- *Solutions:* Randomization, regression analysis
]

---

# Information (Measurement) Biases

.pull-left[
**Recall Bias**
- Diseased patients remember exposures better
- *Example:* Birth defect mothers recall medications
- *Solutions:* Objective data, medical records

**Procedure Bias**
- Systematic differences in data collection
- *Example:* More time spent with treatment group
- *Solutions:* Standardized protocols, blinding
]

.pull-right[
**Instrument Bias**
- Faulty or inconsistent measurement tools
- *Example:* Miscalibrated BP cuff, unclear survey
- *Solutions:* Calibration, validation, reliability testing
]

---

# Time & Completion Biases

.pull-left[
**Lead-Time Bias**
- Earlier detection ≠ longer survival
- *Example:* Screening finds cancer 5 years earlier, same death age
- *Solution:* Measure from symptom onset

**Attrition Bias**
- Systematic withdrawal related to outcome
- *Example:* Side effects cause dropout
- *Solutions:* Intention-to-treat analysis
]

.pull-right[
**Loss-to-Follow-Up**
- Participants don't return for visits
- Can be random or systematic
- *Solutions:* Good contact maintenance, incentives
- **Concern if >20% loss**
]

---

# The pyramid of evidence is a hierarchy

---

# Experimental Trials

---

# Randomized control trial is in the name

---

# Randomized control - gold standard

- This is widely considered the gold standard for clinical evidence
--
 
- Question: __Primary__ purpose of randomization?
--

- Answer: To eliminate __selection bias__
  - Selection bias (at the time of randomization) is eliminated if randomization is technically correct
--

- Question: Secondary goal of randomization?
--

- Answer: To help with confounding
 - Confounders are not necessarily eliminated even with perfect technical execution
--
 
- Can use relative risk because investigator knows prevalence of disease and prior exposures

---

# Randomized control - limitations

- Can still technically suffer from selection bias (as defined by epidemiologists). Why?
--
 
 - Loss to follow up
--
 
- They may or may not externalize well
--
 
- There are may ethical concerns in pediatrics
--
 
- They are ridiculously expensive

---

# Crossover trial - groups switch

- This post hoc analysis is overly simplified for real life
 
- This understanding is sufficient for boards
 
- Includes initial randomization

- Also, confounders reduced because a patient can serve as their own control

---

# Observational Studies

---

# Prospective cohorts - into the future

---

# Retrospective cohorts - from the past

---

# Cohorts form the next level of evidence
 
--

- Can still use relative risk because investigator knows prevalence of exposure and disease
- Subjects vary by exposure status
- Can calculate incidence
--
 
- __Selection bias__ is the biggest problem
 - Investigator has infinite control over inclusion
--
 
- Other Prospective biases
 - Attrition - people leave the trial intentionally
 - Loss-to-follow up - people just stop showing up
 - Confounding - baseline characteristics or ongoing characteristics are difference
 - Hawthorne - people act differently once observed
--
 
- Retrospective bias
 - Information bias

---

# Case-control trials measure chance of exposure given disease

---

# Case-control forms the next level down from cohorts
--

- Must use odds ratio because investigator does not know prevalence of disease
--
 
- Subjects grouped by cases and controls
 - Measure __odds of exposure__ in case and control groups
--
 
- Significantly improved power and decreased resource requirements compared to cohorts 
 - Due to cases being selected at out set
--
 
- __Selection and Recall biases__ are the biggest problem
 - Selecting appropriate controls is __highly__ non-trivial
 - Sick people remember exposures (e.g. Melanoma patients stew about their sunburns)
--
 
- Also common
 - Information biases
--
 
- __Cannot calculate incidence or prevalence__

---

# Cross-sectional trials measure exposure and disease simultaneously

---

# Cross-sectional study - next level

- __Quick, cheap, and easy__
 - Typically this is a starting point
--
 
- Can establish prevalence of disease
--
 
- Must use chi-squared or correlation for statistical test
--
 
- Subjects can be grouped by exposure and diease in to the 2x2 contingency
--
 
- __Cannot establish causation__
--
 
- Cannot calculate risk metrics
 
---

# Questions?

### Thank you for your attention!

**Remember:** Evidence-based medicine combines the best research evidence with clinical expertise and patient values.

---