Propensity Score Matching (PSM)

A principled way to simulate randomization when treatment is self-selected.

Perfect when treatment is self-selected but covariates explain the selection

What PSM does

Propensity Score Matching reconstructs the conditions of a randomized experiment by pairing treated and control units that share similar likelihoods of receiving the treatment.

This removes confounding caused by selection bias—as long as the variables driving selection are observed in your dataset.

Why matching works

In observational data, people choose treatments for reasons correlated with the outcome. PSM solves this by estimating the propensity score:

p(X) = P(T = 1 | X)

Where X represents observed covariates influencing treatment choice. Treated users are then matched to control users with similar p(X).

After matching, treated and control groups become comparable in terms of X—making the remaining differences in outcomes interpretable as causal.

Key assumptions

1. Conditional Independence (CIA)

All drivers of treatment selection must be in X. If something important is missing (motivation, preference, intent, skill, etc.), matching cannot fix it.

2. Overlap

Every treated unit must have at least one similar control. If no comparable control exists, PSM cannot create a valid match.

3. Stable Unit Treatment Value Assumption (SUTVA)

Treatment of one unit must not affect the outcome of another.

How matching is done in

implements only the matching settings that data scientists actually need:

Matching ratio

Replacement

Caliper

Distance metric

After matching

automatically computes:

Estimating the treatment effect

ATEmatched = mean(Ytreated) − mean(Ycontrol, matched)

The estimate is unbiased if covariates successfully explain selection.

When PSM is ideal

When PSM struggles

PSM inside

provides a streamlined interface for dataset mapping, matching setup, diagnostics, and effect estimation. The workflow ensures clean, interpretable causal effects while preventing the common pitfalls that plague PSM.