What is CUPED (Variance Reduction)? — Glossary

Atticus Li

← Glossary · Statistics & Methodology

CUPED (Variance Reduction)

Controlled-experiment Using Pre-Experiment Data — a variance reduction technique that uses pre-period covariates to shrink metric variance and boost test power.

What Is CUPED?

CUPED, introduced by Deng, Xu, Kohavi, and Walker at Microsoft in 2013, is a technique that removes predictable variance from experiment metrics by regressing out pre-period behavior. In plain terms: if a user spent a lot last month, they will probably spend a lot this month regardless of which variant they see — CUPED subtracts that predictable portion so the leftover variance is closer to "what the experiment actually caused." The result is often 30–50% variance reduction and proportionally shorter tests or smaller MDEs.

Also Known As

Data science: CUPED, variance reduction via covariates, regression adjustment
Growth: "the technique that cuts test time in half"
Marketing: pre-period control adjustment
Engineering: covariate-adjusted estimator

How It Works

You are testing a change to product recommendations. Primary metric is 28-day revenue per user. You compute each user's 28-day revenue before the test (the covariate X). Your adjusted metric is: Y_cuped = Y - theta * (X - mean(X)), where theta = cov(Y, X) / var(X). If pre-period and in-period revenue correlate at r = 0.7, variance drops by r^2 = 49%. A test that would have taken 8 weeks at 0.8 power now takes ~4 weeks.

CUPED requires the covariate be measured before randomization and be uncorrelated with treatment assignment (which is trivially true for pre-period data).

Best Practices

Use long pre-periods (28–56 days) for heavy-tailed metrics like revenue.
Re-estimate theta on each experiment — correlations drift.
Validate with A/A tests that CUPED doesn't introduce bias before rolling out.
Apply CUPED to continuous metrics first (revenue, sessions, time) where variance is the biggest problem.
Combine with stratification on high-signal categorical variables (plan tier, region) for additional reduction.

Common Mistakes

Using a covariate measured during the experiment. This invalidates randomization and biases the estimate.
Forgetting new users. New users have no pre-period data; handle them with a separate estimator or default-to-mean.
Applying CUPED to binary conversion when the covariate correlation is near zero. You add code complexity for no benefit.

Industry Context

In SaaS/B2B, CUPED on MRR or usage metrics is a force multiplier when user-level stickiness is high. In ecommerce, CUPED on GMV or AOV can cut test duration dramatically on sites with repeat buyers. In lead gen, CUPED applies less to acquisition (no pre-period for new leads) but shines on retention and expansion tests.

The Behavioral Science Connection

Humans see noise as signal — a bias that is formalized in the "hot hand" and gambler's fallacies. CUPED removes one major source of noise (between-user variability) so the remaining signal is more clearly attributable to the treatment. It is a statistical instantiation of the behavioral principle: reduce ambient noise to hear the thing that matters.

Key Takeaway

CUPED is one of the highest-leverage tools in the modern experimentation stack. If you run tests on continuous metrics with stable user-level behavior, not using CUPED is leaving weeks of test time on the table.

← Browse All Terms