Skip to main content
Research

Practitioner
Research.

I run experiments for a living — and I treat the practice itself as a research subject. My work sits at the intersection of experimentation methodology, causal inference, behavioral economics, and applied AI, grounded in data from 280+ production experiments rather than lab conditions.

Read Lean Experiments → (opens contact form)

200+ issues · 12,000+ subscribers

Focus Areas

Research
Interests

Experimentation Methodology

How organizations should measure the true value of testing programs: win rate as a vanity metric, winner’s-curse correction, sequential testing tradeoffs, and long-term holdouts as program-level audits.

Causal Inference in Marketing

Geo-incrementality, matched-market designs, holdout experiments, and attribution beyond last-click — separating demand that marketing created from demand it merely captured.

Behavioral Economics

Which lab-validated effects survive contact with production traffic. Loss aversion, choice architecture, anchoring, and social proof — tested at scale rather than cited from a 1979 paper.

Applied AI for Growth

AI-assisted hypothesis generation, analysis acceleration, and personalization systems — and the governance layer they need: bias detection, guardrails, and transparency standards.

Working Papers

What I'm
Writing

Practitioner research drawn from a decade of production experimentation. Papers publish first as citable write-ups on this site and Lean Experiments, with PDF versions prepared for scholarly indexing.

In Progress Experimentation · Organizational Decision-Making

Win Rate Is a Vanity Metric: Toward Decision-Quality Measures for Experimentation Programs

Experimentation programs report win rates as their headline KPI, but win rate is trivially gamed by testing safe, small changes. This paper proposes a decision-quality framework — revenue per experiment, save rate, and learning velocity — drawn from operating a 100+ test/year enterprise program.

In Progress Knowledge Management · Experimentation

Experiment Repositories as Organizational Memory: Why Testing Programs Repeat Themselves

Most organizations lose the knowledge their experiments generate within two personnel changes. This paper examines experiment repositories as institutional memory infrastructure, and the incentive structures that determine whether teams contribute to them.

Planned Causal Inference · Measurement

Long-Term Holdouts and the Systematic Overstatement of Experimentation Program Value

Per-test impact models inflate program value through winner’s curse, novelty decay, and interaction effects. Drawing on holdout methodology popularized by Eppo and Statsig, this paper quantifies the gap between summed per-test readouts and holdout-measured cumulative lift.

Reading Notes

What I'm
Reading

Papers and technical writing I'm working through, with notes on how each one holds up against production data.

Lean Experiments Newsletter

Revenue Frameworks
for Growth Leaders

Every week: one experiment, one framework, one insight to make your marketing more evidence-based and your revenue more predictable.