Research — Experimentation, Causal Inference & Behavioral Economics

Atticus Li

Research

Practitioner Research.

I run experiments for a living — and I treat the practice itself as a research subject. My work sits at the intersection of experimentation methodology, causal inference, behavioral economics, and applied AI, grounded in project-specific evidence including 100+ in-house NRG experiments in 2025.

Read Lean Experiments →

Free weekly field notes

Focus Areas

Research Interests

Experimentation Methodology

How organizations should measure the true value of testing programs: win rate as a vanity metric, winner’s-curse correction, sequential testing tradeoffs, and long-term holdouts as program-level audits.

Causal Inference in Marketing

Geo-incrementality, matched-market designs, holdout experiments, and attribution beyond last-click — separating demand that marketing created from demand it merely captured.

Behavioral Economics

When loss aversion, choice architecture, anchoring, and social proof provide useful hypotheses—and what production evidence is needed before treating a mechanism as relevant.

Applied AI for Growth

AI-assisted hypothesis generation, analysis acceleration, and personalization systems — and the governance layer they need: bias detection, guardrails, and transparency standards.

Working Papers

What I'm Writing

Practitioner research drawn from production experimentation. Papers publish first as citable write-ups on this site and Lean Experiments, with PDF versions prepared for scholarly indexing.

In Progress: Experimentation · Organizational Decision-Making

Win Rate Is a Vanity Metric: Toward Decision-Quality Measures for Experimentation Programs

Experimentation programs report win rates as their headline KPI, but win rate is trivially gamed by testing safe, small changes. This paper proposes a decision-quality framework — revenue per experiment, save rate, and learning velocity — drawn from operating a 100+ test/year enterprise program.

In Progress: Knowledge Management · Experimentation

Experiment Repositories as Organizational Memory: Why Testing Programs Repeat Themselves

Experiment knowledge can disappear when ownership changes or readouts are difficult to find. This paper examines repositories as institutional-memory infrastructure and the incentives that determine whether teams contribute to them.

Planned: Causal Inference · Measurement

Long-Term Holdouts and the Systematic Overstatement of Experimentation Program Value

Winner’s curse, novelty decay, and interaction effects can make summed per-test impact differ from a program-level holdout estimate. This planned paper will examine that gap, the assumptions behind each method, and the conditions under which either estimate is decision-useful.
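As a rough illustration of the winner's-curse part of that gap only, here is a minimal simulation sketch. It assumes every test has the same true lift and the same standard error, and that a test ships only when its observed z-score clears a threshold. The function name `simulate_program` and all parameter values are hypothetical and chosen for illustration; novelty decay and interaction effects are not modeled.

```python
import random

def simulate_program(n_tests=2000, true_lift=0.5, standard_error=1.0,
                     z_crit=1.96, seed=7):
    """Compare summed per-test readouts with the true total for shipped tests.

    Assumption (illustrative): all tests share one true lift and one
    standard error, and a test ships only if observed / SE > z_crit.
    """
    rng = random.Random(seed)
    shipped = []
    for _ in range(n_tests):
        # Observed lift = true lift + sampling noise.
        observed = rng.gauss(true_lift, standard_error)
        # Selection step: only "significant winners" are shipped.
        if observed / standard_error > z_crit:
            shipped.append(observed)
    summed_readouts = sum(shipped)          # what per-test reporting adds up to
    true_total = true_lift * len(shipped)   # what was actually delivered
    return summed_readouts, true_total, len(shipped)

summed, true_total, n_shipped = simulate_program()
print(f"shipped {n_shipped} tests; "
      f"summed readouts {summed:.1f} vs true total {true_total:.1f}")
```

Because only estimates above the threshold survive selection, the summed readouts exceed the true total in this setup. A program-level holdout targets the second quantity, which is one reason the two estimates can diverge.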

Reading Notes

What I'm Reading

Papers and technical writing I'm working through, with notes on how each one holds up against production data.

Eppo Engineering Blog

Now Live: Holdouts — Measuring the Cumulative Impact of Experimentation

An example of productized program-level measurement. Holdouts can complement individual readouts by estimating cumulative program effects, which speaks to a question finance leaders may ask before renewing investment: what the program delivered as a whole.

Kohavi, Tang & Xu

Trustworthy Online Controlled Experiments: A Practical Guide to A/B Testing

A core reference for experimentation at scale. The chapters on Twyman’s Law and sample-ratio mismatch shaped a QA habit: investigate instrumentation and data quality before treating a surprising result as a breakthrough.
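One common form of that QA step is a sample-ratio mismatch check. The sketch below is one possible version, not the book's code: a chi-square goodness-of-fit test on a two-arm split using only the standard library. The names `srm_p_value` and `has_srm`, the 50/50 default split, and the 0.001 alert threshold are assumptions made for illustration.

```python
import math

def srm_p_value(control_n, treatment_n, expected_control_share=0.5):
    """Chi-square goodness-of-fit p-value (1 degree of freedom) for a two-arm split."""
    total = control_n + treatment_n
    expected_control = total * expected_control_share
    expected_treatment = total - expected_control
    chi2 = ((control_n - expected_control) ** 2 / expected_control
            + (treatment_n - expected_treatment) ** 2 / expected_treatment)
    # With 1 degree of freedom, the chi-square survival function
    # equals erfc(sqrt(chi2 / 2)).
    return math.erfc(math.sqrt(chi2 / 2))

def has_srm(control_n, treatment_n, expected_control_share=0.5, alpha=0.001):
    """Flag a likely sample-ratio mismatch (alpha is an assumed threshold)."""
    return srm_p_value(control_n, treatment_n, expected_control_share) < alpha

# Hypothetical counts: a 50,000 vs 51,500 split under an intended 50/50 design.
print(has_srm(50_000, 51_500))
```

A flagged split suggests checking assignment, logging, and filtering before interpreting the metric results, in the spirit of Twyman's Law.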

Kahneman & Tversky

Prospect Theory: An Analysis of Decision under Risk

A foundational model of decision under risk. In applied CRO, loss framing is a mechanism worth testing—not a universal winner—and production context determines whether the effect appears or matters.

Go Deeper

Where the Research Comes From

Methodology

The Method

The experimentation operating system this research is built on — PRISM, the idea-to-investment pipeline, and honest impact measurement.

See the method →

Evidence

Case Studies

The production programs behind the data: $30M+ recorded in internal program reporting at NRG Energy, and geo-measurement at Silicon Valley Bank.

See the work →

Practitioner Evidence

Experiment Evidence Library

Portfolio syntheses, repeated patterns, and copy-ready evidence packs for CRO teams deciding what to test next.

Search the evidence →

Open Questions

Ideas Lab

Product and research concepts I'm exploring — including AI-driven experimentation intelligence built on contributed test data.

See the ideas →

Lean Experiments Newsletter

Revenue Frameworks for Growth Leaders

Every week: one experiment, one framework, one insight to make your marketing more evidence-based and your revenue more predictable.

Browse issues

Read the archive

A growing archive of experiments, frameworks, and field reports from inside a Fortune 150 growth team.

Open Substack
