Skip to main content
← Glossary · Statistics & Methodology

Beta Distribution (for Conversions)

A probability distribution on [0,1] that is the conjugate prior for binomial conversion data — the workhorse distribution of Bayesian A/B testing.

What Is the Beta Distribution (for Conversions)?

The Beta distribution is the default prior and posterior for a conversion rate. It has two parameters — alpha (successes + 1) and beta (failures + 1) — and lives between 0 and 1. Because it is conjugate to the Binomial likelihood, observing k conversions in n trials updates your prior Beta(a,b) to a posterior Beta(a+k, b+n-k). No simulation needed; the math is closed-form.

Also Known As

  • Data science: Beta-Binomial model, conjugate prior for binomial
  • Growth: conversion rate posterior
  • Marketing: probability distribution over rates
  • Engineering: Beta(α, β) distribution

How It Works

Your baseline is 4%. A Beta(40, 960) prior reflects that belief with the strength of 1,000 observations. Run a test with 5,000 visitors and 210 conversions. Posterior is Beta(250, 4750), mean = 0.05, 95% credible interval ≈ [4.4%, 5.6%]. Compare to variant B posterior Beta(260, 4740), mean = 0.052. Sample 10,000 draws from each; proportion B > A gives the probability the variant wins.

The Beta distribution elegantly handles small sample regimes: at n = 100, the posterior is still a proper distribution with quantified uncertainty, unlike the normal approximation, which misbehaves at low n.

Best Practices

  • Use Beta(1,1) (uniform) as an uninformative prior unless you have justified historical data.
  • Match prior strength to sample size — a Beta(100, 100) prior dominates results from the first 1,000 users.
  • Report credible intervals alongside the posterior mean. A 5.2% mean with CI [3%, 8%] is very different from CI [5%, 5.4%].
  • Plot posteriors at readouts. Visualizing two overlapping Beta distributions is the most interpretable readout format for non-statisticians.
  • Update continuously without correction — Bayesian posteriors remain valid under peeking when decision rules are calibrated properly.

Common Mistakes

  • Using a strongly informative prior you can't justify. This pre-biases the test toward your prior belief.
  • Confusing credible interval with confidence interval. Credible = "95% probability the true rate is in this range given data." Confidence = frequentist coverage property.
  • Applying Beta to non-binary metrics like revenue per user. Beta is for proportions; revenue needs different priors (log-normal, gamma).

Industry Context

In SaaS/B2B, Beta-Binomial models for conversion are the simplest Bayesian testing setup and often sufficient. In ecommerce, Beta for conversion plus log-normal or Gamma for AOV gives a joint revenue-per-visitor posterior. In lead gen, Beta on fill rate plus downstream qualification rates lets you construct a full pipeline posterior from first principles.

The Behavioral Science Connection

The Beta distribution makes uncertainty tangible. Early in a test, it is wide and flat — visually honest about how little we know. As data arrives, it narrows. This is the right mental model: belief should be proportional to evidence, and the Beta posterior makes that proportionality literal.

Key Takeaway

For any conversion-rate test, the Beta distribution is the right mathematical object. It makes Bayesian analysis tractable, interpretable, and visually honest.