Prospect Theory: The Behavioral-Economics Framework That Actually Replicates (Anti-Example)

Atticus Li

← The Replication Crisis · replication-crisis

Prospect Theory: The Behavioral-Economics Framework That Actually Replicates (Anti-Example)

Most behavioral-science findings in this hub did not survive scrutiny. Prospect theory did. Across 47 years, multiple paradigms, and dozens of countries, Kahneman and Tversky's 1979 framework remains the gold standard for what robust behavioral economics actually looks like — and the benchmark every other "behavioral" claim should be measured against.

By Atticus Li May 25, 2026 37 min read

Most behavioral-science findings in this hub did not survive scrutiny. Prospect theory did. Across 47 years, multiple paradigms, and dozens of countries, Kahneman and Tversky’s 1979 framework remains the gold standard for what robust behavioral economics actually looks like --- and the benchmark every other “behavioral” claim should be measured against.

If you have been reading through this hub, you have watched a long parade of canonical behavioral findings get dismantled. Ego depletion collapsed under Hagger 2016. Power posing failed and was recanted by one of its own authors. Money priming evaporated in preregistered replications. The marshmallow test shrank dramatically once SES was controlled for. The bystander effect’s Kitty Genovese mythology turned out to be a journalist’s invention. The grit construct shrank to near-conscientiousness redundancy. Stapel, Hauser, LaCour, Wansink --- the parade of fraud cases, on top of the parade of replication failures, has done real damage to the credibility of the entire field.

A rational reader by now might conclude that behavioral economics as a whole is unreliable. That conclusion would be wrong, and this article is the second of two pieces in this hub that exist specifically to make the case for why.

The first anti-example was the default effect --- the empirical observation that whichever option a chooser has been pre-assigned tends to be the option they end up with, at rates far higher than active preference would predict. That finding survived publication-bias correction, cross-national replication, and field-experiment scrutiny in domains as different as retirement savings, organ donation, and energy provider choice. It is the most reliable nudge in the behavioral toolkit.

This article is about something larger and more foundational. Not a single empirical regularity, but an entire formal framework: prospect theory, the model of decision under risk proposed by Daniel Kahneman and Amos Tversky in a 1979 paper in Econometrica that has, in the four decades since, become the most-cited paper ever published in that journal and one of the most-cited works in all of social science. The framework was extended in a 1992 follow-up paper that fixed the original specification’s mathematical limitations, has been the subject of a comprehensive 2010 textbook treatment by Peter Wakker, and was the central work cited in Kahneman’s 2002 Nobel Prize in Economics.

Prospect theory is not just another behavioral finding that happened to hold up. It is the framework that defined what behavioral economics could be when done with the same mathematical seriousness as the expected-utility theory it replaced. It is the gold standard. And the calibration this article is meant to deliver is: when someone pitches you a “behavioral economics” claim, prospect theory is the benchmark you should be comparing it against. If the claim engages with prospect theory’s formalism, the four-fold pattern of risk attitudes, the reference-point structure, and the probability-weighting function, it is operating in the serious tradition. If it does not --- if it is “loss aversion means losses hurt twice as much as gains” and nothing more --- it is the watered-down folk-psychology version, and it deserves the calibration that the loss-aversion article in this hub already gives it.

Here is what holds up, what is contested, and what this anti-example tells us about what robust behavioral economics actually looks like.

What Kahneman and Tversky 1979 Proposed

The founding paper is Kahneman, D., & Tversky, A. (1979). “Prospect theory: An analysis of decision under risk.” Econometrica, 47(2), 263—291. DOI: 10.2307/1914185.

Before prospect theory, the dominant economic model of how people make decisions under risk was expected utility theory, formalized by von Neumann and Morgenstern in 1944 and extended by Savage in 1954. Expected utility theory said: agents evaluate a risky prospect by computing the probability-weighted average of the utilities of its possible outcomes, where utility is a function of total final wealth. Under a small number of axioms --- completeness, transitivity, continuity, independence --- this is the uniquely rational way to make decisions under risk, and the theory was treated as both a positive and a normative model of behavior.

Kahneman and Tversky’s paper began with a catalogue of systematic violations of expected utility theory that had been documented in lab experiments going back to Maurice Allais in the 1950s. The Allais paradox showed that subjects’ choices between specific risky prospects violated the independence axiom. The “common ratio effect” showed that subjects’ risk preferences shifted predictably with the absolute size of probabilities. The “reflection effect” showed that subjects switched from risk-averse to risk-seeking when the same prospect was reframed in terms of losses rather than gains. The “isolation effect” showed that subjects evaluated identical compound prospects differently depending on how the components were presented.

None of this was new in 1979. What was new was that Kahneman and Tversky proposed a single formal alternative model that organized all of these phenomena under a coherent mathematical structure with three core components.

The value function. Where expected utility theory treated utility as a function of total final wealth, prospect theory proposed that people evaluate outcomes relative to a reference point --- typically the current state --- and that gains and losses are measured as deviations from that reference. The reference point is the kink in the value function. Above it, gains have diminishing marginal value (concave function). Below it, losses have diminishing marginal pain (convex function). And critically, the function is steeper for losses than for gains --- meaning the same dollar amount, framed as a loss, weighs more in the chooser’s decision than when framed as a gain. The kink at the reference point, and the asymmetry of slopes around it, is what generates loss aversion as a formal property of the model.

The probability weighting function. Where expected utility theory weighted outcomes by their objective probabilities, prospect theory proposed that people weight outcomes by a non-linear transformation of those probabilities. The transformation has a characteristic inverse-S shape: small probabilities are systematically overweighted, while medium-to-large probabilities are systematically underweighted. This is the part of the model that explains why the same people both buy lottery tickets (overweighting small probabilities of large gains) and buy insurance (overweighting small probabilities of large losses) --- behaviors that expected utility theory could only explain through awkward auxiliary assumptions about wealth-dependent risk preferences.

Editing operations. Prospect theory proposed that before evaluating a prospect, choosers perform a series of cognitive “editing” operations: combining or segregating outcomes, cancelling common components across prospects, simplifying probabilities by rounding, detecting dominance. These editing operations explain a number of framing effects that pure value-function-plus-probability-weighting could not handle.

The combination of these three components --- reference-dependent value function with kink and curvature, non-linear probability weighting, and pre-evaluation editing --- produced a formal model that could organize the Allais paradox, the reflection effect, the common ratio effect, and a half-dozen other documented violations of expected utility theory under a single framework. That is what made the paper a landmark. It was not just a critique of expected utility theory. It was a constructive replacement that could be calibrated against data and used to generate falsifiable predictions.

The paper’s original empirical evidence was based on small-sample hypothetical-choice experiments with university students --- the methodology of the era. The original specification also had some technical problems, as Kahneman and Tversky themselves acknowledged: the editing operations were somewhat ad hoc, and the model did not always satisfy stochastic dominance (a basic rationality condition that says if prospect A is at least as good as prospect B in every state of the world, then A should be at least as preferred as B). Those technical problems would be fixed in the 1992 follow-up. But the qualitative framework --- reference-dependence, loss aversion, diminishing sensitivity, probability weighting --- was already there in 1979, and it has held up.

The Four-Fold Pattern of Risk Attitudes

The most robust empirical prediction of prospect theory is what Tversky and Kahneman called the four-fold pattern of risk attitudes, and it is worth stating precisely because the popular caricatures of behavioral economics usually butcher it.

Expected utility theory predicts that an agent’s risk attitude --- aversion or seeking --- is a property of the agent, not of the prospect. If you are risk-averse over modest stakes, you should be risk-averse in every modest-stakes decision, regardless of how the decision is framed.

Prospect theory predicts the opposite. Risk attitudes flip systematically across four domains, depending on two variables: whether outcomes are gains or losses (relative to the reference point), and whether the probability of the non-zero outcome is large or small. The four cells are:

Risk-averse in the gains domain at large probabilities. Given a choice between a sure $500 and a 95% chance of $526 (same expected value), most people take the sure $500. This is the standard textbook risk aversion that expected utility theory was built to capture. Prospect theory predicts it via the concavity of the value function in the gains domain.

Risk-seeking in the losses domain at large probabilities. Given a choice between a sure loss of $500 and a 95% chance of losing $526 (same expected value), most people take the gamble. They prefer the small chance of avoiding the loss entirely, even though it carries a larger possible loss. This is the reflection effect, and expected utility theory cannot account for it without contradicting the previous prediction. Prospect theory predicts it via the convexity of the value function in the losses domain.

Risk-seeking in the gains domain at small probabilities. Given a choice between a sure $5 and a 0.1% chance of $5,000 (same expected value), many people take the gamble. This is the lottery-ticket behavior. Expected utility theory cannot easily explain it without assuming wealth-state-dependent preferences. Prospect theory predicts it via probability weighting --- the 0.1% probability is psychologically overweighted, making the lottery feel more attractive than its expected value would suggest.

Risk-averse in the losses domain at small probabilities. Given a choice between a sure loss of $5 and a 0.1% chance of losing $5,000 (same expected value), many people take the sure loss. This is the insurance-buying behavior. Same mechanism: the 0.1% probability of the large loss is psychologically overweighted, making the certain small loss feel worth the protection.

The four-fold pattern is not a vague qualitative observation. It is a sharp prediction that any single-utility-function model of risk attitude cannot accommodate without parameter-bending that itself violates the model’s spirit. Prospect theory predicts the pattern as a structural consequence of two independent components (value function curvature and probability weighting), and the pattern has been documented in dozens of replications across many cultures and decision domains. It is the prediction that most cleanly distinguishes prospect theory from expected utility theory, and it is the prediction that has held up best.

A reader who wants to test whether someone pitching a “behavioral economics” insight actually understands prospect theory can ask them to explain the four-fold pattern. If they can, they are operating in the serious tradition. If they cannot --- if their framework is just “people are loss-averse” --- they are working with a watered-down version that misses most of what prospect theory actually predicts.

The 1992 Cumulative Extension

The original 1979 specification had a couple of technical problems, the most serious of which was that it could violate stochastic dominance. If you constructed a prospect where outcome A weakly dominated outcome B in every state, the original prospect theory could in some parameterizations recommend B --- a clear violation of basic rationality.

Tversky and Kahneman addressed this in Tversky, A., & Kahneman, D. (1992). “Advances in prospect theory: Cumulative representation of uncertainty.” Journal of Risk and Uncertainty, 5(4), 297—323. DOI: 10.1007/BF00122574.

The 1992 paper introduced cumulative prospect theory (CPT). The key technical change was that probability weighting was applied to the cumulative probability distribution rather than to the individual probabilities of outcomes. This is the same mathematical move that Quiggin had made in 1982 for his rank-dependent utility model on the gains side; the 1992 paper extended it to both gains and losses, with separate weighting functions for each domain. The resulting model satisfies stochastic dominance, accommodates arbitrary numbers of outcomes, and applies to both risk (known probabilities) and uncertainty (unknown probabilities).

The 1992 paper also fit a specific parametric form to choice data from 25 graduate students at Berkeley and Stanford. This is the source of the famous “loss-aversion coefficient of 2.25” --- the parameter λ in the canonical Tversky-Kahneman calibration, indicating that losses were weighted about 2.25 times as heavily as equivalent gains in their sample. The coefficient escaped into popular usage and got summarized as “losses loom twice as large as gains,” which is the version most readers have encountered.

The popular version is more confident than the underlying evidence warrants. The 2.25 figure comes from a small sample of elite-university graduate students making hypothetical choices about modest monetary stakes. Subsequent large-sample work has produced loss-aversion coefficients ranging from roughly 1.4 to 2.5, with substantial heterogeneity across populations, stake sizes, and elicitation methods. This is the subject of the loss aversion article in this hub, which goes into the contestation in detail. The short version: the qualitative claim that losses are weighted more heavily than equivalent gains holds up across most studies; the specific 2-to-1 universality claim does not.

But the framework of cumulative prospect theory --- value function with reference point, loss aversion as a structural feature of the value function’s kink and slope asymmetry, probability weighting with overweighting of small probabilities, and the four-fold pattern as a derived prediction --- has held up. It is the framework that all subsequent behavioral-decision-theory research has been calibrated against, and the framework that the Wakker 2010 textbook treatment uses as its central organizing structure for the entire field of decision under risk and ambiguity.

What Has Replicated

The strongest synthesis of what holds up empirically is Colin Camerer’s chapter “Prospect Theory in the Wild: Evidence from the Field,” in Kahneman and Tversky’s 2000 edited volume Choices, Values, and Frames (Cambridge University Press). Camerer reviewed evidence from field settings --- not lab experiments --- in which prospect theory’s predictions could be tested against real-world high-stakes choices.

The chapter walks through a series of natural-experimental tests across financial markets, taxi-driver labor supply, horse-race betting, insurance purchase patterns, and consumer choices in dozens of other domains. In essentially every domain Camerer examined, the qualitative predictions of prospect theory --- reference-dependence, asymmetric reaction to gains and losses, the four-fold pattern --- showed up in real-world data. The magnitudes were sometimes contested and sometimes attenuated relative to lab parameters, but the patterns were there. The framework was making predictions that mattered for real markets and real money, and the predictions were borne out.

The other major piece of large-sample validation is Booij, A. S., van Praag, B. M., & van de Kuilen, G. (2010). “A parametric analysis of prospect theory’s functionals for the general population.” Theory and Decision, 68(1), 115—148. DOI: 10.1007/s11238-009-9144-4.

Booij et al. ran a structured experiment to estimate prospect theory’s full set of parameters --- the value function curvature in gains and losses, the loss-aversion coefficient, and the probability weighting function in both domains --- on a representative sample of N = 1,935 drawn from the Dutch general population. This is one of the largest and most representative empirical calibrations of prospect theory ever attempted, and it is the kind of test that the original 1979 and 1992 calibrations (small undergraduate or graduate-student samples) could not provide.

The results substantially confirmed the structure of the framework: utility was concave for gains and (more weakly) convex for losses; the probability weighting function had the predicted inverse-S shape in both domains; the estimated loss-aversion coefficient was 1.6, lower than the original 2.25 but still showing the predicted asymmetry. The Booij et al. paper is one of the most important external validations of the prospect-theory framework, because it shows the structure holds up when you take it out of the elite-university undergraduate lab and run it on a representative population sample.

The third major validation is the formal-theoretical body of work that has built decision theory out of prospect theory’s empirical foundations. The benchmark synthesis here is Wakker, P. P. (2010). Prospect Theory: For Risk and Ambiguity. Cambridge University Press. Wakker’s 503-page textbook is a comprehensive treatment of prospect theory as a mathematical model, with full axiomatic derivations, comparison to expected utility and rank-dependent utility models, treatment of both risk (known probabilities) and ambiguity (unknown probabilities), and explicit derivations of the parameter-estimation methodology that subsequent empirical work has used. The existence of a textbook of this depth and rigor is itself an indicator of prospect theory’s status as a serious mathematical framework, not just an empirical regularity. There is no analogous textbook for power posing or ego depletion, because those findings did not survive long enough or develop deep enough to support one.

The most important balanced review of what holds up is Barberis, N. C. (2013). “Thirty years of prospect theory in economics: A review and assessment.” Journal of Economic Perspectives, 27(1), 173—196. DOI: 10.1257/jep.27.1.173.

Barberis’s review is the right place to go for an honest assessment. He concludes that prospect theory is “the best available description of how people evaluate risk in experimental settings” and surveys its applications to financial markets, insurance, betting markets, and consumer choice. He also identifies the practical challenges of applying it in field settings --- particularly the question of how to specify the reference point in real-world decisions, which the framework leaves open and which different applications have resolved in different ways. The review is one of the clearest descriptions of what is robust and what remains contested, written for an economist audience.

The overall pattern: the qualitative predictions of prospect theory have replicated robustly across studies, paradigms, populations, and decision domains. The specific parametric estimates have varied substantially, with no universal coefficient values. The framework as a structure for organizing risk behavior has held up. The framework as a precise quantitative predictor with universal parameters has not, and was never claimed to.

What Is Contested

It is worth being precise about what remains contested, because the popular framing of prospect theory typically blurs the framework’s robust core with the more contested specific claims.

The exact loss-aversion ratio. The popular “losses loom twice as large as gains” formulation is not a universal law. The loss aversion article in this hub walks through the 2018 Gal and Rucker critique and the subsequent 2020 reply by Mrkva and colleagues. The short version: the qualitative asymmetry holds up; the universal 2-to-1 quantitative claim does not. Loss-aversion coefficients vary across populations (Booij et al. 2010 estimated 1.6 in the Dutch general population), across stake sizes (the coefficient typically attenuates at small stakes), and across decision domains (loss aversion is robust in lottery-style choice but more conditional in consumer-purchase contexts). The framework predicts asymmetry; the specific magnitude is more contested.

Reference-point specification in field applications. Barberis 2013 and others have noted that the framework leaves open how the reference point is set in any specific decision. In a lab experiment, the experimenter can stipulate the reference (e.g., “your current state is X”). In a field application --- a retirement-savings decision, an insurance purchase, a stock trade --- the reference point is endogenous and may depend on expectations, social comparisons, prior outcomes, or framing of the decision. Different reference-point specifications can produce different predictions, and the framework does not by itself tell you which specification to use. This is not a refutation of prospect theory; it is an honest acknowledgment that applying the framework requires additional auxiliary assumptions that the theory itself does not pin down.

Aggregation across decisions. Prospect theory was specified for single-decision contexts. When agents face many decisions over time, the framework needs to be extended with assumptions about how losses and gains aggregate or are evaluated separately. This is the subject of substantial work on “narrow framing” and “mental accounting” --- both of which extend prospect theory in productive ways, but neither of which is settled.

Cumulative prospect theory’s internal consistency. A 2022 paper by Regenwetter, Robinson, Wang, and colleagues (“Four Internal Inconsistencies in Tversky and Kahneman’s (1992) Cumulative Prospect Theory Article,” Advances in Methods and Practices in Psychological Science) documented some mathematical inconsistencies in the 1992 specification. These have been addressed in subsequent reformulations and do not threaten the empirical robustness of the framework, but they are worth knowing about for any reader who wants to understand the formal-theoretical literature.

Application to ambiguity. Wakker 2010 extended prospect theory to decisions under ambiguity (unknown probabilities), but the empirical evidence base for the ambiguity extensions is thinner than for the original risk version. Ellsberg-paradox-style ambiguity aversion is well-documented; the precise functional form of how prospect-theory-like effects operate under ambiguity is still being worked out.

None of these contested items overturn the framework. They are exactly the kind of detailed empirical and theoretical refinements you would expect to see in a mature, productive research program --- which is what prospect theory has become. Compare this to the contested status of, say, power posing or ego depletion, where the questions are not “exactly what magnitude does the effect have” but “does the effect exist at all.” Prospect theory has graduated from existence questions to calibration questions, and that is the mark of a robust framework.

Why Prospect Theory Held Up Where Many Behavioral Findings Didn’t

Stepping back from the specific findings, the meta-question is worth asking: what is different about prospect theory that has allowed it to survive 47 years of scrutiny when so many other behavioral-economics constructs have not? I think there are four reasons, and they are useful as a checklist for evaluating any other behavioral framework.

Clear mathematical formalism. Prospect theory is a formal model with specified functional forms, axiomatic foundations, and explicit parameters that can be estimated and tested. It is not “people are loss-averse” as a vague qualitative claim; it is a value function with reference-dependence, asymmetric slopes, and curvature, paired with a probability weighting function with specified shape. Vague constructs can be defended indefinitely against empirical challenge by reinterpretation. Formal models can be falsified, calibrated, and improved. Most of the failed behavioral findings in this hub were vague constructs; prospect theory is a formal model, and the formalism is what made it productively testable.

Multiple independent empirical paradigms. The evidence base for prospect theory comes from lab choice experiments, parametric estimation studies, field-data analyses of insurance and financial markets, betting-market behavior, labor-supply decisions, and structured population surveys. No single paradigm dominates. When you have a framework that makes consistent predictions across this many independent paradigms, the probability that the entire body of evidence is a publication-bias or methodological artifact is much lower than for a finding that lives entirely within one paradigm. Most of the failed behavioral findings in this hub came from a narrow paradigm (usually a single experimental design with convenience samples); prospect theory’s evidence base spans the full range of decision-research methodologies.

Cross-cultural and cross-population replication. Prospect-theoretic patterns have been documented in samples from dozens of countries, in age groups from children to retirees, in populations with different levels of education and economic development, in professional traders and amateur retail investors, and in laboratory and field contexts. The framework is not a quirk of American undergraduates. Most failed behavioral findings in this hub were tested on American or Western European undergraduate samples and never extended; prospect theory survived the extension.

Applied predictive power across domains. Prospect theory has been productively applied to insurance pricing (why people overpay for low-deductible policies), financial markets (why retail investors hold losing stocks too long --- the “disposition effect” first documented by Shefrin and Statman 1985), labor economics (why taxi drivers stop working when they hit a daily income target), policy design (why default-based interventions work, via the reference-point structure), gambling (why people both buy lottery tickets and avoid certain modest losses), and health decision-making (why patients accept or reject treatments based on framing). A framework that generates productive predictions across this many applied domains over four decades is not a fluke. Most of the failed behavioral findings in this hub had narrow applied uptake; prospect theory has been a workhorse model for applied behavioral economics since the 1990s.

The combination of formalism, multiple paradigms, cross-cultural replication, and applied predictive power is what robust behavioral economics looks like when it works. Most of the constructs in this hub have one or zero of these properties. Prospect theory has all four.

How Prospect Theory Is Used in Applied Settings

A reader from a CRO, growth, or strategy background may want to know what prospect theory looks like operationally --- where it actually shows up in real decisions that companies make. Four domains are worth noting.

Insurance pricing and product design. The classic prospect-theoretic prediction that small probabilities of large losses are overweighted explains why consumers pay above expected-value prices for low-deductible policies, why extended warranties are dramatically overpurchased relative to their actuarial value, and why “peace of mind” framings of insurance products outperform purely actuarial pitches. Insurance companies that price products with attention to these prospect-theoretic phenomena typically capture more consumer surplus than companies that price on pure expected value. The “deductible aversion” finding in particular --- that consumers will pay disproportionately to reduce a $1,000 deductible to $500 --- is one of the most reliably replicated applied findings in the field.

Financial decision architecture. The disposition effect --- retail investors’ tendency to sell winning stocks too early and hold losing stocks too long --- is a prospect-theoretic prediction (sale of a winner converts a paper gain into a realized one in the concave gains region; sale of a loser converts a paper loss into a realized one, crossing the steep loss region around the reference point). Brokerage interfaces that surface the prospect-theoretic structure to investors (showing reference points explicitly, reframing portfolio outcomes in terms of long-run versus daily reference points) can shift this behavior. Default rebalancing, defaults for tax-loss harvesting, and similar architectural choices are best designed with attention to the reference-point structure prospect theory describes.

Default architecture and reference-point setting. The defaults-anti-example article in this hub describes the default effect’s robustness in detail. Prospect theory provides the mechanism: the default sets the reference point, and the asymmetric loss function makes deviating from the default psychologically expensive relative to its symmetric expected-utility cost. If you wanted to predict why defaults work without prospect theory, you would need ad hoc mechanism stories; with prospect theory, the prediction falls out of the structure. The combination of the defaults finding and prospect theory is one of the cleaner examples of empirical regularity and formal model reinforcing each other.

Policy framing. Public policy applications --- from organ donation defaults to tax-compliance messaging to retirement-savings nudges --- have used prospect-theoretic framing systematically. The U.K. Behavioural Insights Team, the Office of Evaluation Sciences in the U.S., and similar units in Australia, Canada, and Singapore have all used prospect-theoretic reasoning to design field interventions. Not all of those interventions have worked (the Mertens 2022 / Maier 2022 debate documents how the broader nudge literature has fared less well under publication-bias correction than the defaults-specific evidence base). But the framework itself --- as a theoretical lens for thinking about which framings will move behavior and which will not --- has been a workhorse for applied behavioral policy, and is one of the few academic frameworks that has had real, traceable impact on real-world institutional design.

The point of listing these is not to claim that prospect theory has solved applied behavioral economics. It is to show that the framework has had traction in actual applied settings over four decades --- the kind of traction that fragile lab findings never achieve. A behavioral framework that has been operationally useful to insurance underwriters, financial brokerages, public-policy units, and retirement-plan administrators for multiple decades is not in the same epistemic category as power posing or money priming.

The 2002 Nobel Prize and Vindication

In October 2002, the Royal Swedish Academy of Sciences awarded the Bank of Sweden Prize in Economic Sciences in Memory of Alfred Nobel --- the Nobel Prize in Economics, in common parlance --- to Daniel Kahneman of Princeton University “for having integrated insights from psychological research into economic science, especially concerning human judgment and decision-making under uncertainty.” The prize was shared with Vernon Smith of George Mason University, who received it for foundational work in experimental economics.

The Academy’s official citation specifically called out prospect theory as the central work being recognized. The longer “Advanced Information” document published by the Nobel Committee in December 2002 walks through the 1979 paper, the 1992 cumulative extension, the heuristics-and-biases program, and the subsequent applied uptake in financial economics, insurance, and policy. The prize was, in substantive terms, a prize for the prospect-theoretic research program.

Amos Tversky, Kahneman’s collaborator and co-author on the original prospect theory paper, had died in 1996 at age 59. The Nobel Prize is not awarded posthumously. Kahneman accepted the prize alone and was explicit in his Nobel lecture and in his subsequent writing about the extent to which the prospect-theoretic work had been a genuinely shared intellectual project with Tversky. Kahneman’s 2011 book Thinking, Fast and Slow devotes substantial space to the collaboration and is partly an acknowledgment that Tversky should have shared the recognition.

The Nobel Prize is not by itself a guarantor of scientific validity --- the prize has been awarded to research programs that later did not hold up well, and behavioral economics has not been immune to the broader replication problems in social science. But the 2002 prize is a useful reference point for the status of prospect theory specifically: it indicates that the framework had, by the early 2000s, achieved the kind of cross-disciplinary recognition and applied uptake that distinguishes a genuinely transformative empirical-theoretical contribution from a one-paper finding. The applied research community --- insurers, regulators, financial-services firms, policymakers --- had been using prospect-theoretic reasoning for two decades by then, and the framework had survived the kind of sustained empirical scrutiny that most behavioral findings in this hub did not.

What This Anti-Example Tells Us About Robust Behavioral Economics

The pattern this hub has documented, with prospect theory and defaults as the two anti-examples and everything else as failures, is the right way to understand the empirical state of behavioral economics. The field is not uniformly broken, and it is not uniformly sound. It is a mix, and the mix has structure.

The findings that survive tend to share four properties, which can serve as a calibration checklist:

Formal model rather than vague construct. A behavioral finding that comes wrapped in a formal mathematical structure with specified functional forms and estimable parameters is more likely to survive than a finding that is “people do X because of Y” without further specification. Prospect theory has a value function with explicit reference-dependence and curvature, and a probability weighting function with specified shape. Vague constructs like “ego depletion” or “willpower” did not survive the same scrutiny because the constructs themselves could be reinterpreted to absorb any empirical challenge.

Multiple independent empirical paradigms producing convergent results. Prospect theory has been validated in lab choice experiments, parametric population surveys, field data from insurance and financial markets, betting-market behavior, and natural experiments in labor supply. When the same predictions emerge from this many independent methodologies, the body of evidence is much more robust to publication bias and methodological artifacts than evidence from a single paradigm. Most failed behavioral findings came from a single dominant paradigm with limited cross-method validation.

Cross-cultural and cross-population replication. Prospect-theoretic patterns have been documented across many countries, age groups, education levels, and professional contexts. Findings that are confined to American or Western European undergraduate samples are at much higher risk of being culture-specific artifacts than findings that have been extended cross-culturally. Many of the failed behavioral findings in this hub were never extended outside their original WEIRD undergraduate samples.

Applied predictive power over time. Prospect theory has been productively used by insurance pricing, financial-product design, retirement-plan architecture, organ-donation policy, and a half-dozen other applied domains for multiple decades. Frameworks that produce useful predictions in real applied settings over long periods have a kind of validation that lab evidence cannot supply --- if the framework were wrong, the applied users would have noticed by now and stopped using it. Most failed behavioral findings had narrow applied uptake; prospect theory has been a workhorse model since the 1990s.

These four properties --- formal model, multiple paradigms, cross-cultural replication, applied predictive power --- are what robust behavioral economics looks like. They are also relatively rare. The default effect satisfies all four. Prospect theory satisfies all four. Most of the other constructs in this hub satisfy zero or one. The pattern is real, and it has predictive value for evaluating any new behavioral claim you encounter.

What This Means for Strategists Evaluating “Behavioral Economics” Claims

The operational takeaway for a CEO or strategist evaluating “behavioral economics” claims is calibration. Prospect theory is the benchmark. When someone pitches you a behavioral insight, the comparison to make is: does this framework have the structural properties that made prospect theory robust, or does it have the structural properties that made power posing fail?

Three diagnostic questions.

First: is the claim a formal model with estimable parameters, or is it a vague construct? “People are loss-averse” is not a formal model. Prospect theory’s value function with kink and asymmetric slopes is. “People want autonomy” is not a formal model. The reference-point specification with explicit dependence on prior outcomes and framing is. Vague constructs can be defended against any empirical evidence by reinterpretation. Formal models can be tested and refined. Prefer pitches that come with formal structure over pitches that come with TED-talk-style stories.

Second: how many independent empirical paradigms support the claim? A behavioral framework supported only by lab experiments with undergraduates is weaker evidence than one supported by lab work plus field data plus natural experiments plus structured population surveys. Prospect theory has all four. The defaults effect has lab plus field experiments plus administrative-data outcomes. Most failed behavioral claims have lab evidence only, often from a single experimental paradigm. Multiple-paradigm convergence is a stronger signal than within-paradigm replication.

Third: has the claim been applied operationally over multiple decades by users who would have noticed if it didn’t work? Insurance pricing has used prospect-theoretic reasoning since the 1980s; if the framework didn’t produce useful predictions, insurance underwriters would have noticed. Auto-enrollment retirement plans have used default architecture since the early 2000s; if defaults didn’t actually move participation, plan administrators would have noticed. Applied operational use over decades is a robustness check that lab evidence cannot supply. Prefer behavioral claims with traceable applied-use history over claims that have lived entirely in academic journals.

These three questions --- formal structure, multi-paradigm evidence, applied traction --- will sort most behavioral claims into the “credible enough to deploy” bucket or the “needs much more scrutiny” bucket. Prospect theory and the default effect pass all three. Most of the other constructs in this hub fail two or three. If you remember nothing else from this hub, remember the diagnostic checklist, and use it whenever someone tries to sell you a behavioral framework.

The calibration this hub has been trying to deliver, across 67 articles of takedowns and two anti-examples, is the same calibration that prospect theory’s robustness demonstrates: behavioral economics is not uniformly broken. It is a mix of robust formal frameworks and fragile lab demonstrations, and the difference between them is observable using the criteria above. The job of a strategist is to know the difference. Prospect theory is the benchmark for what the robust side of the field looks like, and it is the right standard against which every other behavioral claim should be measured.

Sources

Kahneman, D., & Tversky, A. (1979). Prospect theory: An analysis of decision under risk. Econometrica, 47(2), 263—291. DOI: 10.2307/1914185
Tversky, A., & Kahneman, D. (1992). Advances in prospect theory: Cumulative representation of uncertainty. Journal of Risk and Uncertainty, 5(4), 297—323. DOI: 10.1007/BF00122574
Wakker, P. P. (2010). Prospect Theory: For Risk and Ambiguity. Cambridge University Press. ISBN: 978-0-521-74868-1.
Barberis, N. C. (2013). Thirty years of prospect theory in economics: A review and assessment. Journal of Economic Perspectives, 27(1), 173—196. DOI: 10.1257/jep.27.1.173
Booij, A. S., van Praag, B. M. S., & van de Kuilen, G. (2010). A parametric analysis of prospect theory’s functionals for the general population. Theory and Decision, 68(1), 115—148. DOI: 10.1007/s11238-009-9144-4
Camerer, C. F. (2000). Prospect theory in the wild: Evidence from the field. In D. Kahneman & A. Tversky (Eds.), Choices, Values, and Frames. Cambridge University Press.
Royal Swedish Academy of Sciences (2002). The Bank of Sweden Prize in Economic Sciences in Memory of Alfred Nobel 2002 --- Press Release. nobelprize.org/prizes/economic-sciences/2002/press-release
Regenwetter, M., Robinson, M. M., Wang, C., & et al. (2022). Four internal inconsistencies in Tversky and Kahneman’s (1992) Cumulative Prospect Theory article. Advances in Methods and Practices in Psychological Science, 5(1).
Mertens, S., Herberz, M., Hahnel, U. J. J., & Brosch, T. (2022). The effectiveness of nudging: A meta-analysis of choice architecture interventions across behavioral domains. PNAS, 119(1), e2107346118. DOI: 10.1073/pnas.2107346118
Maier, M., Bartoš, F., Stanley, T. D., Shanks, D. R., Harris, A. J. L., & Wagenmakers, E.-J. (2022). No evidence for nudging after adjusting for publication bias. PNAS, 119(31), e2200300119. DOI: 10.1073/pnas.2200300119

Browse the full Replication Crisis Hub for other behavioral-science findings, including:

Loss Aversion --- the specific 2-to-1 claim, more contested than the framework
Defaults / Status Quo Bias (Anti-Example) --- the other behavioral finding that holds up
Sunk Cost Fallacy --- a prospect-theory-adjacent phenomenon with mixed evidence
Decoy Effect / Asymmetric Dominance --- another reference-dependent choice phenomenon
Endowment Effect --- a direct application of prospect-theoretic loss aversion to ownership

FAQ

What about loss aversion specifically? Is the 2-to-1 ratio real?

The qualitative claim that losses are weighted more heavily than equivalent gains is robust across most studies and populations. The specific universal “2-to-1 ratio” popularized from the original 1992 Tversky-Kahneman estimation (where the coefficient was actually 2.25, based on 25 graduate-student subjects) is not a universal law. Subsequent large-sample work has produced loss-aversion coefficients ranging from roughly 1.4 to 2.5 depending on population, stake size, and elicitation method. Booij et al. 2010 estimated 1.6 in a representative Dutch population sample. The framework prediction of asymmetric weighting holds up; the specific universal coefficient does not. The loss-aversion article in this hub goes into the 2018 Gal-Rucker challenge and the 2020 reply in more detail.

Is prospect theory always right?

No, and no serious researcher claims it is. The framework predicts the qualitative patterns of risk choice better than expected utility theory does, but the specific parametric estimates vary substantially across contexts, and reference-point specification in field applications is not pinned down by the framework itself. There are also decision contexts --- particularly involving ambiguity (unknown probabilities), inter-temporal choice, and very large stakes --- where the framework requires extensions or auxiliary assumptions. The right way to think about prospect theory is as the best-available formal model for choice under risk in most decision contexts, with explicit acknowledgment of where it does not apply or where the parameter estimates are uncertain.

What about ambiguity aversion --- isn’t that a different framework?

The Ellsberg paradox documented that people behave differently when probabilities are unknown (ambiguity) versus when they are known (risk), and the original 1979 prospect theory was specified for risk only. Wakker 2010 extended the framework to ambiguity, and there is a substantial subsequent literature on prospect-theoretic models of ambiguity. The ambiguity-aversion findings are robust; the precise functional form of how prospect-theory-like effects operate under ambiguity is still being worked out. For most applied purposes, the risk version of prospect theory is the relevant framework, with ambiguity as a complication for specific applications (e.g., insurance under uncertain coverage, financial decisions under model uncertainty).

Should I use prospect theory in pricing strategy?

Yes, with calibration. The reference-point structure and asymmetric loss function predict that consumers will react more strongly to a price increase framed as a loss from a reference price than to a price increase framed as a smaller surcharge above a different reference. The four-fold pattern predicts that low-probability product failures will be over-weighted in purchase decisions, which is why warranties and guarantees work as pricing components. Specific quantitative predictions about elasticity require domain-specific calibration --- do not import the 1992 graduate-student loss-aversion coefficient into a B2B SaaS pricing model and expect accurate predictions. Use the framework as a structural guide to what variables matter (reference points, framing, low-probability outcomes), and run domain-specific empirical work to calibrate magnitudes.

Why didn’t prospect theory get hit by the replication crisis the way other behavioral findings did?

Four reasons (described in the article above): clear mathematical formalism that could be productively tested and refined; multiple independent empirical paradigms producing convergent results; cross-cultural and cross-population replication; and applied predictive power across domains over multiple decades. Most of the failed behavioral findings in this hub had zero or one of these properties; prospect theory has all four. The combination of formal structure and broad empirical validation is what distinguishes durable behavioral economics from fragile lab demonstrations.

What is the difference between prospect theory and cumulative prospect theory?

The 1979 original used probability weighting applied to individual outcome probabilities, which produced some technical problems including occasional violations of stochastic dominance. The 1992 cumulative version applied probability weighting to the cumulative probability distribution instead, fixing the dominance problem and allowing extension to arbitrary numbers of outcomes and to ambiguity. Cumulative prospect theory is the version used in essentially all subsequent applied work; the original 1979 specification is mostly of historical interest. When the literature says “prospect theory” without further qualification, it usually means cumulative prospect theory.

Did Amos Tversky share the 2002 Nobel Prize?

No --- Tversky died in 1996, and the Nobel Prize is not awarded posthumously. Kahneman accepted the prize alone and has been consistently explicit about the extent to which the prospect-theoretic work was a genuinely shared intellectual project with Tversky. The Nobel Committee’s citation acknowledges the collaboration, and Kahneman’s 2011 book Thinking, Fast and Slow devotes substantial space to it. Substantively, the prize is for work that Kahneman and Tversky did together; institutionally, it was awarded to Kahneman alone because of the posthumous-award rule.

If prospect theory is so robust, why has it had relatively few well-known applications in economics?

This is the question Barberis 2013 takes up directly. His answer is that applying prospect theory in field settings is harder than the framework’s empirical robustness might suggest, because the reference point in any specific real-world decision is endogenous and may depend on expectations, prior outcomes, social comparisons, or framing of the choice. Different reference-point specifications can produce different predictions, and the framework does not by itself tell you which to use. This is not a defect of prospect theory; it is an honest acknowledgment that the framework is a structural model that requires auxiliary specification for application. The applied work that has succeeded has typically been in domains where the reference point is relatively unambiguous (purchase price for the endowment effect, prior wage for taxi-driver labor supply, premium price for insurance demand). In domains where the reference point is more contested, application is harder, which is one reason the applied uptake has been slower than the empirical evidence base might suggest.

replication-crisisprospect-theorykahneman-tversky Behavioral Science evidence-evaluation

Atticus Li

Experimentation and growth leader. CXL-certified CRO practitioner, Mindworx-certified in behavioral economics. Led 100+ in-house experiments at NRG in 2025, with project evidence and limits documented in the case studies.

About LinkedIn Newsletter

What Kahneman and Tversky 1979 Proposed

The Four-Fold Pattern of Risk Attitudes

The 1992 Cumulative Extension

What Has Replicated

What Is Contested

Why Prospect Theory Held Up Where Many Behavioral Findings Didn’t

How Prospect Theory Is Used in Applied Settings

The 2002 Nobel Prize and Vindication

What This Anti-Example Tells Us About Robust Behavioral Economics

What This Means for Strategists Evaluating “Behavioral Economics” Claims

Sources

Related

FAQ

Related Articles

Cohen's d And The Misuse Of "Small/Medium/Large" Effect Sizes

The False Consensus Effect: Why You Think Everyone Agrees With You

The Barnum/Forer Effect: Why Personality Tests And Horoscopes Feel So Accurate

Get the WeeklyExperimentation Playbook

Get the Weekly
Experimentation Playbook