The best self-report instrument ever developed — conscientiousness on the Big Five — explains 4% of job performance variance. Half of MBTI test-takers receive a different type within five weeks. Not because they changed. Because the instrument was measuring something that doesn't exist at a meaningful level of precision: your opinion of yourself, under no pressure.
Self-report has a fundamental limit. Not a technical one — a theoretical one. Nisbett and Wilson established in 1977 that people have little or no direct introspective access to the processes that drive their decisions. When asked why they chose something, they confabulate. They produce plausible narratives. They don't know.
We're not building a better personality test. We're measuring something different.
In behavioral economics, revealed preference is the principle that what you choose under real constraints is more informative than what you say you'd choose. Paul Samuelson established this in 1938. It took personality psychology another eighty years to fully reckon with it.
Origin Protocol applies this principle directly. You're placed in a lunar colony in 2033. 180 people. No return. Every decision carries weight: resource scarcity, moral conflict, allegiance under pressure. You can't optimize for a good result, because there's no answer key.
After 21 decisions, we don't ask what you think you are. We show you what you did.
Situational judgment tests — the category Origin Protocol belongs to — consistently outperform self-report personality measures in predictive validity. The optimally weighted Big Five battery achieves R = .16 for job performance. A well-designed situational measure achieves ρ = .26–.34. The entire personality testing industry has been optimizing the wrong instrument.
Fleeson's whole trait theory explains why aggregating across 21 scenarios works: while any single behavioral observation correlates with stable traits at only r ≈ .25, aggregating across scenarios produces behavioral distribution stability of r = .91–.97. The noise cancels. The pattern remains.
Pressure matters because of dual-process architecture. Under constraint, System 1 dominates — fast, automatic, shaped by deep learning and habitual patterns. Impression management requires System 2. Moderate pressure deactivates it.
MBTI is a billion-dollar industry built on an instrument that fails basic psychometric standards. Big Five is more defensible — but it's still asking you to describe yourself on a quiet Tuesday.
Origin Protocol occupies a different category entirely. Behavioral assessment under constraint. Observed decision patterns, not stated dispositions. A permanent record — not of who you say you are, but of what you did when the stakes were real.
The record doesn't ask for your permission. And it doesn't forget.
We aggregate 21 behavioral observations across four phases of escalating pressure: arrival, stakes, rupture, legacy. Fleeson (2001) demonstrated that trait-behavior correlations rise from r ≈ .25 for single observations to r ≈ .50 for aggregated behavioral distributions. At 21 scenarios, we're not measuring a single decision — we're measuring the distribution of how you respond when comfortable options are removed.
The scenarios are designed using Trait Activation Theory (Tett & Burnett, 2003): pressure must be trait-relevant, not merely stressful. An overly constraining situation forces everyone to respond the same way, producing no variance. Origin Protocol scenarios occupy the diagnostic middle: strong enough situational pressure to suppress deliberate self-presentation, structured specifically to activate the behavioral differences being measured.
Check-in reassessment is available at 90-day intervals. This is not arbitrary. Memory contamination and practice effects dissipate substantially by 10 weeks (Scharfen et al., 2018). Mood-state carryover from shared life circumstances remains below threshold at 90 days. Genuine personality trait change at this interval is negligible — rank-order stability is approximately r = .98 annually (Conley, 1984). What does change is behavioral expression and decision-making repertoire: how you respond now, given what has happened since. The reassessment captures adaptation, not transformation.
Usage notes: Hero primary is the recommended default. The "direct challenge" variant (C) is more aggressive — test for audience fit before deploying widely. One-liners are copy-paste ready for social, press kit, or pitch decks. Methodology paragraphs are written to cite real academic sources — don't alter the numbers without checking against science.html.