AI Is a Microcosm of the Entire Market: Test Reactions with AI Before Spending on Market Research

One thing upfrontThis method is a "pre-test," not a market validation conclusion. It helps you identify versions that are clearly confusing or clearly unwanted before you spend time finding real people. It cannot replace real-person interviews, pre-orders, sign-ups, or actual sales.

Who This Is For

People who want to launch a new product, course, or service but are unsure whether anyone will buy; solo founders or small businesses without a budget for large-scale market research; anyone with an existing landing page who wants to check where it gets misunderstood.

What This Article Covers

Why AI can act as a pre-consumer for you, what the Colgate study actually did, what that "90%" figure really means, and a validation workflow that has AI write reactions first and then organize the signals.

What You Can Take Away

A clear understanding of this method's principles and limits, a ready-to-copy prompt you can run immediately, and a free skill pack I've prepared that you can download and use right away.

Topic: AI Consumer Validation Topic: Market Testing Level: Beginner Type: Tutorial Article

Why Validating Demand Is So Hard

If you have ever launched a product, course, or collaboration, you have probably run into the same thing.

You ask someone "what do you think of this?" and you get back "it's great!" or "I'm interested." Then when it actually comes to paying, signing up, or scheduling time, they disappear.

This is not because people are deliberately misleading you. The question was wrong. You asked for opinions and predictions about future behavior, and people are actually quite poor at predicting their own future actions. What can serve as real evidence is the time, money, and effort someone has genuinely spent in the past trying to solve this problem.

The trouble is that real-person validation has practical barriers: it's slow, costly, and hard to schedule. To interview five to eight people or send out one or two hundred surveys, just finding the right people and coordinating schedules is enough to kill most ideas before validation ever happens.

And so I kept coming back to one thought.

AI Is a Microcosm of the Entire Market

Today's AI was trained on enormous amounts of human writing. Forum complaints, unboxing reviews, shopping comments, customer service conversations, social media discussions, all of it is in there.

In other words, it carries within itself the compressed tones, hesitations, reasons to buy, and reasons not to buy of countless people across an entire market. It is itself a kind of microcosm of the market.

The thoughtSo why not simply ask it to act as a consumer? At first I thought this was just my own speculation. Then I came across a study where someone had actually done exactly this, and it turned out to be a company we all know that ran the real-world test.

What the Colgate Study Actually Did

In plain terms, this study asked one question: can AI actually simulate real human purchase intent? It was conducted by PyMC Labs and Colgate-Palmolive, and the full paper title is LLMs Reproduce Human Purchase Intent via Semantic Similarity Elicitation of Likert Ratings, published on arXiv (link at the bottom of this page).

What they did was straightforward:

They took 57 real market surveys on personal care products, totaling 9,300 real human responses.
They had AI (testing GPT-4o and Gemini-2.0-flash) play the role of consumers and simulate purchase intent for those surveys.
They then compared the AI simulations against the real human responses.

They First Discovered a Trap

If you directly ask AI "on a scale of 1 to 5, would you buy this product?" the AI will give you a number that looks reasonable. The problem is that these numbers cluster heavily in the safe, inoffensive middle range, and the overall distribution looks nothing like real human responses.

In plain termsAsking directly for a score is the same as pressuring AI to give a diplomatic answer. It will give you something average, safe, apparently fine, and practically useless.

Their Solution

Their method is called SSR. In plain terms, it means "have AI write its reaction first, then convert the reaction into a score." The order is exactly reversed:

First, let AI describe its reaction to the product in natural language, without giving it a score at this stage.
Then take that reaction and compare it against the descriptions for each score from 1 to 5, to see which score the meaning most closely resembles.
The result is a distribution, not a single number.

The key is in the order: speak naturally first, and the score is derived afterward. Genuine reaction comes first; judgment follows.

What That "90%" Actually Means

To state the conclusion upfront: the purchase intent simulated by AI is already close to the level of consistency a real person achieves when filling out the same survey twice, roughly 90%. The overall distribution of answers also closely resembles real human responses.

Making "consistency" preciseWhen the same group of real people fill out the same survey twice, their answers will not be identical either. This "consistency with themselves" is the ceiling (called test-retest reliability in the study). AI simulation reaches about 90% of that ceiling (correlation coefficient rho around 0.90), and the distribution of answers is similarly close. So the correct statement is "AI reaches 90% of human self-consistency," not "AI answers match human answers 90% of the time." These sound similar but mean very different things.

How to Run This Yourself

You do not need to replicate the algorithm from the study. What you can actually bring back and use is the order: let AI write genuine reactions first, then organize them into signals.

Prepare your inputs first. Organize four things: the product concept (one to two hundred words), the target customer (describe this person's situation, what they are trying to accomplish, and where they are stuck, not just "35-year-old female"), the purchase context (who sees it, who decides, where the budget comes from), and your core assumption (only validate one thing per round).
Build a diverse synthetic audience. Choose at least three types of people, and not all fans of yours: a core audience member, an impatient decision-maker who skims for 30 seconds, and someone with a tight budget who always looks for alternatives.
Let AI write reactions first, no scoring allowed. This is the core of the entire method. The prompt is below.
Organize reactions into four types of signals. Comprehension signals, cost signals, alternative signals, and next-step signals. If you want to convert to scores, put scores last and include reasoning and what was uncertain.
Return to real people. Use the output for one of three things: turn AI's doubts into follow-up interview questions, rewrite misunderstood sentences, or design a commitment test (waitlist, pre-order page, small information session).

A Prompt You Can Copy and Use Right Now

You are now playing the role of this potential customer:
[Paste the specific audience profile you have defined]

Do not give a score yet, and do not offer advice.

Write in the first person your most immediate, unfiltered reaction after seeing this product concept. Include:
1. What problem you think it is solving at first glance
2. What is appealing about it
3. What makes you skeptical
4. Whether you would want to learn more, and why
5. Alternatives that come to mind

Product concept:
[Paste your product description]

The Limits of This Method (Important)

This method works well, and precisely because it works well, its limits need to be stated clearly upfront to prevent overuse.

It cannot replace real people. Pre-orders, sign-ups, payments, referrals, retention: these real financial commitments cannot be simulated by AI. Its role is only to surface flaws before you spend time finding real people.
That 90% figure has prerequisites. The study data came from personal care product surveys, a category with low price points and fast decisions. If you are selling high-ticket consulting, enterprise services, or long-term courses, the decision logic is entirely different. You cannot apply the 90% figure directly.
Do not treat AI scores as validated market evidence. It is a filtering tool, not a conclusion. If no real-person action follows after reviewing the output, no validation has occurred.

A positioning reminder for yourselfThe real value of this method is letting you focus your real-person validation effort where it matters most. Over time, what accumulates is not AI's answers but your own increasingly accurate judgment about the market, your customers, and the reasons they do not buy. Tools will keep changing; the judgment that stays with you is the real knowledge asset.

I Turned It into a Skill Pack

I have already organized this method into a skill pack on GitHub. It is free and available to download and use directly.

AI Consumer Validation Skill Pack

After downloading, place it in your AI tool skill pack folder, or paste the contents of SKILL.md directly to an AI that supports skill packs (Claude or ChatGPT both work), tell it "AI consumer validation: [your product or idea]," and it will walk through the process described above for you.

Free Download Skill Pack (GitHub) ↗

In One Sentence

Real-person validation is slow and expensive, but AI is itself a microcosm of the market. Before spending big on validation, let it act as a consumer and write genuine reactions to surface the obvious flaws, then bring the more refined version to real people. It is a filtering tool, not a market conclusion.

If you also want to organize your own judgment and processes into a knowledge base, workflow, or skill pack that AI can amplify, feel free to start in the community. Let's talk.

Free Online Talks

Two free talks every month.

Join the LINE Community ↗

AI Is a Microcosm of the Entire Market

Why Validating Demand Is So Hard

AI Is a Microcosm of the Entire Market

What the Colgate Study Actually Did

They First Discovered a Trap

Their Solution

What That "90%" Actually Means

How to Run This Yourself

A Prompt You Can Copy and Use Right Now

The Limits of This Method (Important)

I Turned It into a Skill Pack

In One Sentence

Related Links and Further Reading