AI Hallucination Risk Checker

How likely is your AI response to contain hallucinated facts?

Paste in what you know about an AI response and get a hallucination risk score. The tool weighs confidence level, query complexity, training data freshness, claim density, and whether sources were cited to tell you how hard you should verify before acting.

Updated July 2026 · How this works

AI Confidence Score (%)

Query Complexity

Training Data Recency

Number of Factual Claims

Response Length (words)

Citations or Sources Provided

—

See a way to make this better?

Worth knowing

Learn more

How It Works

The formula, explained simply

When an AI model generates a response, it does not look anything up in a database — it predicts the next most plausible token based on patterns in its training data. That means a response can sound authoritative and specific while being entirely fabricated. The model does not know it is wrong. This is what researchers call a hallucination: a confident, fluent, false output.

The hallucination risk score works by stacking risk factors multiplicatively. It starts from the model's stated confidence level — a 15% base risk for the example inputs — then amplifies or shrinks that number based on how hard the question is, how fresh the relevant training data is, whether the model backed its claims with sources, and how densely packed the response is with specific assertions. Each factor applies independently as a multiplier, so a single bad factor can double the risk, and several bad factors compound aggressively.

The result is a single percentage that tells you where to focus your verification energy. A 5% score on a simple lookup with full citations means a quick sanity-check is probably enough. A score in the high or very high range on an expert topic with no sources means you should treat the AI output as a rough draft, not a conclusion, and verify each specific claim before acting on any of them.

When To Use This

Right tool, right situation

Use this tool before acting on any AI-generated content that includes specific factual claims — research summaries, medical or legal background, technical specifications, historical facts, or business data. It is especially useful when you are evaluating responses from a model you have not used before and do not have a calibrated sense of its reliability on the topic in question.

This tool is also the right sanity-check when you are in a time-pressured situation and need to decide how much verification effort to allocate. A low-risk score on a simple topic with current training and full citations means a quick read is probably sufficient. A very high risk score means you need to budget real time for verification — treat the AI output as raw material, not as a finished answer.

Do not rely on this tool when the cost of being wrong is extremely high — medical diagnoses, legal filings, safety-critical engineering decisions. In those contexts, no risk score is a substitute for domain-expert review. The score helps you triage your verification effort; it does not replace expert judgment. Similarly, this tool is not designed to evaluate AI-generated creative content, code, or opinion pieces where factual hallucination is less the concern than logical or stylistic quality.

Common Mistakes

Why results sometimes look wrong

Mistake 1 — Treating confidence score as a reliability guarantee. The confidence score is the model's self-assessment, not an external audit. Models are miscalibrated: they often express high confidence on topics where their training data was sparse or contradictory. A 95% confidence score on an expert-domain question with outdated training data still produces a meaningful risk contribution from the other multipliers. Never short-circuit the full calculation just because confidence is high.

Mistake 2 — Counting only obvious numerical claims. When counting factual claims, people tend to count statistics and dates while missing softer claims: named organizations, attributed quotes, procedural steps presented as definitive, and descriptions of how systems work. These are all hallucination targets. Undercounting claims artificially suppresses the density multiplier and understates the true risk.

Mistake 3 — Assuming citations mean verified facts. AI models can cite sources that do not exist, or that exist but do not say what the model claims they say. Selecting yes for citations reduces the risk multiplier because cited responses are somewhat more likely to be accurate — but it does not eliminate risk. If you are making a consequential decision, follow the actual URLs and read the actual sources, do not just note that citations were present.

∑

The Math

Worked examples and deeper derivation

The formula follows a multiplicative chain. First, compute the base risk: Base Risk = 100 minus the confidence score. For the example with a confidence score of 85, the base risk is 15%.

Next, apply four independent multipliers in sequence. Complexity: Simple queries use a 0.8x multiplier; Moderate a 1.0x; Complex a 1.4x; Expert a 1.8x. Training recency: Current topics use 0.7x; Recent 1.0x; Outdated 1.5x; Unknown 1.2x. Citations: Provided sources use 0.6x; Partial 0.9x; None 1.2x.

Claim density is computed as factual claims divided by (response length divided by 100). The density multiplier equals 1 plus (density times 0.1), capped at 2x and floored at 1x. For the example, that yields a density multiplier of 1.2xx. The final formula is: Risk = Base Risk times Complexity times Recency times Citations times Density. The result is then capped between 0 and 100 for display. The example yields 16.2%.

A straightforward fact-check on a well-known topic

Confidence 85%, moderate complexity, recent training data, 3 claims in 150 words, partial citations

Starting from a confidence score of 85%, the base risk is 15%. Moderate complexity applies a 1.0x multiplier, recent training data applies a 1.0x multiplier, and partial citations apply a 0.9x multiplier. With 3 claims across 150 words, the claim density drives a density multiplier of 1.2xx. The final risk score is 16.2%, placing this response in the moderate risk category. The right move here is to verify the most important of the 3 factual claims before acting — the partial citations help, but do not fully substitute for independent confirmation.

A high-stakes expert query with no cited sources

Confidence 65%, expert complexity, outdated training data, 8 claims in 200 words, no citations

A confidence score of 65% yields a base risk of 35. Expert complexity multiplies by 1.8, outdated training data by 1.5, and no citations by 1.2. With 8 claims in 200 words, the claim density is high and the density multiplier reaches 2x. The uncapped calculation exceeds 100%, so the displayed risk is capped at 100. This is a textbook very high risk result — every one of those 8 factual claims needs independent verification before the response is used for anything consequential.

A simple lookup with full citations and high confidence

Confidence 92%, simple complexity, current training data, 2 claims in 100 words, yes citations

A confidence score of 92% gives a base risk of 8. Simple complexity applies a 0.8x multiplier, current training data 0.7x, and cited sources 0.6x. With only 2 claims in 100 words the density multiplier is 1.2x. The final risk score is 2.7 — solidly in the low risk category. For this type of response, a quick mental sanity-check is usually sufficient; full independent verification would be overkill.

Expert Unlock

The thing most explanations skip

The multiplicative structure of this formula means risk can technically exceed 100% before capping — a mathematical artifact that signals when multiple high-risk factors are stacked. The cap keeps the display interpretable, but practitioners should note that a capped score does not mean all risks are equal: a response that computes to 180% uncapped is categorically more concerning than one that computes to 65% uncapped, even though both display as very high risk. When the uncapped value matters, check the base risk and density multiplier sub-outputs to understand which factors are driving the score.

The density multiplier has a deliberate ceiling of 2x to prevent a single very dense paragraph from dominating the entire score. In practice, this means the formula treats extremely claim-dense responses the same as moderately dense ones once the cap is hit — which is a known simplification. In a real evaluation, a 20-claim response in 100 words deserves more scrutiny than the capped multiplier alone would suggest.

What does my AI hallucination risk score actually mean?

What is a safe hallucination risk score for a business decision?

Scores below 15% are classified as low risk, meaning the combination of model confidence, query type, training freshness, and citation quality all point toward a reliable response. For any decision with real consequences — financial, medical, legal — even a low-risk score warrants a spot-check of the most critical claims. The score tells you how hard to verify, not whether to verify.

Can a high confidence score from the AI mean low hallucination risk?

High confidence lowers the base risk, but it does not override all other factors. A model can be confidently wrong — especially on expert-level topics with outdated training data and no citations provided. If your inputs score complex, outdated, and no citations, even a 90% confidence score can still produce a moderate or high overall risk rating because the other multipliers stack on top of the base.

Why does claim density increase hallucination risk?

Each factual claim in an AI response is an independent opportunity for the model to fabricate or misremember a detail. A response that packs many specific facts into a short passage — dates, figures, names, citations — gives the model more chances to get something wrong than a response that makes few concrete assertions. The density multiplier captures this: more claims per 100 words means higher cumulative risk, up to a cap of 2x.

Need something this doesn't cover?

Suggest a tool — we'll build it →