Auspexi
Technology

Pre‑generation Hallucination Risk Guard

Idea: hallucinations aren’t random. Estimate risk before answering, bound your hallucination rate with a calibrated threshold, and take safe actions.

How it works

Why pre‑generation risk?

Hallucinations correlate with internal uncertainty and weak support. Estimating risk before generation lets systems avoid costly wrong answers by fetching evidence or abstaining outright, improving precision under fixed latency budgets.

Quick start

// Target rate
target = 0.05
// Calibrate threshold on recent samples
threshold = calibrate(samples, target)
// Decide per request
action = decide(risk, threshold)

Signals and weights

Calibration procedure

  1. Collect (features, label) pairs where label indicates a known hallucination or correctness.
  2. Compute risk for each sample and sort by ascending risk.
  3. Choose the largest threshold where empirical hallucination rate ≤ target on held‑out data.

Policy integration

Below threshold: generate. Near threshold: fetch more context (expand top‑k, add tools). Above high‑risk buffer: abstain or reroute to a stronger model. Record decisions and outcomes for audits.

Evidence & audits

Log thresholds and outcomes with selective prediction and SLO status inside evidence bundles. Keep it falsifiable and reproducible.

Common pitfalls

Further reading

Attribution: Inspired by recent work on pre‑generation risk estimation and hallucination controls, including the resources above.

See also: Stability Demo · Whitepaper · FAQ