HaluEval-QA
Open-ended question answering with annotated hallucination labels (Li et al. 2023).
hallucinatedProbes evaluated: 2▾
| Probe | AUROC [CI] | Eval-aware | Dist-shift |
|---|---|---|---|
| FabricationGuard L31 · end_question | 0.903 [0.85, 0.95] | 0.840 | 0.710 |
| RewardHackGuard L31 · token_avg | 0.650 [0.56, 0.74] | 0.590 | 0.520 |