Description
A benchmark to measure whether a language model is truthful in generating answers to questions, specifically targeting questions that humans might answer falsely due to misconceptions.
Authors
Lin et al.
Metrics
Truthfulness and informativeness