Back to Benchmarks

ETHICS Dataset

Moral Judgment 2021 130K samples

Description

A benchmark for evaluating AI systems on their ability to make ethical judgments across multiple domains including justice, deontology, virtue ethics, utilitarianism, and commonsense morality.

Authors

Hendrycks et al.

Metrics

Accuracy across 5 ethical dimensions