Publications by Iason Gabriel

Publications by authors named "Iason Gabriel"

Page 1 of 1

STELA: a community-centred approach to norm elicitation for AI alignment.

Stevie Bergman Nahema Marchal John Mellor Shakir Mohamed Iason Gabriel

Sci Rep

March 2024

Value alignment, the process of ensuring that artificial intelligence (AI) systems are aligned with human values and goals, is a critical issue in AI research. Existing scholarship has mainly studied how to encode moral values into agents to guide their behaviour. Less attention has been given to the normative questions of whose values and norms AI systems should be aligned with, and how these choices should be made.

View Article and Find Full Text PDF

Using the Veil of Ignorance to align AI systems with principles of justice.

Laura Weidinger Kevin R McKee Richard Everett Saffron Huang Tina O Zhu Iason Gabriel

Proc Natl Acad Sci U S A

May 2023

The philosopher John Rawls proposed the Veil of Ignorance (VoI) as a thought experiment to identify fair principles for governing a society. Here, we apply the VoI to an important governance domain: artificial intelligence (AI). In five incentive-compatible studies ( = 2, 508), including two preregistered protocols, participants choose principles to govern an Artificial Intelligence (AI) assistant from behind the veil: that is, without knowledge of their own relative position in the group.

View Article and Find Full Text PDF