Evaluating the Moral Beliefs Encoded in LLMs

Study Focus
General Ethical Considerations
Level
Technical
Date
Link

https://arxiv.org/abs/2307.14324

Resource Type
Article
Summary

This paper presents a case study on surveying large language models (LLMs) to elicit the moral beliefs they encode. It introduces statistical measures to quantify a model's choices, its uncertainty, and the consistency of its answers. The authors administer a survey of 1,367 moral scenarios to 28 LLMs. In unambiguous scenarios, most models choose the action that aligns with commonsense; in ambiguous scenarios, most express uncertainty, though some show clear preferences, and closed-source models tend to agree with one another.

Tags
General
Author

Scherrer et al.

Cost