LessWrong AI
Do LLMs Have Desires?
Work conducted with Yujun Zhou (yzhou25@nd.edu) and supported by SPAR.

TL;DR:

- In paired-choice paradigms, LLMs report consistent preferences over outcomes (e.g., types and number of lives saved, types of policies enacted).
- Some have suggested that this indicates that LLMs have human-like value systems.
- We design an experiment