Evaluating Character Understanding of Large Language Models via Character Profiling from Fictional Works
arXiv (2024)
Abstract
Large language models (LLMs) have demonstrated impressive performance and
spurred numerous AI applications, in which role-playing agents (RPAs) are
particularly popular, especially for fictional characters. The prerequisite for
these RPAs lies in the capability of LLMs to understand characters from
fictional works. Previous efforts have evaluated this capability via basic
classification tasks or characteristic imitation, failing to capture the
nuanced character understanding of LLMs. In this paper, we propose evaluating
LLMs' character understanding capability via the character profiling task,
i.e., summarizing character profiles from corresponding materials, a widely
adopted yet understudied practice for RPA development. Specifically, we
construct the CroSS dataset from literature experts and assess the generated
profiles by comparing them with ground-truth references and by testing their
applicability in downstream tasks. Our experiments, which cover various summarization methods
and LLMs, have yielded promising results. These results strongly validate the
character understanding capability of LLMs. We believe our constructed resource
will promote further research in this field. Resources are available at
https://github.com/Joanna0123/character_profiling.