Human-centred mechanism design with Democratic AI (July 2022, doi:10.1038/s41562-022-01383-x)

NATURE HUMAN BEHAVIOUR (2022)

Abstract
Koster, Balaguer et al. show that an AI mechanism is able to learn to produce a redistribution policy which is preferred to alternatives by humans in an incentivized game. Building artificial intelligence (AI) that aligns with human values is an unsolved problem. Here we developed a human-in-the-loop research pipeline called Democratic AI, in which reinforcement learning is used to design a social mechanism that humans prefer by majority. A large group of humans played an online investment game that involved deciding whether to keep a monetary endowment or to share it with others for collective benefit. Shared revenue was returned to players under two different redistribution mechanisms, one designed by the AI and the other by humans. The AI discovered a mechanism that redressed initial wealth imbalance, sanctioned free riders and successfully won the majority vote. By optimizing for human preferences, Democratic AI offers a proof of concept for value-aligned policy innovation.
Keywords
Economics; Science, technology and society; Life Sciences, general; Behavioral Sciences; Neurosciences; Microeconomics; Personality and Social Psychology; Experimental Psychology