当前位置: X-MOL 学术arXiv.cs.AI › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
The Incentives that Shape Behaviour
arXiv - CS - Artificial Intelligence Pub Date : 2020-01-20 , DOI: arxiv-2001.07118
Ryan Carey, Eric Langlois, Tom Everitt and Shane Legg

Which variables does an agent have an incentive to control with its decision, and which variables does it have an incentive to respond to? We formalise these incentives, and demonstrate unique graphical criteria for detecting them in any single decision causal influence diagram. To this end, we introduce structural causal influence models, a hybrid of the influence diagram and structural causal model frameworks. Finally, we illustrate how these incentives predict agent incentives in both fairness and AI safety applications.

中文翻译:

塑造行为的动机

代理人有动力控制哪些变量,有动力响应哪些变量?我们将这些激励形式化,并展示了在任何单个决策因果影响图中检测它们的独特图形标准。为此,我们引入了结构因果影响模型,这是影响图和结构因果模型框架的混合体。最后,我们说明了这些激励如何在公平和 AI 安全应用中预测代理激励。
更新日期:2020-01-22
down
wechat
bug