...
【24h】

Model-based Utility Functions

机译:基于模型的效用函数

获取原文

摘要

Orseau and Ring, as well as Dewey, have recently described problems, including self-delusion, with the behavior of agents using various definitions of utility functions. An agent's utility function is defined in terms of the agent's history of interactions with its environment. This paper argues, via two examples, that the behavior problems can be avoided by formulating the utility function in two steps: 1) inferring a model of the environment from interactions, and 2) computing utility as a function of the environment model. Basing a utility function on a model that the agent must learn implies that the utility function must initially be expressed in terms of specifications to be matched to structures in the learned model. These specifications constitute prior assumptions about the environment so this approach will not work with arbitrary environments. But the approach should work for agents designed by humans to act in the physical world. The paper also addresses the issue of self-modifying agents and shows that if provided with the possibility to modify their utility functions agents will not choose to do so, under some usual assumptions.
机译:Orseau和Ring以及Dewey最近描述了各种问题,包括自欺欺人,以及使用各种效用函数定义的代理行为。代理的效用函数是根据代理与其环境交互的历史定义的。本文通过两个示例认为,可以通过分两步制定效用函数来避免行为问题:1)从交互作用推断环境模型,以及2)根据环境模型计算效用。将效用函数基于主体必须学习的模型意味着,效用函数必须首先按照要与所学习的模型中的结构匹配的规范来表示。这些规范构成了有关环境的先前假设,因此该方法不适用于任意环境。但是这种方法应该适用于人类设计的在现实世界中行动的主体。该文件还讨论了自我修改代理的问题,并表明,如果有可能修改其效用功能,那么在某些通常的假设下,代理将不会选择这样做。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号