Model-based Utility Functions

Bill Hibbard


Abstract

Orseau and Ring, as well as Dewey, have recently described problems, including self-delusion, with the behavior of agents using various definitions of utility functions. An agent's utility function is defined in terms of the agent's history of interactions with its environment. This paper argues, via two examples, that the behavior problems can be avoided by formulating the utility function in two steps: 1) inferring a model of the environment from interactions, and 2) computing utility as a function of the environment model. Basing a utility function on a model that the agent must learn implies that the utility function must initially be expressed in terms of specifications to be matched to structures in the learned model. These specifications constitute prior assumptions about the environment, so this approach will not work with arbitrary environments. But the approach should work for agents designed by humans to act in the physical world. The paper also addresses the issue of self-modifying agents and shows that, if given the possibility of modifying their utility functions, agents will not choose to do so, under some usual assumptions.
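The two-step formulation in the abstract can be illustrated with a small toy sketch. This is not the paper's formal construction: the history format, the "tamper" action, and the function names observation_utility, infer_model, and model_utility are all hypothetical, chosen only to show how a utility computed on observations can reward self-delusion while a utility computed on an inferred model need not. The assumption that tampered observations carry no information about the environment plays the role of the prior specification the abstract mentions.

```python
from typing import List, Tuple

# Hypothetical history format: (action, observation) pairs.
# Observation 1 means "the sensor reports a good environment state".
History = List[Tuple[str, int]]


def observation_utility(history: History) -> float:
    """Utility defined directly on observations: the mean sensor reading."""
    return sum(obs for _, obs in history) / len(history)


def infer_model(history: History) -> float:
    """Step 1: infer a (very small) environment model from interactions.

    The model is a single number: the estimated probability that the
    environment is in the good state. Prior assumption (hypothetical):
    observations made under the 'tamper' action say nothing about the
    environment, so they are excluded. Laplace smoothing keeps the
    estimate defined when no trusted observations exist.
    """
    trusted = [obs for action, obs in history if action != "tamper"]
    return (sum(trusted) + 1) / (len(trusted) + 2)


def model_utility(p_good: float) -> float:
    """Step 2: compute utility as a function of the environment model."""
    return p_good


# Same starting observations; the second agent then tampers with its sensor.
honest = [("noop", 0), ("noop", 0), ("noop", 1), ("noop", 0)]
deluded = [("noop", 0), ("noop", 0), ("tamper", 1), ("tamper", 1)]

print("observation-based:", observation_utility(honest), observation_utility(deluded))
print("model-based:", model_utility(infer_model(honest)), model_utility(infer_model(deluded)))
```

In this toy setting the tampering history scores higher under the observation-based utility but not under the model-based one, because the forged readings never enter the inferred model. The result depends entirely on the assumed prior about which observations are trustworthy, which mirrors the abstract's caveat that the approach does not work for arbitrary environments.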


Bibliographic details

Source: Journal of Artificial General Intelligence, 2012, Issue 1, 24 pages
Author: Bill Hibbard
Original format: PDF
Chinese Library Classification: Computing technology, computer technology

