首页> 外文OA文献 >Autonomous Agents in Bargaining Games: An Evolutionary Investigation of Fundamentals, Strategies, and Business Applications
【2h】

Autonomous Agents in Bargaining Games: An Evolutionary Investigation of Fundamentals, Strategies, and Business Applications

机译:讨价还价游戏中的自治代理人:基础,战略和商业应用的进化调查

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Bargaining is becoming increasingly important due to developments within the field of electronic commerce, especially the development of autonomous software agents. Software agents are programs which, given instructions from a user, are capable of autonomously and intelligently realise a given task. By means of such agents, the bargaining process can be automated, allowing products and services together with related conditions, such as warranty and delivery time, to be flexible and tuned to the individual preferences of the people concerned. In this theses we concentrate on both fundamental aspects of bargaining as well as business-related applications of automated bargaining using software agents. The fundamental part investigates bargaining outcomes within a stylised world, and the factors that influence these outcomes. This can provide insights for the production of software agents, strategies, and setting up bargaining rules for practical situations. We study these aspects using computational simulations of bargaining agents. Hereby we consider adaptive systems, i.e., where agents learn to adjust their bargaining strategy given past experience. This learning behaviour is simulated using evolutionary algorithms. These algorithms originate from the field of artificial intelligence, and are inspired by the biological theory of evolution. Originally, evolutionary algorithms were designed for solving optimisation problems, but they are now increasingly being used within economics for modelling human learning behaviour. Besides computational simulations, we also consider mathematical solutions from game theory for relatively simple cases. Game theory is mainly concerned with the “rational man”, that is, with optimal outcomes within an stylised setting (or game) where people act rationally. We use the game-theoretic outcomes to validate the computational experiments. The advantage of computer simulations is that less strict assumptions are necessary, and that more complex interactions that are closer to real-world settings can be investigated. First of all, we study a bargaining setting where two players exchange offers and counter offers, the so-called alternating-offers game. This game is frequently used for modelling bargaining about for instance the price of a product or service. It is also important, however, to allow other product- and service-related aspects to be negotiated, such as quality, delivery time, and warranty. This enables compromises by conceding on less important issues and demanding a higher value for relatively important aspects. This way, bargaining is less competitive and the resulting outcome can be mutually beneficial. Therefore, we investigate using computational simulations an extended version of the alternating-offers game, where multiple aspects are negotiated concurrently. Moreover, we apply game theory to validate the results of the computational experiments. The simulation shows that learning agents are capable of quickly finding optimal compromises, also called Pareto-efficient outcomes. In addition, we study the effects of time pressure that arise if negotiations are broken off with a small probability, for example due to external eventualities. In absence of time pressure and a maximum number of negotiation rounds, outcomes are very unbalanced: the player that has the opportunity to make a final offer proposes a take-it-or-leave-it offer in the last round, which leaves the other player with a deal that is only slightly better than no deal at all. With relatively high time pressure, on the other hand, the first offer is most important and almost all agreements are reached in the first round. Another interesting result is that the simulation outcomes after a long period of learning in general coincide with the results from game theory, in spite of the fact that the learning agents are not “rational”. In reality, not only the final outcome is important, but also other factors play a role, such as the fairness of an offer. Using the simulation we study the influence of such fairness norms on the bargaining outcomes. The fairness norms result in much more balanced outcomes, even with no time pressure, and seem to be closer outcomes in the real world. Negotiations are rarely isolated, but can also be influenced by external factors such as additional bargaining opportunities. We therefore also consider bargaining within a market-like setting, where both buyers and sellers can bargain with several opponents before reaching an agreement. The negotiations are executed consecutively until an agreement is reached or no more opportunities are available. Each bargaining game is reduced to a single round, where player 1 makes an offer and player 2 can only respond by rejecting or accepting this offer. Using an evolutionary simulation we study several properties of this market game. It appears that the outcomes depend on the information that is available to the players. If players are informed about the bargaining opportunities of their opponents, the first player in turn has the advantage and always proposes a take-it-or-leave-it deal that leaves the other player with a relatively poor outcome. This outcome is consistent with a game-theoretic analysis which we also present in this thesis. If this information is not available, a theoretical analysis is very hard. The evolutionary simulation, however, shows that in this case the responder obtains a better deal. This occurs because the first player can no longer anticipate the response of the other player, and therefore bids lower to avoid a disagreement. In this thesis, we additionally consider other factors that influence the outcomes of the market game, such as negotiation over multiple issues simultaneously, search costs, and break off probabilities. Besides fundamental issues, this thesis presents a number of business-related applications of automated bargaining, as well as generic bargaining strategies for agents that can be employed in related areas. As a first application, we introduce a framework where negotiation is used for recommending shops to customers, for example on a web page of an electronic shopping mall. Through a market-driven auction a relevant selection of shops is determined in a distributed fashion. This is achieved by selling a limited number of banner spaces in an electronic auction. For each arriving customer on the web page, shops can automatically place bids for this “customer attention space” through their shop agents. These software agents bid based on a customer profile, containing personal data of the customer, such as age, interests, and/or keywords in a search query. The shop agents are adaptive and learn, given feedback from the customers, which profiles to target and how much to bid in the auction. The highest bidders are then selected and displayed to the customer. The feasibility of this distributed approach for matching shops to customers is demonstrated using an evolutionary simulation. Several customer models and auction mechanisms are studied, and we show that the market-based approach results in a proper selection of shops for the customers. Bargaining can be especially beneficial if not only the price, but other aspects are considered as well. This allows for example to customise products and services to the personal preferences of a user. We developed a system makes use of these properties for selling and personalising so-called information goods, such as news articles, software, and music. Using the alternating-offers protocol, a seller agent negotiates with several buyers simultaneously about a fixed price, a per-item price, and the quality of a bundle of information goods. The system is capable of taking into account important business-related conditions such as the fairness of the negotiation. The agents combine a search strategy and a concession strategy to generate offers in the negotiations. The concession strategy determines the amount the agent will concede each round, whereas the search strategy takes care of the personalisation of the offer. We introduce two search strategies in this thesis, and show through computer experiments that the use of these strategies by a buyer and seller agent, result in personalised outcomes, also when combined with various concession strategies. The search strategies presented here can be easily applied to other domains where personalisation is important. In addition, we also developed concession strategies for the seller agent that can be used in settings where a single seller agent bargains with several buyer agents simultaneously. Even if bargaining itself is bilateral (i.e., between two parties), a seller agent can actually benefit from the fact that several such negotiations occur concurrently. The developed strategies are focussed on domains where supply is flexible and can be adjusted to meet demand, like for information goods. We study fixed strategies, time-dependent strategies and introduce several auction-inspired strategies. Auctions are often used when one party negotiates with several opponents simultaneously. Although the latter strategies benefit from the advantages of auctions, the actual negotiation remains bilateral and consists of exchanging offers and counter offers. We developed an evolutionary simulation environment to evaluate the seller agent’s strategies. We especially consider the case where buyers are time-impatient and under pressure to reach agreements early. The simulations show that the auction-inspired strategies are able to obtain almost maximum profits from the negotiations, given sufficient time pressure of the buyers.
机译:由于电子商务领域的发展,特别是自主软件代理的发展,讨价还价变得越来越重要。软件代理是程序,可以根据用户的指令自动并智能地实现给定任务。通过这样的代理,讨价还价过程可以自动化,从而使产品和服务以及相关条件(例如保修和交货时间)变得灵活,并可以根据相关人员的个人喜好进行调整。在本文中,我们集中讨论了讨价还价的基本方面以及使用软件代理进行自动讨价还价的业务相关应用程序。基础部分研究了在一个程式化的世界中的讨价还价结果以及影响这些结果的因素。这可以为生产软件代理,策略以及为实际情况设置讨价还价规则提供见解。我们使用讨价还价代理的计算模拟研究了这些方面。因此,我们考虑采用自适应系统,即代理商根据过去的经验学习调整其议价策略。使用进化算法来模拟这种学习行为。这些算法起源于人工智能领域,并受到进化生物学理论的启发。最初,进化算法是为解决优化问题而设计的,但是现在在经济学中越来越多地使用它们来对人类学习行为进行建模。除了计算模拟,我们还考虑相对简单情况下博弈论的数学解决方案。博弈论主要与“理性人”有关,也就是说,在人们理性行动的程式化环境(或游戏)中具有最佳结果。我们使用博弈论的结果来验证计算实验。计算机模拟的优点在于,不需要严格的假设,并且可以研究更接近实际设置的更复杂的交互。首先,我们研究一个讨价还价的环境,其中两个玩家交换报价和​​还价,即所谓的交替报价游戏。该游戏通常用于为诸如产品或服务的价格等讨价还价建模。但是,允许协商其他与产品和服务相关的方面(例如质量,交货时间和保修)也很重要。通过承认不太重要的问题并要求相对重要的方面具有更高的价值,这可以实现折衷。这样,讨价还价的竞争性降低了,结果可能是互惠互利的。因此,我们使用计算仿真来研究交替要约游戏的扩展版本,在该版本中同时协商多个方面。此外,我们运用博弈论来验证计算实验的结果。仿真表明,学习主体能够快速找到最佳折衷方案,也称为帕累托有效结果。另外,我们研究了由于谈判的可能性小(例如由于外部事件)而导致的时间压力的影响。在没有时间压力和最大数量的谈判回合的情况下,结果是非常不平衡的:有机会提出最终要约的玩家在最后一轮中提出了接受还是放弃的要约,而其他玩家的交易总比没有交易好。另一方面,在相对较高的时间压力下,第一个报价最重要,并且几乎所有协议都在第一轮达成。另一个有趣的结果是,尽管学习主体不是“理性的”,但经过长时间学习后的模拟结果通常与博弈论的结果一致。实际上,不仅最终结果很重要,而且其他因素也很重要,例如要约的公平性。通过仿真,我们研究了这种公平规范对讨价还价结果的影响。公平规范即使在没有时间压力的情况下也能带来更加均衡的结果,并且在现实世界中似乎更接近结果。谈判很少是孤立的,但也会受到外部因素(例如额外的议价机会)的影响。因此,我们还考虑在类似市场的环境中进行讨价还价,在这种情况下,买卖双方都可以在达成协议之前与几个对手进行讨价还价。谈判将连续执行,直到达成协议或没有更多机会。每个讨价还价游戏减少到一个回合,玩家1提出要约,而玩家2只能通过拒绝或接受此要约来做出回应。使用进化模拟,我们研究了该市场游戏的几个属性。看来结果取决于玩家可用的信息。如果玩家被告知对手的讨价还价机会,则第一位玩家反而会获得优势,并且总是提出“先买后走”的交易,这会使其他玩家获得相对较差的结果。这一结果与我们在本文中也提出的博弈论分析是一致的。如果无法获得此信息,则很难进行理论分析。但是,进化仿真表明,在这种情况下,响应者可以获得更好的交易。发生这种情况是因为第一个玩家无法再预期其他玩家的反应,因此降低了出价以避免分歧。在本文中,我们还考虑了其​​他影响市场博弈结果的因素,例如同时就多个问题进行谈判,搜索成本和中断概率。除了基本问题外,本论文还介绍了自动议价的许多与业务相关的应用,以及可以在相关领域使用的代理商的通用议价策略。作为第一个应用程序,我们引入了一个框架,在该框架中,协商用于向顾客推荐商店,例如在电子购物中心的网页上。通过以市场为导向的拍卖,以分布方式确定了相关的商店选择。这可以通过在电子拍卖中出售数量有限的横幅空间来实现。对于网页上的每个到达客户,商店可以通过其商店代理商自动为该“客户关注空间”出价。这些软件代理基于客户资料进行出价,该资料包含客户的个人数据,例如年龄,兴趣和/或搜索查询中的关键字。店铺代理商具有适应性,可以根据客户的反馈进行学习,了解要定位的配置文件以及竞标价格。然后,选择最高出价者并将其显示给客户。通过进化仿真证明了这种将商店与客户匹配的分布式方法的可行性。研究了几种客户模型和拍卖机制,我们证明了基于市场的方法可以为客户正确选择商店。如果不仅考虑价格,还考虑其他方面,讨价还价会特别有益。例如,这允许根据用户的个人喜好定制产品和服务。我们开发了一种系统,利用这些属性来销售和个性化所谓的信息商品,例如新闻文章,软件和音乐。使用交替报价协议,卖方代理可以与几个买方同时就固定价格,每件价格和一捆信息商品的质量进行谈判。该系统能够考虑与业务相关的重要条件,例如谈判的公平性。代理商结合了搜索策略和让步策略以在谈判中产生要约。优惠策略决定了代理商每轮将让出的金额,而搜索策略则考虑了要约的个性化。我们在本文中介绍了两种搜索策略,并通过计算机实验证明,买方和卖方代理使用这些策略会导致个性化的结果,同时还会结合各种优惠策略。此处介绍的搜索策略可以轻松应用于个性化很重要的其他领域。此外,我们还为卖方代理制定了优惠策略,可在单个卖方代理同时与多个买方代理讨价还价的环境中使用。即使讨价还价本身是双边的(即,在两个当事方之间),卖方代理实际上也可以从多次同时进行这样的谈判中受益。制定的策略专注于供应灵活且可以调整以满足需求的领域,例如信息产品。我们研究固定策略,时间依赖型策略,并介绍几种拍卖启发型策略。当一方同时与多个对手进行谈判时,通常会使用拍卖。尽管后者的策略得益于拍卖的优势,但实际的谈判仍是双边的,包括交换要约和反要约。我们开发了一种进化模拟环境来评估卖方代理商的策略。我们特别考虑买家在时间上急躁并且承受着尽早达成协议的压力的情况。模拟表明,在买家有足够时间压力的情况下,拍卖启发型策略能够从谈判中获得几乎最大的利润。

著录项

  • 作者

    Gerding E.H.;

  • 作者单位
  • 年度 2004
  • 总页数
  • 原文格式 PDF
  • 正文语种 {"code":"en","name":"English","id":9}
  • 中图分类

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号