首页> 外文会议>Pacific Rim international conference on artificial intelligence >Toward Reciprocity-Aware Distributed Learning in Referral Networks
【24h】

Toward Reciprocity-Aware Distributed Learning in Referral Networks

机译:在推荐网络中实现互惠意识的分布式学习

获取原文

摘要

Distributed learning in expert referral networks is an emerging challenge in the intersection of Active Learning and Multi-Agent Reinforcement Learning, where experts—humans or automated agents— have varying skills across different topics and can redirect difficult problem instances to connected colleagues with more appropriate expertise. The learning-to-refer challenge involves estimating colleagues' topic-conditioned skills for appropriate referrals. Prior research has investigated different reinforcement learning algorithms both with uninforma-tive priors and partially available (potentially noisy) priors. However, most human experts expect mutually-rewarding referrals, with return referrals on their expertise areas so that both (or all) parties benefit from networking, rather than one-sided referral flow. This paper analyzes the extent of referral reciprocity imbalance present in high-performance referral-learning algorithms, specifically multi-armed bandit (MAB) methods belonging to two broad categories - frequentist and Bayesian - and demonstrate that both algorithms suffer considerably from reciprocity imbalance. The paper proposes modifications to enable distributed learning methods to better balance referral reciprocity and thus make referral networks win-win for all parties. Extensive empirical evaluations demonstrate substantial improvement in mitigating reciprocity imbalance, while maintaining reasonably high overall solution performance.
机译:专家推荐网络中的分布式学习是主动学习和多智能体强化学习相交中的新兴挑战,其中专家(人员或自动代理)在不同主题上具有不同的技能,并且可以将困难的问题实例重定向到具有更适当专业知识的互联同事。学习参考挑战涉及估算同事的主题条件技能,以进行适当的推荐。先前的研究已经研究了不同的强化学习算法,它们既具有非信息性的先验,也具有部分可用的(可能有噪声的)先验。但是,大多数人类专家都希望相互推荐,并且在他们的专业知识领域拥有返回推荐,这样双方(或所有)方都可以从网络中受益,而不是单方面的推荐流程。本文分析了高性能推荐学习算法中存在的推荐互惠不平衡的程度,特别是属于两大类(常客和贝叶斯)的多臂匪(MAB)方法,并证明这两种算法都存在互惠不平衡的问题。本文提出了一些修改,以使分布式学习方法能够更好地平衡推荐人的互惠性,从而使推荐人网络对所有各方都是双赢的。广泛的经验评估表明,在减轻互惠不平衡的同时,还可以保持相当高的总体解决方案性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号