首页> 外国专利> DEEP REINFORCEMENT LEARNING FOR LONG TERM REWARDS IN AN ONLINE CONNECTION NETWORK

DEEP REINFORCEMENT LEARNING FOR LONG TERM REWARDS IN AN ONLINE CONNECTION NETWORK

机译:在在线连接网络中长期奖励的深度增强学习

摘要

An online connection server is configured to more accurately predict connections for a viewing member of an online connection network. The online connection server may implement a machine-learning model that uses prior interactions by the viewing member to determine those connections that are likely to lead to more substantial interactions with the viewing member. The machine-learning model may be implemented using a reinforcement learning technique, such as a Deep Q network. The online connection server may further implement a state representation module that generates a state from a graph-based embedding of the viewing member profile, where the state is used to train the machine-learning model and determine an optimal candidate to recommend as a connection for the viewing member.
机译:在线连接服务器被配置为更准确地预测在线连接网络的查看成员的连接。在线连接服务器可以实现一种机器学习模型,该机器学习模型使用观看构件的先前交互来确定可能导致与观看构件更实质相互作用的那些连接。可以使用诸如深Q网络的加强学习技术来实现机器学习模型。在线连接服务器可以进一步实现一种状态表示模块,该状态表示模块从查看成员简档的基于图形的嵌入生成状态,其中状态用于训练机器学习模型并确定最佳候选者以推荐为连接观看会员。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号