首页> 外国专利> DEEP REINFORCEMENT LEARNING FOR LONG TERM REWARDS IN AN ONLINE CONNECTION NETWORK

DEEP REINFORCEMENT LEARNING FOR LONG TERM REWARDS IN AN ONLINE CONNECTION NETWORK

机译：在在线连接网络中长期奖励的深度增强学习

页面导航

摘要
著录项
相似文献

摘要

An online connection server is configured to more accurately predict connections for a viewing member of an online connection network. The online connection server may implement a machine-learning model that uses prior interactions by the viewing member to determine those connections that are likely to lead to more substantial interactions with the viewing member. The machine-learning model may be implemented using a reinforcement learning technique, such as a Deep Q network. The online connection server may further implement a state representation module that generates a state from a graph-based embedding of the viewing member profile, where the state is used to train the machine-learning model and determine an optimal candidate to recommend as a connection for the viewing member.

机译：在线连接服务器被配置为更准确地预测在线连接网络的查看成员的连接。在线连接服务器可以实现一种机器学习模型，该机器学习模型使用观看构件的先前交互来确定可能导致与观看构件更实质相互作用的那些连接。可以使用诸如深Q网络的加强学习技术来实现机器学习模型。在线连接服务器可以进一步实现一种状态表示模块，该状态表示模块从查看成员简档的基于图形的嵌入生成状态，其中状态用于训练机器学习模型并确定最佳候选者以推荐为连接观看会员。

著录项

公开/公告号US2021216944A1

专利类型
公开/公告日2021-07-15

原文格式PDF
申请/专利权人 MICROSOFT TECHNOLOGY LICENSING LLC;
展开▼

申请/专利号US202016743486
发明设计人 SIYUAN GAO;YIOU XIAO;PARAG AGRAWAL;AASTHA JAIN;
展开▼

申请日2020-01-15
分类号G06Q10/06;H04L29/08;G06Q50;G06N20;
国家 US
入库时间 2022-08-24 19:56:40

相似文献

专利
外文文献
中文文献