首页> 外国专利> PUSHING ITEMS TO USERS BASED ON A REINFORCEMENT LEARNING MODEL

PUSHING ITEMS TO USERS BASED ON A REINFORCEMENT LEARNING MODEL

机译：基于强化学习模型的用户项目

页面导航

摘要
著录项
相似文献

摘要

This disclosure is related to determining an item push list for a user based on a reinforcement learning model. In one aspect, a method includes obtaining M first item lists that have been predetermined for a first user. Each first item list includes i-1 items. For each first item list, an ith state feature vector is obtained. The ith state feature vector includes a static feature and a dynamic feature. The ith state feature vector is provided as input to the reinforcement machine learning model. The reinforcement model outputs a weight vector including weights of sorting features. A sorting feature vector of each item in a candidate item set corresponding to the first item list is obtained. The sorting feature vector includes feature values of sorting features. M updated item lists are determined for the first item lists based on a score for each item in M candidate item sets.

机译：本公开涉及基于强化学习模型为用户确定项目推送列表。在一个方面，一种方法包括获得已经为第一用户预定的M个第一项目列表。每个第一项列表包括i-1个项。对于每个第一项目列表，获得第i个状态特征向量。第i个状态特征向量包括静态特征和动态特征。第i个状态特征向量作为增强机器学习模型的输入提供。增强模型输出包含分类特征权重的权重向量。获得与第一项目列表相对应的候选项目集中的每个项目的分类特征向量。排序特征向量包括排序特征的特征值。基于M个候选项目集中每个项目的得分，为第一个项目列表确定M个更新的项目列表。

著录项

公开/公告号US2020342268A1

专利类型
公开/公告日2020-10-29

原文格式PDF
申请/专利权人 ALIBABA GROUP HOLDING LIMITED;
展开▼

申请/专利号US202016813654
发明设计人 CEN CHEN;XU HU;CHILIN FU;XIAOLU ZHANG;
展开▼

申请日2020-03-09
分类号G06K9/62;G06F9/30;G06F17/16;G06N5/04;
国家 US
入库时间 2022-08-21 11:22:52

相似文献

专利
外文文献
中文文献