REDUCING INSTANCES OF INCLUSION OF DATA ASSOCIATED WITH HINDSIGHT BIAS IN A TRAINING SET OF DATA FOR A MACHINE LEARNING SYSTEM

Abstract

Instances of data associated with hindsight bias in a training set of data for a machine learning system can be reduced. A first set of data, having a first set of fields, can be received. Data in a first field can be analyzed with respect to data in a second field, where the second field corresponds to an event to be predicted. A result of the analysis can be a determination that the data in the first field is associated with hindsight bias. A second set of data, having a second set of fields that excludes the first field, can be produced. One or more features associated with the second set of data can be generated. A third set of data, having the second set of fields and fields that correspond to the one or more features, can be produced. The training set of data can be produced using the third set of data.
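The flow described in the abstract can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the method claimed in the patent: the abstract does not say how a field is analyzed against the event to be predicted, so the null-pattern heuristic below (flag a field when "is it populated?" nearly always agrees with the label) is an assumed stand-in, and it can misfire on heavily imbalanced labels. All function names, the 0.95 threshold, and the sample lead records are hypothetical.

```python
from typing import Any, Callable, Dict, List, Tuple

Row = Dict[str, Any]


def null_pattern_agreement(rows: List[Row], field: str, label: str) -> float:
    """Assumed heuristic: how often 'field is populated' agrees with the
    label (or with its inverse). Values near 1.0 suggest the field is only
    filled in once the outcome is already known."""
    populated = [row.get(field) is not None for row in rows]
    if all(populated) or not any(populated):
        return 0.0  # no variation in the null pattern, so nothing to compare
    matches = sum(p == bool(row[label]) for p, row in zip(populated, rows))
    rate = matches / len(rows)
    return max(rate, 1.0 - rate)


def find_hindsight_fields(rows: List[Row], label: str, threshold: float) -> List[str]:
    """Analyze each first field against the second field (the label)."""
    candidates = [name for name in rows[0] if name != label]
    return [f for f in candidates
            if null_pattern_agreement(rows, f, label) >= threshold]


def drop_fields(rows: List[Row], fields: List[str]) -> List[Row]:
    """Produce the second set of data, whose fields exclude the flagged ones."""
    excluded = set(fields)
    return [{k: v for k, v in row.items() if k not in excluded} for row in rows]


def add_features(rows: List[Row],
                 features: Dict[str, Callable[[Row], Any]]) -> List[Row]:
    """Produce the third set of data: remaining fields plus feature fields."""
    return [{**row, **{name: fn(row) for name, fn in features.items()}}
            for row in rows]


def build_training_set(first_set: List[Row], label: str,
                       features: Dict[str, Callable[[Row], Any]],
                       threshold: float = 0.95) -> Tuple[List[Row], List[str]]:
    flagged = find_hindsight_fields(first_set, label, threshold)
    second_set = drop_fields(first_set, flagged)
    third_set = add_features(second_set, features)
    return third_set, flagged


# Hypothetical records: "closed_date" is only filled in after a lead converts.
leads = [
    {"employees": 10, "industry": "retail", "closed_date": "2020-01-05", "converted": 1},
    {"employees": 250, "industry": None, "closed_date": None, "converted": 0},
    {"employees": 40, "industry": "software", "closed_date": "2020-02-11", "converted": 1},
    {"employees": 900, "industry": "retail", "closed_date": None, "converted": 0},
]

training_set, flagged = build_training_set(
    leads,
    label="converted",
    features={"is_large": lambda row: row["employees"] >= 100},
)
```

In this sample, only `closed_date` is flagged, because it is populated exactly when `converted` is 1; `industry` is missing in one row but does not track the label closely enough to reach the threshold. A production analysis would likely use a stronger association measure and a held-out check, which the abstract leaves unspecified.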
