首页> 外国专利> GENERIC WORKFLOW FOR CLASSIFICATION OF HIGHLY IMBALANCED DATASETS USING DEEP LEARNING

GENERIC WORKFLOW FOR CLASSIFICATION OF HIGHLY IMBALANCED DATASETS USING DEEP LEARNING

机译:使用深度学习的高度不平衡数据集分类的通用工作流程

摘要

Methods, systems, and computer-readable storage media for providing a binary classifier include receiving a biased dataset, the biased data set including a plurality of records, each record being assigned to a class of a plurality of classes, one class including a majority class, performing data engineering on at least a portion of the biased dataset to provide a revised dataset, providing a trained deep autoencoder (DAE) by training a DAE using only records assigned to the majority class from the revised dataset, the trained DAE including a binary classifier that classifies records into one of the majority class and a minority class, validating the trained DAE using validation data that is based on at least a portion of the biased dataset, and providing the trained DAE for production use within a production system.
机译:用于提供二进制分类器的方法,系统和计算机可读存储介质包括接收偏置数据集,包括多个记录的偏置数据集,每个记录被分配给多个类的类,包括多个类的一个类,在偏置数据集的至少一部分上执行数据工程以提供修改后的数据集,通过仅使用分配给大多数类的Records从修改的数据集训练DAE提供训练的深度自动介质(DAE),包括二进制数据将记录分类为多数类和少数群体类别的分类器,使用基于偏置数据集的至少一部分的验证数据验证训练的DAE,并在生产系统中提供培训的DAE用于生产使用。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号