首页> 外国专利> METHOD AND SYSTEM OF SELECTING TRAINING FEATURES FOR A MACHINE LEARNING ALGORITHM

METHOD AND SYSTEM OF SELECTING TRAINING FEATURES FOR A MACHINE LEARNING ALGORITHM

机译:机器学习算法的训练特征选择方法和系统

摘要

Methods and systems for selecting a selected-sub-set of features from a plurality of features for training a machine learning module, the training of the machine learning module to enable classification of an electronic document to a target label, the plurality of features associated with the electronic document. In one embodiment, the method comprises analyzing a given training document to extract the plurality of features, and for a given not-yet-selected feature of the plurality of features: generating a set of relevance parameters iteratively, generating a set of redundancy parameters iteratively and determining a feature significance score based on the set of relevance parameters and the set of redundancy parameters. The method further comprises selecting a feature associated with a highest value of the feature significance score and adding the selected feature to the selected-sub-set of features.
机译:用于从用于训练机器学习模块的多个特征中选择特征的选择子集的方法和系统,对机器学习模块的训练以使得能够将电子文档分类为目标标签,所述多个特征与电子文档。在一个实施例中,该方法包括:分析给定的训练文档以提取多个特征,并且针对多个特征中的给定的尚未选择的特征:迭代地生成一组相关性参数,迭代地生成一组冗余参数。根据所述一组相关参数和所述一组冗余参数确定特征重要性得分。该方法还包括选择与特征重要性得分的最高值相关联的特征,并将所选择的特征添加到所选择的特征子集。

著录项

获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号