首页> 外国专利> PRE-STATISTICS OF DATA FOR NODE OF DECISION TREE

PRE-STATISTICS OF DATA FOR NODE OF DECISION TREE

机译:决策树节点的数据前统计

摘要

Embodiments of the subject matter described herein relate to generating a decision tree based on data pre-statistics. A plurality of data samples for a node of the decision tree are obtained, and the plurality of data samples have corresponding feature values with respect to a first feature. A target range is determined from a plurality of predefined numerical ranges so that the number of feature values falling into the target range is greater than a predetermined threshold number. Then, the remaining of the feature values other than the feature values falling into the target range are assigned to the respective numerical ranges, and the feature values falling into all the numerical ranges are counted based on the assignment of the remaining of the feature values, for allocation of the plurality of data samples to child nodes of the node. Accordingly, the data processing efficiency is substantially improved.
机译:本文描述的主题的实施例涉及基于数据预统计来生成决策树。获得用于决策树的节点的多个数据样本,并且该多个数据样本具有关于第一特征的对应特征值。从多个预定数值范围确定目标范围,使得落入目标范围的特征值的数量大于预定阈值数量。然后,将落入目标范围的特征值以外的其余特征值分配给各个数值范围,并基于其余特征值的分配来计数落入所有数值范围的特征值,用于将多个数据样本分配给节点的子节点。因此,大大提高了数据处理效率。

著录项

  • 公开/公告号US2019355124A1

    专利类型

  • 公开/公告日2019-11-21

    原文格式PDF

  • 申请/专利权人 MICROSOFT TECHNOLOGY LICENSING LLC;

    申请/专利号US201816479536

  • 发明设计人 HUCHENG ZHOU;CUI LI;

    申请日2018-01-16

  • 分类号G06T7/162;G06N20;G06N5;G06F16/901;G06K9/62;

  • 国家 US

  • 入库时间 2022-08-21 11:20:35

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号