
Parallel decision- or regression-tree growing


Abstract

The invention relates to a method of creating decision trees or regression trees for machine learning applications. Training uses parallel computation across multiple computer processors to grow ensemble tree models. More specifically, the invention is characterised by processing units with associated storage units, each comprising a data slice, and a database management system operable to execute a method for growing multiple trees. The method comprises: creating subsets (data bags) of a training dataset for training each of the trees to be grown; splitting the training set into disjoint data subsets and storing them in the data slices; creating root nodes for the trees; assigning the data records of the bags to the root nodes of the trees to be grown; and growing the trees iteratively, where each iteration generates a node level of the ensemble of trees by passing through all of the data records in all slices, with the slices processed in parallel.
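The steps listed in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the function names (`grow_forest`, `scan_slice`, `predict`), the bootstrap bags, the Gini split criterion, the candidate thresholds, and the thread pool standing in for processing units with a database management system are all assumptions made for illustration, since the abstract does not specify them.

```python
import random
from collections import Counter, defaultdict
from concurrent.futures import ThreadPoolExecutor
from functools import partial


def gini(counts):
    """Size-weighted Gini impurity of a label histogram (assumed criterion)."""
    n = sum(counts.values())
    return n - sum(c * c for c in counts.values()) / n if n else 0.0


def scan_slice(slice_, bags, assign, open_nodes, thresholds):
    """One pass over one slice: label counts per (tree, node, feature, threshold, side)."""
    stats = defaultdict(Counter)
    for rid, (x, y) in slice_:
        for t, bag in enumerate(bags):
            node = assign[t].get(rid)  # None if the record is not in this tree's bag
            if (t, node) not in open_nodes:
                continue
            for f, thr in thresholds:
                stats[(t, node, f, thr, x[f] <= thr)][y] += bag[rid]
    return stats


def grow_forest(records, n_trees=3, n_slices=2, max_depth=3, seed=0):
    rng = random.Random(seed)
    n = len(records)
    # Bags: one bootstrap sample per tree, stored as record id -> multiplicity.
    bags = [Counter(rng.randrange(n) for _ in range(n)) for _ in range(n_trees)]
    # Disjoint slices of the training set (round-robin by record id).
    slices = [[(i, records[i]) for i in range(s, n, n_slices)] for s in range(n_slices)]
    # Candidate splits: every observed value of every feature.
    thresholds = sorted({(f, x[f]) for x, _ in records for f in range(len(x))})
    trees = [{0: {}} for _ in range(n_trees)]          # node 0 is each tree's root
    assign = [dict.fromkeys(bag, 0) for bag in bags]   # bag records start at the root
    open_nodes = {(t, 0) for t in range(n_trees)}
    with ThreadPoolExecutor(max_workers=n_slices) as pool:
        for depth in range(max_depth + 1):             # one iteration per node level
            if not open_nodes:
                break
            scan = partial(scan_slice, bags=bags, assign=assign,
                           open_nodes=open_nodes, thresholds=thresholds)
            stats = defaultdict(Counter)
            for part in pool.map(scan, slices):        # slices scanned in parallel
                for key, counts in part.items():
                    stats[key].update(counts)          # merge per-slice statistics
            next_open = set()
            for t, node in open_nodes:
                f0, thr0 = thresholds[0]
                total = stats[(t, node, f0, thr0, True)] + stats[(t, node, f0, thr0, False)]
                best = None
                if depth < max_depth and len(total) > 1:
                    for f, thr in thresholds:
                        left = stats[(t, node, f, thr, True)]
                        right = stats[(t, node, f, thr, False)]
                        if left and right:
                            score = gini(left) + gini(right)
                            if best is None or score < best[0]:
                                best = (score, f, thr)
                if best is None:                       # pure node, depth limit, or no split
                    trees[t][node]["label"] = total.most_common(1)[0][0]
                else:
                    trees[t][node]["split"] = best[1:]
                    for child in (2 * node + 1, 2 * node + 2):
                        trees[t][child] = {}
                        next_open.add((t, child))
            # Move records of split nodes to the new child nodes.
            for t, a in enumerate(assign):
                for rid, node in a.items():
                    if (t, node) in open_nodes and "split" in trees[t][node]:
                        f, thr = trees[t][node]["split"]
                        a[rid] = 2 * node + 1 if records[rid][0][f] <= thr else 2 * node + 2
            open_nodes = next_open
    return trees


def predict(trees, x):
    """Majority vote over the ensemble."""
    votes = Counter()
    for tree in trees:
        node = 0
        while "label" not in tree[node]:
            f, thr = tree[node]["split"]
            node = 2 * node + 1 if x[f] <= thr else 2 * node + 2
        votes[tree[node]["label"]] += 1
    return votes.most_common(1)[0][0]
```

In this sketch each record is a `(features, label)` pair, for example `grow_forest([((i,), int(i >= 20)) for i in range(40)])`. The point it illustrates is that one scan over every slice yields the statistics needed to split all open nodes of all trees at once, so the number of passes over the data grows with tree depth rather than with the number of nodes or trees. For brevity the record reassignment runs in the main thread here; in a sliced setup it could presumably also be done per slice.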
