首页> 外国专利> Methods for building regression trees in a distributed computing environment

Methods for building regression trees in a distributed computing environment

机译:在分布式计算环境中构建回归树的方法

摘要

Systems and methods are disclosed for building and using decision trees, preferably in a scalable and distributed manner. Our system can be used to create and use classification trees, regression trees, or a combination of regression trees called a gradient boosted regression tree (GBRT). Our system leverages approximate histograms in new ways to process large datasets, or data streams, while limiting inter-process communication bandwidth requirements. Further, in some embodiments, a scalable network of computers or processors is utilized for fast computation of decision trees. Preferably, the network comprises a tree structure of processors, comprising a master node and a plurality of worker nodes or “workers,” again arranged to limit necessary communications.
机译:公开了用于优选地以可缩放和分布式的方式来构建和使用决策树的系统和方法。我们的系统可用于创建和使用分类树,回归树或称为梯度增强回归树(GBRT)的回归树的组合。我们的系统以新的方式利用近似直方图来处理大型数据集或数据流,同时限制了进程间通信带宽的要求。此外,在一些实施例中,计算机或处理器的可扩展网络被用于快速计算决策树。优选地,网络包括处理器的树形结构,该处理器的树形结构包括主节点和多个工作节点或“工作人员”,再次被布置为限制必要的通信。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号