首页> 外文会议>Machine learning and data mining in pattern recognition >PMCRI: A Parallel Modular Classification Rule Induction Framework
【24h】

PMCRI: A Parallel Modular Classification Rule Induction Framework

机译:PMCRI:并行模块化分类规则归纳框架

获取原文
获取原文并翻译 | 示例

摘要

In a world where massive amounts of data are recorded on a large scale we need data mining technologies to gain knowledge from the data in a reasonable time. The Top Down Induction of Decision Trees (TDIDT) algorithm is a very widely used technology to predict the classification of newly recorded data. However alternative technologies have been derived that often produce better rules but do not scale well on large datasets. Such an alternative to TDIDT is the PrismTCS algorithm. PrismTCS performs particularly well on noisy data but does not scale well on large datasets. In this paper we introduce Prism and investigate its scaling behaviour. We describe how we improved the scalability of the serial version of Prism and investigate its limitations. We then describe our work to overcome these limitations by developing a framework to parallelise algorithms of the Prism family and similar algorithms. We also present the scale up results of a first prototype implementation.
机译:在一个大规模记录大量数据的世界中,我们需要数据挖掘技术来在合理的时间内从数据中获取知识。自上而下的决策树归纳(TDIDT)算法是一种用于预测新记录数据分类的非常广泛的技术。但是,已经找到了替代技术,这些技术通常会产生更好的规则,但在大型数据集上无法很好地扩展。 TDIDT的这种替代方法是PrismTCS算法。 PrismTCS在嘈杂的数据上表现特别出色,但在大型数据集上却无法很好地扩展。在本文中,我们介绍Prism并研究其缩放行为。我们描述了如何提高Prism串行版本的可伸缩性并研究其局限性。然后,我们通过开发框架来并行化Prism系列算法和类似算法来描述克服这些限制的工作。我们还介绍了第一个原型实现的放大结果。

著录项

  • 来源
  • 会议地点 Leipzig(DE);Leipzig(DE)
  • 作者单位

    University of Portsmouth, Buckingham Building, Lion Terrace,Portsmouth PO1 3HE, United Kingdom;

    rnUniversity of Portsmouth, Buckingham Building, Lion Terrace,Portsmouth PO1 3HE, United Kingdom;

    rnUniversity of Portsmouth, Buckingham Building, Lion Terrace,Portsmouth PO1 3HE, United Kingdom;

  • 会议组织
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 计算机的应用;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号