首页> 外文期刊>Intelligent Data Analysis >Mining a large database with a parallel database server
【24h】

Mining a large database with a parallel database server

机译:使用并行数据库服务器挖掘大型数据库

获取原文
获取原文并翻译 | 示例

摘要

Data mining is a data-intensive computation activity. Parallel processing has often been used in data mining al- gorithms. However, when data do not fit in memory, some solutions do not apply and a database system may be required rather than flat files. Most of the implementations use the database system loosely coupled with the data mining techniques. Hence, the database system only issues quenes to be processed on the client machine. In this work. we address the data consuming activities through parallel processing on a database server providing a tight integration with data mining techniques. Experimental results showing the potential benefits of this integration were obtained. Despite the difficulties in processing a complex application, we extracted rules and obtained high performance on all the data-intensive activities such as the construction of the decision tree, pruning and rule extraction.
机译:数据挖掘是一项数据密集型计算活动。并行处理通常用于数据挖掘算法中。但是,当数据不适合内存时,某些解决方案将不适用,并且可能需要数据库系统而不是平面文件。大多数实现都使用与数据挖掘技术松散耦合的数据库系统。因此,数据库系统仅发出要在客户端计算机上处​​理的队列。在这项工作中。我们通过在数据库服务器上进行并行处理来解决数据消耗活动,并与数据挖掘技术紧密集成。实验结果表明了这种整合的潜在好处。尽管在处理复杂的应用程序方面存在困难,但是我们提取了规则并在所有数据密集型活动(例如决策树的构建,修剪和规则提取)中获得了高性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号