首页> 外文期刊>Computing >A fast and distributed algorithm for mining frequent patterns in congested networks
【24h】

A fast and distributed algorithm for mining frequent patterns in congested networks

机译:一种用于挖掘拥塞网络中频繁模式的快速分布式算法

获取原文
获取原文并翻译 | 示例
           

摘要

With advances in technology, frequent pattern mining has been used widely in our daily lives. By using this technology, one can obtain interesting or useful information that would help one make decisions and apply judgment. For example, marketplace managers mine transaction data to obtain information that can help improve services, understand customer buying habits, determine a suitable scheme for placement of goods to increase profits, or for medical and biotechnology applications. However, the rate at which data is generated is very rapid, leading to problems caused by Big Data. Therefore, many researchers have studied distributed, parallel and cloud computing technology to select the best among them. However, data mining uses multiple computing nodes, which requires the transmission of a considerable amount of data in a network environment. The available network bandwidth is limited when many different tasks are being transmitted at the same time and many servers are working in the same network segment. This results in poor transmission, causing severe transfer delay, either internal or external to the network. Thus, we propose the fast and distributed mining algorithm for discovering frequent patterns in congested networks (FDMCN) algorithm, which is based on CARM. The main purpose is to reduce FP-tree transmission such that only a portion of the information is required for mining using computing nodes. The results of empirical evaluation under various simulation conditions show that the proposed method FDMCN delivers excellent performance in terms of execution efficiency and scalability when compared with the PSWS algorithm.
机译:随着技术的进步,频繁的模式挖掘已广泛应用于我们的日常生活中。通过使用这项技术,人们可以获得有趣或有用的信息,这些信息将有助于人们做出决定并做出判断。例如,市场经理挖掘交易数据以获得有助于改善服务,了解客户购买习惯,确定合适的商品放置方案以增加利润,或用于医疗和生物技术应用的信息。但是,数据的生成速度非常快,从而导致了由大数据引起的问题。因此,许多研究人员研究了分布式,并行和云计算技术,以从中选择最佳。但是,数据挖掘使用多个计算节点,这需要在网络环境中传输大量数据。当同时传输许多不同的任务并且许多服务器在同一网段中工作时,可用的网络带宽受到限制。这导致传输质量差,导致网络内部或外部的严重传输延迟。因此,我们提出了一种基于CARM的快速分布式挖掘算法,用于发现拥塞网络中的频繁模式(FDMCN)算法。主要目的是减少FP树传输,以便使用计算节点进行挖掘仅需要一部分信息。在各种仿真条件下的经验评估结果表明,与PSWS算法相比,所提出的方法FDMCN在执行效率和可伸缩性方面具有出色的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号