首页> 外国专利> Apparatus and method for association rule mining using frequent pattern-tree for incremental data processing

Apparatus and method for association rule mining using frequent pattern-tree for incremental data processing

机译:使用频繁模式树进行增量数据处理的关联规则挖掘的设备和方法

摘要

Disclosed are an apparatus and method for extracting association rules using a frequent pattern tree for processing progressively increasing data. The data sorting unit gradually increases over time and sorts the items corresponding to the plurality of transaction IDs included in the initial transaction data collected for a predetermined time, according to the sorting order in which the incidence decreases. The header table generator generates a header table in which items included in the initial transaction data and the frequentness corresponding to each item are sorted according to the sorting order. The tree generating unit generates a frequent pattern tree composed of nodes including identification codes and frequent degree information of each item included in the initial transaction data. The data updater updates the header table and the frequent pattern tree based on the transaction IDs and items included in the new transaction data sequentially collected over time after the initial transaction data is collected. The frequent pattern extractor searches each node of the frequent pattern tree and sequentially extracts a frequent pattern for each item from an item located at the bottom of the header table. According to the present invention, it is possible to reduce the calculation amount and increase the processing speed for extracting association rules by processing only newly collected transaction data without having to process the entire transaction data in order to extract the association rule from the gradually increasing transaction data. .
机译:公开了一种用于使用频繁模式树来提取关联规则以处理逐渐增加的数据的设备和方法。数据分类单元随时间逐渐增加,并根据发生率降低的分类顺序,对与在预定时间内收集的初始交易数据中包括的多个交易ID相对应的项目进行分类。标题表生成器生成标题表,在该标题表中,根据排序顺序对初始交易数据中包括的项目以及与每个项目相对应的频繁度进行排序。树生成单元生成由包括识别代码和初始交易数据中包括的每个项目的频繁程度信息的节点组成的频繁模式树。在收集初始交易数据之后,数据更新器基于随时间顺序收集的新交易数据中包括的交易ID和项目来更新报头表和频繁模式树。频繁模式提取器搜索频繁模式树的每个节点,并从位于标题表底部的项目中依次提取每个项目的频繁模式。根据本发明,通过仅处理新收集的交易数据而不必处理整个交易数据以便从逐渐增加的交易中提取关联规则,可以减少计算量并提高提取关联规则的处理速度。数据。 。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号