Conference: International Conference on Data Warehousing and Knowledge Discovery

Mining Maximal Frequently Changing Subtree Patterns from XML Documents


Abstract

Because online information is dynamic, XML documents typically evolve over time, and changes to an XML document's data values or structure may exhibit particular patterns. In this paper, we focus on the sequence of changes to the structure of an XML document and identify which subtrees of the XML structure frequently change together; we call these Frequently Changing Subtree Patterns (FCSPs). To keep the discovered patterns concise, we further define the problem of mining maximal FCSPs. An algorithm derived from FP-growth is developed to mine the set of maximal FCSPs. Experimental results show that our algorithm is substantially faster than the naive algorithm and scales well with respect to the size of the XML structure.
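To make the notion of a maximal pattern concrete, here is a minimal sketch of a brute-force baseline. It is not the FP-growth-derived algorithm from the paper. It assumes each version-to-version delta can be reduced to a set of changed subtree identifiers, and that a pattern is "frequent" when it occurs in at least a `min_support` fraction of the deltas. The abstract does not give the paper's exact FCSP definition, which may use other measures. The function name `maximal_frequent_sets` and the sample identifiers are hypothetical.

```python
from itertools import combinations


def maximal_frequent_sets(deltas, min_support):
    """Brute-force, level-wise search (an assumed baseline, not FP-growth).

    deltas: iterable of sets of subtree identifiers changed in one delta.
    min_support: minimum fraction of deltas that must contain a pattern.
    Returns the frequent patterns that have no frequent proper superset.
    """
    transactions = [frozenset(d) for d in deltas]
    n = len(transactions)
    items = sorted(set().union(*transactions))

    frequent = []
    current = [frozenset([item]) for item in items]  # size-1 candidates
    while current:
        # Keep candidates contained in enough deltas.
        level = [
            c for c in current
            if sum(c <= t for t in transactions) / n >= min_support
        ]
        frequent.extend(level)
        if not level:
            break
        # Join frequent k-sets into (k+1)-set candidates.
        size = len(level[0]) + 1
        current = list(
            {a | b for a, b in combinations(level, 2) if len(a | b) == size}
        )

    # A pattern is maximal if no other frequent pattern strictly contains it.
    return [s for s in frequent if not any(s < other for other in frequent)]


# Hypothetical deltas: each set lists the subtrees changed between versions.
example_deltas = [
    {"book", "author"},
    {"book", "author", "price"},
    {"price"},
    {"book", "author"},
]
example_patterns = maximal_frequent_sets(example_deltas, min_support=0.5)
```

With these sample deltas, reporting only maximal patterns drops `{book}` and `{author}` because the frequent pattern `{book, author}` already covers them. This is the conciseness the maximal-FCSP formulation aims for.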
