首页> 外文会议>New frontiers in applied data mining. >A Structure Preserving Flat Data Format Representation for Tree-Structured Data
【24h】

A Structure Preserving Flat Data Format Representation for Tree-Structured Data

机译:树状结构数据的保留平面数据格式表示的结构

获取原文
获取原文并翻译 | 示例

摘要

Mining of semi-structured data such as XML is a popular research topic due to many useful applications. The initial work focused mainly on values associated with tags, while most of recent developments focus on discovering association rules among tree structured data objects to preserve the structural information. Other data mining techniques have had limited use in tree-structured data analysis as they were mainly designed to process flat data format with no need to capture the structural properties of data objects. This paper proposes a novel structure-preserving way for representing tree-structured document instances as records in a standard flat data structure to enable applicability of a wider range of data analysis techniques. The experiments using synthetic and real world data demonstrate the effectiveness of the proposed approach.
机译:由于许多有用的应用程序,诸如XML之类的半结构化数据的挖掘是一个流行的研究主题。最初的工作主要集中在与标签关联的值上,而最近的大多数发展集中在发现树状数据对象之间的关联规则以保存结构信息上。其他数据挖掘技术在树状结构数据分析中的使用受到限制,因为它们主要用于处理平面数据格式,而无需捕获数据对象的结构特性。本文提出了一种新的结构保留方法,用于将树状结构的文档实例表示为标准平面数据结构中的记录,以实现更广泛的数据分析技术的适用性。使用合成和真实世界数据进行的实验证明了该方法的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号