Colorful XML


Abstract

XML has a tree-structured data model, which is used to uniformly represent structured as well as semi-structured data, and which also enables concise query specification in XQuery through its XPath (twig) patterns. This in turn can leverage the recently developed technology of structural join algorithms to evaluate queries efficiently. In this paper, we identify a fundamental tension in XML data modeling: (i) data represented as deep trees (which can make effective use of twig patterns) are often un-normalized, leading to update anomalies, while (ii) normalized data tends to be shallow, resulting in heavy use of expensive value-based joins in queries.

Our solution to this data modeling problem is a novel multi-colored trees (MCT) logical data model, which is an evolutionary extension of the XML data model and permits trees with multi-colored nodes to signify their participation in multiple hierarchies. This adds significant semantic structure to individual data nodes. We extend XQuery expressions to navigate between structurally related nodes, taking color into account, and also to create new colored trees as restructurings of an MCT database. While MCT serves as a significant evolutionary extension to XML as a logical data model, one of the key roles of XML is information exchange. To enable exchange of MCT information, we develop algorithms for optimally serializing an MCT database as XML. We discuss alternative physical representations for MCT databases, using relational and native XML databases, and describe an implementation on top of the Timber native XML database. Experimental evaluation, using our prototype implementation, shows that MCT queries and updates are not only more succinct and easier to express than equivalent shallow tree XML queries, but can also be significantly more efficient to evaluate than equivalent deep and shallow tree XML queries and updates.
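To make the idea of a node participating in multiple hierarchies concrete, the following is a minimal Python sketch, not the paper's implementation or its XQuery extension. The class `MCTNode`, the color names `"red"` and `"green"`, the movie/genre/award data, and the helper `award_categories_for_genre` are all hypothetical, chosen only for illustration. The sketch assumes each node has at most one parent per color.

```python
from collections import defaultdict


class MCTNode:
    """Hypothetical data node that may belong to several colored hierarchies."""

    def __init__(self, label):
        self.label = label
        self.parent = {}                   # color -> parent node in that hierarchy
        self.children = defaultdict(list)  # color -> ordered child nodes

    @property
    def colors(self):
        # A node carries a color if it has a parent or children in that color.
        with_children = {c for c, kids in self.children.items() if kids}
        return set(self.parent) | with_children

    def add_child(self, color, child):
        # Assumption of this sketch: one parent per color, so each color is a tree.
        if color in child.parent:
            raise ValueError(f"{child.label!r} already has a {color} parent")
        child.parent[color] = self
        self.children[color].append(child)
        return child

    def ancestors(self, color):
        # Walk upward, following only edges of the given color.
        node = self.parent.get(color)
        while node is not None:
            yield node
            node = node.parent.get(color)

    def descendants(self, color):
        # Walk downward in document order, following only edges of the given color.
        for kid in self.children.get(color, []):
            yield kid
            yield from kid.descendants(color)


# Illustrative data: one movie node shared by a "red" genre hierarchy
# and a "green" award hierarchy.
genres = MCTNode("genres")
comedy = genres.add_child("red", MCTNode("comedy"))
awards = MCTNode("awards")
best_picture = awards.add_child("green", MCTNode("best-picture"))
movie = MCTNode("movie")
comedy.add_child("red", movie)
best_picture.add_child("green", movie)


def award_categories_for_genre(genre_node):
    """Go down the red hierarchy, then up the green one, with no value-based join."""
    return sorted(
        {a.label for m in genre_node.descendants("red") for a in m.ancestors("green")}
    )
```

Because the movie is stored once and linked into both hierarchies, an update touches a single node (avoiding the update anomalies of un-normalized deep trees), while a query can switch from one color to another at that node structurally rather than by matching values.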
