【24h】

Extended Path Expressions for XML

机译:XML的扩展路径表达式

获取原文
获取原文并翻译 | 示例

摘要

Query languages for XML often use path expressions to locate elements in XML documents. Path expressions are regular expressions such that underlying alphabets represent conditions on nodes. Path expressions represent conditions on paths from the root, but do not represent conditions on siblings, siblings of ancestors, and descendants of such siblings. In order to capture such conditions, we propose to extend underlying alphabets. Each symbol in an extended alphabet is a triplet (e_1, a, e_2), where a is a condition on nodes, and e_1 (e_2) is a condition on elder (resp. younger) siblings and their descendants; e_1 and e_2 are represented by hedge regular expressions, which are as expressive as hedge automata (hedges are ordered sequences of trees). Nodes matching such an extended path expression can be located by traversing the XML document twice. Furthermore, given an input schema and a query operation controlled by an extended path expression, it is possible to construct an output schema. This is done by identifying where in the input schema the given extended path expression is satisfied.
机译:XML的查询语言通常使用路径表达式来定位XML文档中的元素。路径表达式是正则表达式,因此基础字母表示节点上的条件。路径表达式表示从根开始的路径上的条件,但不表示兄弟姐妹,祖先的兄弟姐妹以及此类兄弟姐妹的后代的条件。为了捕获这种情况,我们建议扩展基础字母。扩展字母中的每个符号都是一个三元组(e_1,a,e_2),其中a是节点上的条件,而e_1(e_2)是年长(分别为较年轻)的兄弟姐妹及其后代的条件; e_1和e_2由树篱正则表达式表示,其表达与树篱自动机一样(树篱是树的有序序列)。可以通过遍历XML文档两次来定位与此类扩展路径表达式匹配的节点。此外,给定输入模式和由扩展路径表达式控制的查询操作,可以构造输出模式。通过确定输入模式中满足给定扩展路径表达式的位置来完成此操作。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号