...
首页> 外文期刊>WSEAS Transactions on Computers >Reverse engineering XML documents into DTD Graph with SAX
【24h】

Reverse engineering XML documents into DTD Graph with SAX

机译:使用SAX将XML文档反向工程为DTD Graph

获取原文
获取原文并翻译 | 示例
           

摘要

We propose a systematic approach to reverse engineer arbitrary XML documents to their conceptual schema, DTD Graphs. The necessity for doing so is due to the fact that XML documents are frequently used for storing structured data and their schemas, such as in Document Type Definition (DTD) format, are missing, especially for those existing historical XML documents. As such, it is difficult for software developers or end users to make use of them. Even the schemas exist, they are difficult to read and undetermined of the underlying relationships among the elements in the documents. In view of this, it is necessary to determine the data semantics from the XML documents. If the DTDs of the XML documents exist with the identifications of the ID/IDREF(S) type attributes, then more data semantics can be derived. Another application of the determined data semantics is to verify the linkages implemented by ID/IDREF(S). If the element is referring to an incorrect XML element type, an extra data semantic will be determined as a result, and such findings can be used for verification purposes. Furthermore, the approaches proposed in this paper use Simple API for XML (SAX) so that the algorithms are applicable to small to huge sized XML documents.
机译:我们提出了一种系统化的方法来将任意XML文档反向工程为它们的概念模式DTD Graph。这样做的必要性是由于以下事实,即XML文档经常用于存储结构化数据,并且缺少其文档格式(如文档类型定义(DTD)格式),尤其是对于那些现有的历史XML文档而言。因此,软件开发人员或最终用户很难使用它们。即使存在模式,也很难阅读和确定文档中元素之间的潜在关系。鉴于此,有必要从XML文档中确定数据语义。如果XML文档的DTD带有ID / IDREF(S)类型属性的标识,则可以导出更多的数据语义。确定的数据语义的另一个应用是验证ID / IDREF(S)实现的链接。如果该元素引用了不正确的XML元素类型,则将确定额外的数据语义,这样的发现可用于验证目的。此外,本文提出的方法使用XML的简单API(SAX),因此该算法适用于小型到大型XML文档。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号