Journal: Information Processing & Management
Modeling, encoding and querying multi-structured documents

Abstract

The issue of multi-structured documents became prominent with the emergence of the digital humanities as a field of practice. Many distinct structures may be defined simultaneously on the same original content to serve different documentary tasks. For example, a document may have both a structure for the logical organization of its content (logical structure) and a structure expressing a set of content formatting rules (physical structure). In this paper, we present MSDM, a generic model for multi-structured documents, in which several important features are established. We also address the problem of efficiently encoding multi-structured documents by introducing MultiX, a new XML formalism based on the MSDM model. Finally, we propose a library of XQuery functions for querying MultiX documents. We illustrate all the contributions with a use case based on a fragment of an old manuscript.
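
The abstract does not spell out how MSDM or MultiX represent structures, so the following is only a minimal Python sketch of the general idea it describes: two structures (logical and physical) defined over the same content. The names `Span`, `build`, and `crossing`, and the sample text, are hypothetical and are not taken from the paper. The sketch shows why a single XML tree struggles here: a sentence can cross a line boundary without either one containing the other.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Span:
    """A labelled range of character offsets into the shared content (hypothetical type)."""
    label: str
    start: int  # inclusive
    end: int    # exclusive


def build(pieces, label):
    """Turn consecutive text pieces into spans numbered from 1."""
    spans, pos = [], 0
    for i, piece in enumerate(pieces, 1):
        spans.append(Span(f"{label}{i}", pos, pos + len(piece)))
        pos += len(piece)
    return spans


def crossing(a, b):
    """True when a and b share text but neither contains the other."""
    intersects = a.start < b.end and b.start < a.end
    a_in_b = b.start <= a.start and a.end <= b.end
    b_in_a = a.start <= b.start and b.end <= a.end
    return intersects and not a_in_b and not b_in_a


# Assumed sample data: the same content segmented in two different ways.
sentences = ["The scribe wrote. ", "The ink has faded."]   # logical structure
lines = ["The scribe wrote. The ", "ink has faded."]       # physical structure

content = "".join(sentences)
assert content == "".join(lines)  # both structures cover identical content

logical = build(sentences, "sentence")
physical = build(lines, "line")

# Pairs that cannot be nested inside one another in a single tree.
conflicts = [(s.label, l.label) for s in logical for l in physical if crossing(s, l)]
print(conflicts)
```

Here the second sentence begins on the first line and ends on the second, so that pair is reported as crossing. Handling such cases is the general motivation for models and encodings dedicated to multi-structured documents.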
