首页>
外国专利>
Modifying a hierarchical data structure according to a pseudo-rendering of a structured document by annotating and merging nodes
Modifying a hierarchical data structure according to a pseudo-rendering of a structured document by annotating and merging nodes
展开▼
机译:通过注释和合并节点,根据结构化文档的伪渲染来修改分层数据结构
展开▼
页面导航
摘要
著录项
相似文献
摘要
A structured document is translated into an initial hierarchical data structure in accordance with syntactic elements defined in the structured document. The initial hierarchical data structure includes a plurality of nodes, and each node corresponds to one of the syntactic elements. The method then annotates a node with a set of attributes including geometric parameters of semantic elements in the structured document that are associated with the node in accordance with a pseudo-rendering of the structured document. Finally, the method merges the nodes in the initial hierarchical data structure into a tree of merged nodes in accordance with their respective attributes and a set of predefined rules such that each merged node is associated with a semantically distinct region of the pseudo-rendered document. The predefined rules include rules for merging nodes associated with semantic elements that have nearby positions and/or compatible attributes in the pseudo-rendered document.
展开▼