...
【24h】

Extracting logical schema from the web

机译:从Web提取逻辑架构

获取原文
获取原文并翻译 | 示例

摘要

One of the main limitations when accessing the web is the lack of explicit structure, whose presence may help in understanding data semantics. Schema for web data can be constructed at different levels, structuring a single pages or a whole site or group of sites. Here we present an approach to give a logical schema to a web-site, first defining a model for a single page, where its contents is divided into "logical" sections, i.e. parts of a page each collecting related information. Then, we introduce a site model in which both physical and logical links among different page sections are represented: physical are existing hyperlinks, while logical links are links between sections containing semantically related information. We show how such links can be found and classified according to their relevance, also showing how schema is used in a structure-aware browser to improve both browsing and searching. [References: 19]
机译:访问Web时的主要限制之一是缺少显式结构,该结构的存在可能有助于理解数据语义。可以在不同级别上构造Web数据的架构,从而构造单个页面或整个站点或站点组。在这里,我们提出一种为网站提供逻辑模式的方法,首先为单个页面定义一个模型,其中其内容分为“逻辑”部分,即页面的每个部分都收集相关信息。然后,我们介绍一种站点模型,其中表示了不同页面部分之间的物理和逻辑链接:物理是现有的超链接,而逻辑链接是包含语义相关信息的部分之间的链接。我们将展示如何找到这些链接并根据它们的相关性对其进行分类,还展示如何在结构感知的浏览器中使用架构来改善浏览和搜索。 [参考:19]

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号