Journal: Wuhan University Journal of Natural Sciences
Extracting Result Schema Based on Query Instances in the Deep Web

Abstract

Deep Web sources contain a large amount of high-quality, query-related structured data. One of the challenges in the Deep Web is extracting the result schemas of Deep Web sources. To address this challenge, this paper describes a novel approach that extracts both the result data and the result schema of a Web database. The approach first models the query interface of a Deep Web source and fills it in with a specific query instance. Then the result pages of the Deep Web source are formatted as a tree structure to retrieve the subtrees that contain elements of the query instance. Next, the result schema of the Deep Web source is extracted by matching the subtrees' nodes with the query instance; a two-phase schema extraction method is adopted to obtain a more accurate result schema. Finally, experiments on real Deep Web sources show the utility of our approach, which provides high precision and recall.
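The matching step described in the abstract can be illustrated with a minimal sketch. It is not the paper's implementation: the abstract does not specify the two-phase method, so the code below shows only a single naive pass. The query instance, the sample result page, and every function name are hypothetical, and the page is assumed to be well-formed so that the standard-library XML parser can build the tree.

```python
import xml.etree.ElementTree as ET

# Hypothetical query instance: interface attribute -> value submitted in the form.
QUERY_INSTANCE = {"title": "Dune", "author": "Frank Herbert"}

# Hypothetical, well-formed result page. Real result pages are rarely valid XML
# and would need an HTML-tolerant parser instead.
RESULT_PAGE = """
<html><body>
  <div id="nav"><a>Home</a></div>
  <table>
    <tr><td>Dune</td><td>Frank Herbert</td><td>9.99</td></tr>
    <tr><td>Emma</td><td>Jane Austen</td><td>7.50</td></tr>
  </table>
</body></html>
"""


def text_of(node):
    """Concatenate all text inside a subtree."""
    return "".join(node.itertext()).strip()


def contains_all(node, values):
    """True if the subtree's text contains every query value."""
    text = text_of(node)
    return all(value in text for value in values)


def find_record_subtrees(root, instance):
    """Return the smallest subtrees that contain every value of the query instance."""
    values = list(instance.values())
    hits = []
    for node in root.iter():
        if contains_all(node, values):
            # Skip this node if one of its children already contains everything.
            if not any(contains_all(child, values) for child in node):
                hits.append(node)
    return hits


def label_children(record, instance):
    """Map child positions to attribute names by exact text match."""
    schema = {}
    for position, child in enumerate(record):
        text = text_of(child)
        for attribute, value in instance.items():
            if text == value:
                schema[position] = attribute
    return schema


def extract_schema(page, instance):
    """Parse the page, locate a record subtree, and label its children."""
    root = ET.fromstring(page.strip())
    records = find_record_subtrees(root, instance)
    if not records:
        return {}
    return label_children(records[0], instance)


schema = extract_schema(RESULT_PAGE, QUERY_INSTANCE)
print(schema)  # {0: 'title', 1: 'author'}
```

In this sketch the third column (the price) stays unlabeled because no query value matches it. A single matching pass can therefore leave the schema incomplete, which is one plausible reason a further refinement step would be useful; how the paper's second phase actually works is not stated in the abstract.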
