Journal: Wuhan University Journal of Natural Sciences (《武汉大学学报:自然科学英文版》)

Extracting Result Schema Based on Query Instances in the Deep Web

         

Abstract

Deep Web sources contain a large amount of high-quality, query-related structured data. One of the challenges in the Deep Web is extracting the result schemas of Deep Web sources. To address this challenge, this paper describes a novel approach that extracts both the result data and the result schema of a Web database. The approach first models the query interface of a Deep Web source and fills it in with a specific query instance. The result pages of the Deep Web source are then parsed into a tree structure in order to retrieve the subtrees that contain elements of the query instance. Next, the result schema of the Deep Web source is extracted by matching the subtrees' nodes against the query instance; a two-phase schema extraction method is adopted at this step to obtain a more accurate result schema. Finally, experiments on real Deep Web sources show the utility of the approach, which achieves high precision and recall.
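
The pipeline summarized in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `find_record_subtrees` and `extract_schema`, the sample page, the query instance, and the simple substring matching are all assumptions made for illustration, and the two-phase refinement mentioned in the abstract is not reproduced because the abstract does not specify it. The sketch assumes the result page is well-formed markup that Python's standard `xml.etree.ElementTree` can parse.

```python
import xml.etree.ElementTree as ET

# Hypothetical result page returned by a Deep Web source (assumed well-formed).
PAGE = """
<html><body>
  <h1>Search results</h1>
  <table>
    <tr><td>Dune</td><td>Frank Herbert</td><td>9.99</td></tr>
    <tr><td>Emma</td><td>Jane Austen</td><td>7.50</td></tr>
  </table>
</body></html>
"""

# Hypothetical query instance: attribute name -> value submitted in the form.
QUERY = {"title": "Dune", "author": "Frank Herbert"}


def contains_all(elem, values):
    """True if the element's combined text contains every value."""
    text = " ".join(elem.itertext()).lower()
    return all(v in text for v in values)


def find_record_subtrees(root, query_instance):
    """Return the smallest subtrees that contain every query value."""
    values = [v.lower() for v in query_instance.values()]
    matches = []
    for elem in root.iter():
        if not contains_all(elem, values):
            continue
        # Skip ancestors: a child that also matches is a tighter subtree.
        if any(contains_all(child, values) for child in elem):
            continue
        matches.append(elem)
    return matches


def extract_schema(subtree, query_instance):
    """Label each non-empty text node with the matching query attribute."""
    schema = []
    for node in subtree.iter():
        text = (node.text or "").strip()
        if not text:
            continue
        label = next(
            (attr for attr, value in query_instance.items()
             if value.lower() in text.lower()),
            None,
        )
        # Nodes that match no query value get a positional placeholder name.
        schema.append((label or f"unknown_{len(schema)}", text))
    return schema


root = ET.fromstring(PAGE.strip())
records = find_record_subtrees(root, QUERY)
schema = extract_schema(records[0], QUERY)
print(schema)
```

In this toy page the first table row is the only subtree that contains both query values, so its cells are labeled `title` and `author`, while the price cell, which matches nothing in the query instance, receives a placeholder label. A real system would need a tolerant HTML parser and a way to name such unmatched columns.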
