Journal: ACM Transactions on Database Systems

Exploiting Web Querying for Web People Search


Abstract

Searching for people on the Web is one of the most common query types submitted to Web search engines today. However, when a person name is queried, the returned Webpages often contain documents related to several distinct namesakes who have the queried name. The task of disambiguating and finding the Webpages related to the specific person of interest is left to the user. Many Web People Search (WePS) approaches have been developed recently that attempt to automate this disambiguation process. Nevertheless, the disambiguation quality of these techniques leaves major room for improvement. In this article, we present a new WePS approach. It is based on issuing additional auxiliary queries to the Web to gain additional knowledge about the Webpages that need to be disambiguated. Thus, the approach uses the Web as an external data source by issuing queries to collect co-occurrence statistics. These statistics are used to assess the overlap of the contextual entities extracted from the Webpages. The article also proposes a methodology to make this Web querying technique efficient. Further, the article proposes an approach that is capable of combining various types of disambiguating information, including other common types of similarities, by applying a correlation clustering approach with after-clustering of singleton clusters. These properties allow the framework to get an advantage in terms of result quality over other state-of-the-art WePS techniques.
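The abstract does not say how the co-occurrence statistics are computed from the auxiliary queries, so the following is only a minimal sketch under stated assumptions. It assumes a search backend that reports a hit count per query and scores two contextual entities with a Dice coefficient over those counts. The function names, the query syntax, the entity names, and the toy counts are all hypothetical and are not taken from the article.

```python
from typing import Callable, Dict


def dice_cooccurrence(hits_a: int, hits_b: int, hits_ab: int) -> float:
    """Dice coefficient over hit counts: 2 * |A and B| / (|A| + |B|).

    Assumption: the article may use a different statistic entirely.
    """
    total = hits_a + hits_b
    if total == 0:
        return 0.0
    return 2.0 * hits_ab / total


def entity_cooccurrence(
    entity_a: str,
    entity_b: str,
    hit_count: Callable[[str], int],
) -> float:
    """Issue three auxiliary queries and turn their hit counts into a score.

    `hit_count` is a hypothetical callable standing in for a Web search API.
    """
    hits_a = hit_count(f'"{entity_a}"')
    hits_b = hit_count(f'"{entity_b}"')
    hits_ab = hit_count(f'"{entity_a}" AND "{entity_b}"')
    return dice_cooccurrence(hits_a, hits_b, hits_ab)


# Made-up counts used only to exercise the code; not real search results.
TOY_COUNTS: Dict[str, int] = {
    '"Acme Labs"': 1000,
    '"Database Conference"': 3000,
    '"Acme Labs" AND "Database Conference"': 200,
}


def toy_hit_count(query: str) -> int:
    """Look up a query in the toy table; unknown queries count as zero hits."""
    return TOY_COUNTS.get(query, 0)


score = entity_cooccurrence("Acme Labs", "Database Conference", toy_hit_count)
print(score)
```

A higher score for entities extracted from two different Webpages would suggest that the pages share context and may refer to the same namesake. In a real setting, caching `hit_count` results would be one plausible way to limit the number of queries issued.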
