首页> 外文会议>International Conference on Information Knowledge Engineering >Extracting Information from Semi-Structured Web Pages by Considering User's Context
【24h】

Extracting Information from Semi-Structured Web Pages by Considering User's Context

机译:通过考虑用户的上下文,从半结构化网页中提取信息

获取原文
获取外文期刊封面目录资料

摘要

Nowadays, many users use web search engines to find and gather information. User faces an increasing amount of various semi-structured information sources. The issue of correlating, integrating and presenting related information to users becomes important. When a user uses a search engine such as Yahoo and Google to seek a specific information, the results are not only information about the availability of the desired information, but also information about other pages on which the desired information is mentioned The number of selected pages is enormous. Therefore, the performance capabilities, the overlap among results for the same queries and limitations of web search engines are an important and large area of research. Extracting information from the web data sources also becomes very important because the massive and increasing amount of diverse semi-structured information sources in the Internet that are available to users, and the variety of web pages making the process of information extraction from web a challenging problem. It is more challenging when an extracted information which is relevant to a user might not be relevant to other users. Thus, an information extraction that considers user's context more specifically user preferences would provide better results to the user. Thus, this paper proposed a framework for extracting information from semi-structured web pages by considering user's context.
机译:如今,许多用户使用Web搜索引擎查找和收集信息。用户面临越来越多的各种半结构信息源。关联,集成和呈现与用户相关信息的问题变得重要。当用户使用诸​​如雅虎和谷歌的搜索引擎来寻找特定信息时,结果不仅是关于所需信息的可用性的信息,而且还提到了关于所需信息的其他页面的信息,所以所选页面的数量是巨大的。因此,性能能力,结果相同查询的重叠和网络搜索引擎的限制是一个重要的和大的研究领域。从Web数据源中提取信息也变得非常重要,因为用户可用于用户的互联网中的多样化和越来越多的半结构化信息源,以及使来自Web的信息提取过程的各种网页成为一个具有挑战性的问题。当与用户相关的提取信息可能与其他用户无关时,它更具挑战性。因此,考虑用户的上下文更具体地说是用户偏好的信息提取将为用户提供更好的结果。因此,本文提出了一种用于通过考虑用户的上下文从半结构化网页提取信息的框架。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号