Journal: Expert Systems with Applications

Exploiting temporal information in Web search

Abstract

Time plays an important role in Web search: most Web pages contain temporal information, and many Web queries are time-related. How to integrate temporal information into Web search engines has been a research focus in recent years, yet traditional search engines offer little support for temporal-textual Web queries. To address this problem, this paper concentrates on extracting the focused time of a Web page, that is, the most appropriate time associated with the page, and then uses the focused time to improve search efficiency for time-sensitive queries. Three critical issues are studied. The first is to extract implicit temporal expressions from Web pages; the second is to determine the focused time among all the extracted temporal information; the third is to integrate the focused time into a search engine. For the first issue, we propose a new dynamic approach to resolving implicit temporal expressions in Web pages. For the second, we present a score model that determines the focused time of a Web page, taking into account both the frequency of temporal information in the page and the containment relationships among temporal expressions. For the third, we combine the textual similarity and the temporal similarity between queries and documents in the ranking process. To evaluate the effectiveness and efficiency of the proposed approaches, we build a prototype system called Time-Aware Search Engine (TASE). TASE extracts both explicit and implicit temporal expressions from Web pages, calculates the relevance score between a Web page and each temporal expression, and re-ranks search results based on the temporal-textual relevance between Web pages and queries. Finally, we conduct experiments on real data sets. The results show that our approach achieves high accuracy in resolving implicit temporal expressions and extracting focused time, and ranks results for time-sensitive Web queries more effectively than competing algorithms.
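
The abstract names the ingredients of the score model (frequency and containment) and of the ranking step (textual plus temporal similarity) but gives no formulas. The following is a minimal sketch of how such a pipeline could fit together, not the paper's actual model. Every name here (`Interval`, `focused_time`, `temporal_similarity`, `combined_score`), the month-level granularity, the `containment_weight` and `alpha` parameters, and the use of interval overlap as temporal similarity are assumptions made for illustration.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass(frozen=True)
class Interval:
    """A closed time interval measured in months (hypothetical representation)."""
    start: int
    end: int

    def contains(self, other: "Interval") -> bool:
        return self.start <= other.start and other.end <= self.end

    def length(self) -> int:
        return self.end - self.start + 1

    def overlap(self, other: "Interval") -> int:
        lo, hi = max(self.start, other.start), min(self.end, other.end)
        return max(0, hi - lo + 1)


def year(y: int) -> Interval:
    return Interval(y * 12, y * 12 + 11)


def month(y: int, m: int) -> Interval:
    return Interval(y * 12 + m - 1, y * 12 + m - 1)


def focused_time(expressions: list[Interval], containment_weight: float = 0.5) -> Interval:
    """Pick the expression with the highest score.

    Assumed scoring: own frequency, plus a discounted share of the
    frequency of every other expression that it contains.
    """
    counts = Counter(expressions)

    def score(t: Interval) -> float:
        contained = sum(n for u, n in counts.items() if u != t and t.contains(u))
        return counts[t] + containment_weight * contained

    return max(counts, key=score)


def temporal_similarity(query: Interval, doc: Interval) -> float:
    """Assumed measure: overlap divided by union of the two intervals."""
    inter = query.overlap(doc)
    union = query.length() + doc.length() - inter
    return inter / union


def combined_score(text_sim: float, temp_sim: float, alpha: float = 0.5) -> float:
    """Assumed linear blend of textual and temporal similarity."""
    return alpha * text_sim + (1 - alpha) * temp_sim


# A page mentioning May 2008, August 2008, the year 2008, and January 2010.
page = [month(2008, 5), month(2008, 8), year(2008), month(2010, 1)]
focus = focused_time(page)
ranking_score = combined_score(0.8, temporal_similarity(year(2008), focus))
```

In this sketch the year 2008 wins because the two month-level mentions it contains add to its own count, which is one plausible reading of how containment could interact with frequency. A real system would also need the extraction step that resolves implicit expressions, which is not modeled here.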
