Conference: IEEE International Conference on Computer Supported Cooperative Work in Design
TSIR: A Chinese Temporal semantics Information Retrieval system based on MapReduce


Abstract

The significance of time in information production and consumption is well recognised in information retrieval research, and temporal information plays an important role in webpage retrieval. A webpage carries both temporal metadata and temporal semantics in its content. However, existing search engines retrieve information based on text keywords rather than temporal semantics. To address this issue, a Temporal semantics Information Retrieval (TSIR) system is proposed for Chinese temporal information retrieval. The TSIR system is deployed on Hadoop and implemented with MapReduce. First, Chinese temporal regular expression rules are introduced to extract explicit and implicit temporal phrases from the query keywords and webpages. Second, webpage scores are re-evaluated by taking both text relevance and temporal semantics relevance into account, and the returned results are ranked according to the re-evaluated scores. Experiments show that the TSIR system can precisely and effectively match keyword queries related to temporal expressions.
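
The two steps described in the abstract (regex-based extraction of temporal phrases, then re-scoring by text and temporal relevance) can be illustrated with a minimal single-machine sketch. Everything specific here is an assumption for illustration and is not taken from the paper: the pattern `EXPLICIT_DATE`, the table `IMPLICIT_YEAR_OFFSETS`, the Jaccard-style `temporal_relevance`, and the linear weight `alpha` are hypothetical, and the MapReduce deployment is omitted entirely.

```python
import re
from datetime import date

# Hypothetical rule for explicit dates such as 2013年5月20日 or 2013年5月.
# The paper's actual rule set is not given in the abstract.
EXPLICIT_DATE = re.compile(r"(\d{4})年(?:(\d{1,2})月)?(?:(\d{1,2})日)?")

# Hypothetical table of implicit (relative) phrases mapped to a year offset.
IMPLICIT_YEAR_OFFSETS = {"去年": -1, "今年": 0, "明年": 1}


def extract_years(text, reference):
    """Return the set of years mentioned explicitly or implicitly in text."""
    years = {int(m.group(1)) for m in EXPLICIT_DATE.finditer(text)}
    for phrase, offset in IMPLICIT_YEAR_OFFSETS.items():
        if phrase in text:
            # Resolve relative phrases against the reference date.
            years.add(reference.year + offset)
    return years


def temporal_relevance(query_years, page_years):
    """Jaccard overlap of query and page years (an assumed measure)."""
    if not query_years or not page_years:
        return 0.0
    return len(query_years & page_years) / len(query_years | page_years)


def rescore(text_score, temporal_score, alpha=0.5):
    """Linear blend of the two scores; alpha is an assumed weight."""
    return alpha * text_score + (1 - alpha) * temporal_score


def rerank(query, pages, reference, alpha=0.5):
    """Rank pages, given as (page_id, text, text_score), by blended score."""
    query_years = extract_years(query, reference)
    scored = []
    for page_id, text, text_score in pages:
        t = temporal_relevance(query_years, extract_years(text, reference))
        scored.append((rescore(text_score, t, alpha), page_id))
    scored.sort(key=lambda item: item[0], reverse=True)
    return [page_id for _, page_id in scored]
```

With a reference date in 2014, the query `去年 地震` resolves to the year 2013, so a page mentioning `2013年4月` can outrank a page with a higher text score that only mentions `2012年`. In a Hadoop setting the extraction would plausibly run per document in a map phase, but that mapping is a guess and not something the abstract specifies.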
