首页> 外文会议>ACM conference on information and knowledge management >Incorporating Terminology Evolution for Query Translation in Text Retrieval with Association Rules
【24h】

Incorporating Terminology Evolution for Query Translation in Text Retrieval with Association Rules

机译:在文本检索中包含关联规则的术语演进中的术语演进

获取原文

摘要

Time-stamped documents such as newswire articles, blog posts and other web-pages are often archived online. When these archives cover long spans of time, the terminology within them could undergo significant changes. Hence when users pose queries pertaining to historical information over such documents, the queries need to be translated taking into account these temporal changes in order to provide accurate responses to users. For example, a query on Sri Lanka should automatically retrieve documents with its former name Ceylon. We call such concepts SITACs, i.e., Semantically Identical Temporally Altering Concepts. In order to discover SITACs, we propose an approach based on a novel framework constituting an integration of natural language processing, association rule mining and contextual similarity as a learning technique. The proposed approach has been experimented with real data and has been found to yield good results with respect to efficiency and accuracy.
机译:诸如NewsWire文章,博客帖子和其他网页之类的时间戳文件通常是在线存档。当这些档案涵盖长期时间长时,它们内的术语可能会发生重大变化。因此,当用户掌握与此类文档相关的历史信息有关的查询时,需要将查询转换为这些时间变化,以便为用户提供准确的响应。例如,Sri Lanka上的查询应自动将文档与其前姓名的锡塔仑一起检索。我们称之为Sitacs,即,语义上的概念,在语义上相同的时间更改概念。为了发现Sitacs,我们提出了一种基于一个基于基于自然语言处理,关联规则挖掘和上下文相似性作为学习技术的新颖框架的方法。拟议的方法已经尝试了实际数据,并已被发现在效率和准确性方面产生了良好的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号