首页> 外文期刊>ACM transactions on Asian language information processing >Topic Tracking with Time Granularity Reasoning
【24h】

Topic Tracking with Time Granularity Reasoning

机译:时间粒度推理的主题跟踪

获取原文
获取原文并翻译 | 示例
       

摘要

Temporal information is an important attribute of a topic, and a topic usually exists in a limited period. Therefore, many researchers have explored the utilization of temporal information in topic detection and tracking (TDT). They use either a story's publication time or temporal expressions in text to derive temporal relatedness between two stories or a story and a topic. However, past research neglects the fact that people tend to express a time with different granularities as time lapses. Based on a careful investigation of temporal information in news streams, we propose a new strategy with time granularity reasoning for utilizing temporal information in topic tracking. A set of topic times, which as a whole represent the temporal attribute of a topic, are distinguished from others in the given on-topic stories. The temporal relatedness between a story and a topic is then determined by the highest coreference level between each time in the story and each topic time where the coreference level between a test time and a topic time is inferred from the two times themselves, their granularities, and the time distance between the topic time and the publication time of the story where the test time appears. Furthermore, the similarity value between an incoming story and a topic, that is the likelihood that a story is on-topic, can be adjusted only when the new story is both temporally and semantically related to the target topic. Experiments on two different TDT corpora show that our proposed method could make good use of temporal information in news stories, and it consistently outperforms the baseline centroid algorithm and other algorithms which consider temporal relatedness.
机译:时间信息是主题的重要属性,主题通常在有限的时间内存在。因此,许多研究人员已经探索了时间信息在主题检测和跟踪(TDT)中的利用。他们使用故事的发布时间或文本中的时间表达来得出两个故事或一个故事与一个主题之间的时间关联。但是,过去的研究忽略了这样的事实,即人们倾向于随着时间的流逝以不同的粒度表示时间。在仔细研究新闻流中的时间信息的基础上,我们提出了一种具有时间粒度推理的新策略,用于在主题跟踪中利用时间信息。总体而言,代表主题的时间属性的一组主题时间在给定的主题故事中与其他主题时间区分开。然后,故事和主题之间的时间相关性由故事中每个时间与每个主题时间之间的最高共参照度确定,其中从两次测试时间与主题时间之间的共参照度,它们的粒度,以及测试时间出现在故事​​的主题时间和发布时间之间的时间距离。此外,仅当新故事在时间和语义上都与目标主题相关时,才可以调整传入故事与主题之间的相似性值,即故事在主题上的可能性。在两种不同的TDT语料库上进行的实验表明,我们提出的方法可以很好地利用新闻报道中的时间信息,并且始终优于基线质心算法和其他考虑时间相关性的算法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号