首页> 外文期刊>Information Processing & Management >Discovering topic time from web news
【24h】

Discovering topic time from web news

机译:从网络新闻中发现话题时间

获取原文
获取原文并翻译 | 示例
           

摘要

Topic time reflects the temporal feature of topics in Web news pages, which can be used to establish and analyze topic models for many time-sensitive text mining tasks. However, there are two critical challenges in discovering topic time from Web news pages. The first issue is how to normalize different kinds of temporal expressions within a Web news page, e.g., explicit and implicit temporal expressions, into a unified representation framework. The second issue is how to determine the right topic time for topics in Web news. Aiming at solving these two problems, we propose a systematic framework for discovering topic time from Web news. In particular, for the first issue, we propose a new approach that can effectively determine the appropriate referential time for implicit temporal expressions and further present an effective defuzzification algorithm to find the right explanation for a fuzzy temporal expression. For the second issue, we propose a relation model to describe the relationship between news topics and topic time. Based on this model, we design a new algorithm to extract topic time from Web news. We build a prototype system called Topic Time Parser (TTP) and conduct extensive experiments to measure the effectiveness of our proposal. The results suggest that our proposal is effective in both temporal expression normalization and topic time extraction.
机译:主题时间反映了Web新闻页面中主题的时间特征,可用于为许多对时间敏感的文本挖掘任务建立和分析主题模型。但是,从Web新闻页面发现主题时间存在两个关键挑战。第一个问题是如何将Web新闻页面中的不同类型的时间表达式(例如显式和隐式时间表达式)标准化为一个统一的表示框架。第二个问题是如何为Web新闻中的主题确定正确的主题时间。为了解决这两个问题,我们提出了一个用于从Web新闻中发现主题时间的系统框架。特别是对于第一个问题,我们提出了一种新方法,该方法可以有效地确定隐式时间表达的适当参考时间,并进一步提出一种有效的去模糊化算法,以找到对模糊时间表达的正确解释。对于第二个问题,我们提出了一种关系模型来描述新闻主题与主题时间之间的关系。基于此模型,我们设计了一种从Web新闻中提取主题时间的新算法。我们构建了一个名为Topic Time Parser(TTP)的原型系统,并进行了广泛的实验以衡量我们提案的有效性。结果表明,我们的建议在时间表达规范化和主题时间提取方面均有效。

著录项

  • 来源
    《Information Processing & Management》 |2015年第6期|869-890|共22页
  • 作者单位

    School of Computer Science and Technology, Southwest University of Science and Technology, China;

    Key Laboratory of Electromagnetic Space Information, Chinese Academy of Sciences, China,School of Computer Science and Technology, University of Science and Technology of China, Jinzhai Road 59, Hefei 230027, China;

    School of Computer Science and Technology, University of Science and Technology of China, China,Key Laboratory of Electromagnetic Space Information, Chinese Academy of Sciences, China;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Temporal expression; Normalization; Topic time; Relation model; Web news;

    机译:时间表达;正常化;主题时间;关系模型;网络新闻;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号