首页> 外文期刊>Journal of Intelligent Information Systems >WebKey: a graph-based method for event detection in web news
【24h】

WebKey: a graph-based method for event detection in web news

机译:WebKey:Web新闻中的基于图形的事件检测方法

获取原文
获取原文并翻译 | 示例
       

摘要

With rapid and vast publishing of news over the Internet, there is a surge of interest to detect underlying hot events from online news streams. There are two main challenges in event detection: accuracy and scalability. In this paper, we propose a fast and efficient method to detect events in news websites. First, we identify bursty terms which suddenly appear in a lot of news documents. Then, we construct a novel co-occurrence graph between terms in which nodes and edges are weighted based on important features such as click and document frequency within burst intervals. Finally, a weighted community detection algorithm is used to cluster terms and find events. We also propose a couple of techniques to reduce the size of the graph. The results of our evaluations show that the proposed method yields a much higher precision and recall than past methods, such that their harmonic mean is improved by at least 40%. Moreover, it reduces the running time and memory usage by a factor of at least 2.
机译:随着互联网的快速和广泛的新闻发布,兴趣检测来自在线新闻溪流的底层热门活动。事件检测有两个主要挑战:准确性和可扩展性。在本文中,我们提出了一种快速有效的方法来检测新闻网站的事件。首先,我们识别突然出现在很多新闻文件中的突发条款。然后,我们在基于诸如突发间隔内的点击和文档频率的重要特征来构建节点和边缘加权的术语之间的新颖共同发生图。最后,加权社区检测算法用于群集术语并找到事件。我们还提出了几种技术来减少图表的大小。我们的评估结果表明,该方法比过去的方法产生了更高的精度和召回,使其谐波平均值提高了至少40%。此外,它将运行时间和内存使用减少了至少2的因子。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号