【24h】

Connecting the Dots Between News Articles

机译:连接新闻文章之间的点

获取原文

摘要

The process of extracting useful knowledge from large datasets has become one of the most pressing problems in today's society. The problem spans entire sectors, from scientists to intelligence analysts and web users, all of whom are constantly struggling to keep up with the larger and larger amounts of content published every day. With this much data, it is often easy to miss the big picture.In this paper, we investigate methods for automatically connecting the dots - providing a structured, easy way to navigate within a new topic and discover hidden connections. We focus on the news domain: given two news articles, our system automatically finds a coherent chain linking them together. For example, it can recover the chain of events starting with the decline of home prices (January 2007), and ending with the ongoing health-care debate.We formalize the characteristics of a good chain and provide an efficient algorithm (with theoretical guarantees) to connect two fixed endpoints. We incorporate user feedback into our framework, allowing the stories to be refined and personalized. Finally, we evaluate our algorithm over real news data. Our user studies demonstrate the algorithm's effectiveness in helping users understanding the news.
机译:从大型数据集中提取有用知识的过程已成为当今社会最紧迫的问题之一。这个问题涉及整个领域,从科学家到情报分析人员再到网络用户,所有这些人都在不断努力以跟上每天发布的越来越多的内容。有了这么多的数据,通常很容易错过全局。 在本文中,我们研究了自动连接点的方法-提供了一种结构化的简单方法,可在新主题中导航并发现隐藏的连接。我们专注于新闻领域:给定两个新闻文章,我们的系统会自动找到一条连贯的链条将它们链接在一起。例如,它可以恢复一系列事件,从房价下降(2007年1月)开始,到正在进行的医疗辩论结束。 我们将一条好的链的特征形式化,并提供一种有效的算法(具有理论上的保证)来连接两个固定的端点。我们将用户反馈整合到我们的框架中,从而使故事得以完善和个性化。最后,我们根据真实新闻数据评估算法。我们的用户研究证明了该算法在帮助用户了解新闻方面的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号