首页> 外文会议>Web engineering >Efficient Term Cloud Generation for Streaming Web Content
【24h】

Efficient Term Cloud Generation for Streaming Web Content

机译:高效的术语云生成,用于流传输Web内容

获取原文
获取原文并翻译 | 示例

摘要

Large amounts of information are posted daily on the Web, such as articles published online by traditional news agencies or blog posts referring to and commenting on various events. Although the users sometimes rely on a small set of trusted sources from which to get their information, they often also want to get a wider overview and glimpse of what is being reported and discussed in the news and the blogosphere. In this paper, we present an approach for supporting this discovery and exploration process by exploiting term clouds. In particular, we provide an efficient method for dynamically computing the most frequently appearing terms in the posts of monitored online sources, for time intervals specified at query time, without the need to archive the actual published content. An experimental evaluation on a large-scale real-world set of blogs demonstrates the accuracy and the efficiency of the proposed method in terms of computational time and memory requirements.
机译:每天都会在Web上发布大量信息,例如传统新闻社在线发布的文章或引用和评论各种事件的博客文章。尽管用户有时依赖一小部分受信任的来源来获取信息,但他们通常还希望对新闻和博客圈中正在报道和讨论的内容有更广泛的了解。在本文中,我们提出了一种通过利用术语云来支持此发现和探索过程的方法。尤其是,我们提供了一种有效的方法,用于动态计算受监视的在线资源的帖子中最频繁出现的字词,查询时间为指定的时间间隔,而无需存档实际发布的内容。在大规模的真实世界博客上进行的实验评估证明了该方法在计算时间和内存需求方面的准确性和效率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号