首页> 外文会议>International Conference on Computational Science and Its Applications >Generating Fuzzy Equivalence Classes on RSS News Articles for Retrieving Correlated Information
【24h】

Generating Fuzzy Equivalence Classes on RSS News Articles for Retrieving Correlated Information

机译:在RSS新闻文章中生成模糊等价类,用于检索相关信息

获取原文

摘要

Tens of thousands of news articles are posted on-line each day, covering topics from politics to science to current events. In order to better cope with this overwhelming volume of information, RSS (news) feeds are used to categorize newly posted articles. Nonetheless, most RSS users must filter through many articles within the same or different RSS feeds in order to locate articles pertaining to their particular interests. Due to the large number of news articles in individual RSS feeds, there is a need for further organizing articles to aid users in locating non-redundant, informative, and related articles of interest quickly. In this paper, we present a novel approach which uses the word-correlation factors in a fuzzy set information retrieval model to (i) filter out redundant news articles from RSS feeds, (ii) shed less-informative articles from the non-redundant ones, and (iii) cluster the remaining informative articles according to the fuzzy equivalence classes generated on the news articles. Our clustering approach requires little overhead or computational costs, and experimental results have shown that it outperforms other existing well-known clustering approaches.
机译:成千上万的新闻文章的数万人在网上每天发布,覆盖了从政治主题,以科学时事。为了与这种信息铺天盖地卷更好地应对,RSS(新闻)饲料被用来分类新发表的文章。尽管如此,大多数RSS用户必须以找到属于自己的特殊利益的文章通过相同或不同的RSS Feed内很多文章进行筛选。由于大量的个人RSS提要的新闻文章,有必要进一步组织的文章,以帮助用户定位非冗余的信息,并迅速的利益相关的文章。在本文中,我们提出使用所述字的相关性的因素在一个模糊集合信息检索模型与(i)过滤掉来自RSS Feed冗余新闻文章的新方法,(ⅱ)棚从非冗余那些信息性少的文章,和(iii)根据关于新闻文章中生成的模糊等价类簇的剩余信息的文章。我们的聚类方法需要很少的开销或计算成本,实验结果表明,其性能优于现有知名的聚类方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号