首页> 外文会议>International Conference on Computational Science and Its Applications;ICCSA 2008 >Generating Fuzzy Equivalence Classes on RSS News Articles for Retrieving Correlated Information
【24h】

Generating Fuzzy Equivalence Classes on RSS News Articles for Retrieving Correlated Information

机译:在RSS新闻文章上生成模糊对等类以检索相关信息

获取原文

摘要

Tens of thousands of news articles are posted on-line each day, covering topics from politics to science to current events. In order to better cope with this overwhelming volume of information, RSS (news) feeds are used to categorize newly posted articles. Nonetheless, most RSS users must filter through many articles within the same or different RSS feeds in order to locate articles pertaining to their particular interests. Due to the large number of news articles in individual RSS feeds, there is a need for further organizing articles to aid users in locating non-redundant, informative, and related articles of interest quickly. In this paper, we present a novel approach which uses the word-correlation factors in a fuzzy set information retrieval model to (ⅰ) filter out redundant news articles from RSS feeds, (ⅱ) shed less-informative articles from the non-redundant ones, and (ⅲ) cluster the remaining informative articles according to the fuzzy equivalence classes generated on the news articles. Our clustering approach requires little overhead or computational costs, and experimental results have shown that it outperforms other existing well-known clustering approaches.
机译:每天在线发布数以万计的新闻文章,涵盖从政治到科学再到时事的话题。为了更好地应对这种压倒性的信息量,RSS(新闻)提要用于对新发布的文章进行分类。但是,大多数RSS用户必须过滤相同或不同RSS提要中的许多文章,才能找到与他们的特定兴趣有关的文章。由于各个RSS feed中的新闻文章数量众多,因此需要进一步组织新闻稿,以帮助用户快速找到感兴趣的非冗余,翔实和相关的文章。在本文中,我们提出了一种新颖的方法,该方法在模糊集合信息检索模型中使用单词相关因子来(ⅰ)从RSS提要中过滤掉多余的新闻文章,(ⅱ)从非冗余的文章中去除信息较少的文章;和(according)根据新闻文章上生成的模糊对等类别对其余的信息文章进行聚类。我们的聚类方法只需要很少的开销或计算成本,并且实验结果表明它优于其他现有的众所周知的聚类方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号