首页> 外文会议>Web Information Systems and Applications Conference, 2009. WISA 2009 >An Approach of Community Detecting Based on Block Level Link Analysis
【24h】

An Approach of Community Detecting Based on Block Level Link Analysis

机译:基于块级链接分析的社区发现方法

获取原文

摘要

With the explosive increment of the information in the web, how to efficiently understand collective behaviors and emerging phenomenon in the WWW is becoming a serious problem. Web community, which can be recognized as a set of web units, usually the pages, based on a common topic, is a significant structure in the web. An efficient way to detect a community corresponding to a specific topic can greatly help users to obtain useful information. However, existing community detecting algorithms are all based on pages as units, the topic drift phenomenon becomes the inherent problem of each existing community detecting algorithm since the web pages always have multiple topics in content.This paper puts forward a new community detecting algorithm based on blocks as units, which combines the page dividing algorithm with the primary community detecting algorithm, overcoming the defect in the existing community detecting algorithms. The communities obtained by the new algorithm have much more definite topic in content, which is more meaningful for further data processing such as information extraction and so on.
机译:随着网络中信息的爆炸性增长,如何有效地了解WWW中的集体行为和新出现的现象已成为一个严重的问题。 Web社区可以被识别为一组Web单元,通常基于共同主题的页面是Web中的重要结构。检测对应于特定主题的社区的有效方法可以极大地帮助用户获得有用的信息。但是,现有的社区检测算法都是以页面为单位,由于网页上总是存在多个主题,因此主题漂移现象成为每个现有社区检测算法的固有问题。本文提出了一种新的基于社区的社区检测算法。以块为单位,结合了分页算法和主要社区检测算法,克服了现有社区检测算法的缺陷。通过新算法获得的社区在内容上具有更加明确的主题,这对于进一步的数据处理(如信息提取等)更有意义。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号