【24h】

An effective splog detector for Chinese weblogs

机译:用于中文博客的有效捕捉探测器

获取原文

摘要

The splogs are those blogs created for commercial purposes and purely copies contents from other blogs or webs. The splogs will block the contents of regular blogs and deprecate the value of the blog web server. The problems of detecting the splogs become the research topics in web engineering recently. In this paper, the methodologies and algorithms for detecting the splogs from the Chinese web logs are proposed. The goal of this study is to help blog server maintainers to detect and reduce the possible barricading splogs so that the functions of the web server can be maintained normally. In this paper, a new approach mixed with three factors of blog content similarity and a framework based on the SVM algorithm is proposed. This proposed method can correctly detect the Chinese splogs from a large set of Chinese web log and reduce the possible traffic loads of the Chinese we blog server.
机译:Splogs是为商业目的创建的博客,纯粹复制来自其他博客或网站的内容。拆分将阻止常规博客的内容,并弃用博客Web服务器的值。检测拆分的问题最近成为网络工程中的研究主题。在本文中,提出了用于检测来自中国Web日志的拆分的方法和算法。本研究的目标是帮助博客服务器维护者检测和减少可能的障碍拆分,从而可以正常维护Web服务器的功能。在本文中,提出了一种与三个基于SVM算法的博客内容相似性和框架混合的新方法。这一提出的方法可以从一大集的中文日志中正确检测汉语拼接,并减少中文我们的博客服务器的可能负载。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号