Detecting Webspam Beneficiaries Using Information Collected by the Random Surfer

Thomas Largillier; Sylvain Peyronnet

首页> 外文期刊>International journal of organizational and collective intelligence >Detecting Webspam Beneficiaries Using Information Collected by the Random Surfer

【24h】

Detecting Webspam Beneficiaries Using Information Collected by the Random Surfer

机译：使用随机冲浪者收集的信息检测垃圾邮件的受益者

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Search engines use several criteria to rank webpages and choose which pages to display when answering a request. Those criteria can be separated into two notions, relevance and popularity. The notion of popularity is calculated by the search engine and is related to links made to the webpage. Malicious webmasters want to artificially increase their popularity; the techniques they use are often referred to as Webspam. It can take many forms and is in constant evolution, but Webspam usually consists of building a specific dedicated structure of spam pages around a given target page. It is important for a search engine to address the issue of Webspam; otherwise, it cannot provide users with fair and reliable results. In this paper, the authors propose a technique to identify Webspam through the frequency language associated with random walks among those dedicated structures. The authors identify the language by calculating the frequency of appearance ofk-grams on random walks launched from every node.

机译：搜索引擎使用多种条件对网页进行排名，并选择在回答请求时显示哪些页面。这些标准可以分为两个概念：相关性和受欢迎度。流行度概念是由搜索引擎计算的，并且与指向网页的链接有关。恶意的网站管理员希望人为地提高其知名度；他们使用的技术通常称为Webspam。它可以采取多种形式并且在不断发展，但是Webspam通常包括围绕给定的目标页面构建特定的垃圾邮件页面专用结构。搜索引擎必须解决Webspam的问题，这一点很重要。否则，它不能为用户提供公正可靠的结果。在本文中，作者提出了一种通过与那些专用结构中的随机游走相关的频率语言来识别Web垃圾邮件的技术。作者通过计算从每个节点发起的随机游动的k-gram出现频率来识别语言。

著录项

来源
《International journal of organizational and collective intelligence》 |2011年第2期|p.36-48|共13页
作者
Thomas Largillier; Sylvain Peyronnet;
展开▼
作者单位

LRI, Universite Paris-Sud, F-91405, France;

LRI, Universite Paris-Sud, F-91405, France;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
data exploration; heuristics; information filtering; networks; web technologies;

机译：数据探索;启发式信息过滤;网络;网络技术;

相似文献

外文文献
中文文献
专利

1. It's a Small World for Random Surfers [J] . Mehrabian Abbas, Wormald Nick Algorithmica . 2016,第2期

机译：这是随机冲浪者的小世界
2. It?s a Small World for Random Surfers [J] . Abbas Mehrabian, Nick Wormald LIPIcs : Leibniz International Proceedings in Informatics . 2014,第1期

机译：这是随机冲浪者的小世界
3. Award of Commendation to Ash Downthwaite, Tony Dowthwaite Lighting Design Pty Ltd, for the Recreation Deck/Pools at the Hilton - Surfers Paradise, Surfers Paradise [J] . Lighting . 2013,第6期

机译：冲浪者天堂希尔顿-冲浪者天堂的休闲甲板/泳池获得了Tony Dowthwaite Lighting Design Pty Ltd的Ash Downthwaite奖
4. Using Patterns in the Behavior of the Random Surfer to Detect Webspam Beneficiaries [C] . Thomas Largillier, Sylvain Peyronnet International Conference on Web Information Systems Engineering . 2011

机译：在随机冲浪者行为中使用模式来检测WebSpam受益者
5. Freesurfer vs manual tracing: Detecting future cognitive decline in healthy older adults at-risk for Alzheimer's disease. [D] . Butts, Alissa M. 2013

机译：Freesurfer对比手动追踪：检测处于老年痴呆症危险中的健康老年人的未来认知能力下降。
6. The Surfer’s Shoulder: A Systematic Review of Current Literature and Potential Pathophysiological Explanations of Chronic Shoulder Complaints in Wave Surfers [O] . Lisette Charlotte Langenberg, Guilherme Vieira Lima, Sebastiaan Emanuel Heitkamp, 2021

机译：冲浪者的肩膀：对当前文献的系统审查以及波浪冲浪者慢性肩部抱怨的潜在病理生理学解释
7. Classification of Malicious Web Pages through a J48 Decision Tree, a Naïve Bayes, a RBF Network and a Random Forest Classifier for WebSpam Detection [O] . Muhammad Iqbal, Malik Muneeb Abid, Usman Waheed, 2017

机译：通过J48决策树，Naïve贝叶斯，RBF网络和用于WebSPAM检测的随机林分类器进行对恶意网页的分类
8. Thermoregulation in Surfers and Nonsurfers Immersed in Cold Water. [R] . Rochelle, R. D., Horvath, S. M. 1978

机译：冲浪者和非冲浪者的体温调节浸入冷水中。

Detecting Webspam Beneficiaries Using Information Collected by the Random Surfer

摘要

著录项

相似文献

相关主题

期刊订阅