4th International Workshop on Adversarial Information Retrieval on the Web (AIRWeb 2008)

Robust PageRank and locally computable spam detection features



Abstract

Before the advent of the World Wide Web, information retrieval algorithms were developed for relatively small and coherent document collections such as newspaper articles or book catalogs in a library. In comparison to these collections, the Web is massive, much less coherent, changes more rapidly, and is spread over geographically distributed computers. Scaling information retrieval algorithms to the World Wide Web is a challenging task. Success to date is reflected in the ubiquitous use of search engines to access Internet content.

From the point of view of a search engine, the Web is a mix of two types of content: the "closed Web" and the "open Web". The closed Web comprises a few high-quality controlled collections which a search engine can fully trust. The "open Web," on the other hand, includes the vast majority of Web pages, which lack an authority asserting their quality. The openness of the Web has been the key to its rapid growth and success. However, this openness is also a major source of new challenges for information retrieval methods.

Adversarial Information Retrieval addresses tasks such as gathering, indexing, filtering, retrieving, and ranking information from collections in which a subset has been manipulated maliciously. On the Web, the predominant form of such manipulation is "search engine spamming," or spamdexing: malicious attempts to influence the outcome of ranking algorithms, aimed at getting an undeserved high ranking for some items in the collection. There is an economic incentive to rank higher in search engines, since a good ranking is strongly correlated with more traffic, which often translates into more revenue.
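To make the manipulation concrete, the following is a minimal sketch of classic PageRank power iteration on a toy link graph; it illustrates the kind of link-based ranking algorithm that spamdexing targets, and how a small "link farm" can inflate one page's score. The graph, node names, and parameters are illustrative assumptions, not taken from the paper.

```python
def pagerank(links, damping=0.85, iterations=50):
    """links: dict mapping each node to the list of nodes it links to."""
    nodes = list(links)
    n = len(nodes)
    rank = {node: 1.0 / n for node in nodes}
    for _ in range(iterations):
        # Every node keeps the (1 - damping) teleportation share.
        new_rank = {node: (1.0 - damping) / n for node in nodes}
        for node, outlinks in links.items():
            if outlinks:
                # Split this node's rank evenly among its out-links.
                share = damping * rank[node] / len(outlinks)
                for target in outlinks:
                    new_rank[target] += share
            else:
                # Dangling node: distribute its rank uniformly.
                for target in nodes:
                    new_rank[target] += damping * rank[node] / n
        rank = new_rank
    return rank

# A tiny hypothetical "link farm": low-value pages pointing at one
# target inflate its score -- the manipulation that robust PageRank
# variants and spam-detection features aim to resist.
graph = {
    "target": [],
    "farm1": ["target"],
    "farm2": ["target"],
    "honest": ["farm1"],
}
scores = pagerank(graph)
```

Here the two farm pages funnel their rank into `target`, which ends up with the highest score despite having no independent endorsement, which is precisely the undeserved high ranking the abstract describes.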


