Web spam challenge proposal for filtering in archives

机译：针对垃圾邮件进行过滤的网络垃圾邮件挑战建议

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper we propose new tasks for a possible future Web Spam Challenge motivated by the needs of the archival community. The Web archival community consists of several relatively small institutions that operate independently and possibly over different top level domains (TLDs). Each of them may have a large set of historic crawls. Efficient filtering would hence require (1) enhanced use of the time series of domain snapshots and (2) collaboration by transferring models across different TLDs. Corresponding Challenge tasks could hence include the distribution of crawl snapshot data for feature generation as well as classification of unlabeled new crawls of the same or even different TLDs.

机译：在本文中，我们根据档案社区的需求为可能的未来Web垃圾邮件挑战提出了新的任务。 Web归档社区由几个相对较小的机构组成，这些机构独立运作，并可能在不同的顶级域（TLD）上运作。他们每个人可能都有大量的历史爬网。因此，有效的过滤将要求（1）增强使用域快照的时间序列，以及（2）通过在不同TLD之间传输模型来进行协作。因此，相应的质询任务可能包括分发爬网快照数据以生成功能，以及对相同或什至不同TLD的未标记新爬网进行分类。

著录项

来源
《5th international workshop on adversarial information retrieval on the web 2009》|2009年|P.61 - 62|共2页
会议地点
作者
Andras A. Benczur; Miklos Erdelyi; Julien Masanes; David Siklosi;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类计算机网络;
关键词
challenge; document classification; evaluation; information retrieval; web archival; web spam;

机译：挑战;文档分类;评估;信息检索;网络档案;网络垃圾邮件;

相似文献

外文文献
中文文献
专利

1. The 419 scam: information warfare on the spam front and a proposal for local filtering [J] . Edelson E. Computers & Security . 2003,第5期

机译：419骗局：垃圾邮件战役中的信息战以及对本地过滤的建议
2. WSF2: A Novel Framework for Filtering Web Spam [J] . Fdez-Glez J., Ruano-Ordas D., Laza R., Scientific programming . 2016,第Pta1期

机译：WSF2：一种用于过滤Web垃圾邮件的新颖框架
3. Efficient and effective spam filtering and re-ranking for large web datasets [J] . Gordon V. Cormack, Mark D. Smucker, Charles L. A. Clarke Information retrieval . 2011,第5期

机译：高效且有效的垃圾邮件过滤和大型Web数据集重新排名
4. Web Spam Challenge Proposal for Filtering in Archives [C] . Andras A. Benczur, Miklos Erdelyi, Julien Masanes, 5th international workshop on adversarial information retrieval on the web 2009 . 2009

机译：针对垃圾邮件进行过滤的网络垃圾邮件挑战提案
5. Spam e-mail filtering via global and user-level dynamic ontologies. [D] . Youn, Seongwook. 2009

机译：通过全局和用户级动态本体过滤垃圾邮件。
6. A Proposal for Including Patient-Generated Web-based Creative Writing Material into Psychotherapy: Advantages and Challenges [O] . Timothy Lawver 2008

机译：关于将患者生成的基于网络的创意写作材料纳入心理治疗的提案：优势与挑战
7. Web Spam Challenge Proposal for Filtering in Archives∗ [O] . András A. Benczúra Miklós Erdélyic, A Julien Masanésb Dávid Siklósia 2014

机译：针对档案过滤的网络垃圾邮件挑战提案*

Web spam challenge proposal for filtering in archives

摘要

著录项

相似文献

相关主题

期刊订阅