LookAhead: Augmenting Crowdsourced Website Reputation Systems with Predictive Modeling

机译：前瞻：通过预测建模增强众包网站信誉系统

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Unsafe websites consist of malicious as well as inappropriate sites, such as those hosting questionable or offensive content. Website reputation systems are intended to help ordinary users steer away from these unsafe sites. However, the process of assigning safety ratings for websites typically involves humans. Consequently it is time consuming, costly and not scalable. This has resulted in two major problems: (ⅰ) a significant proportion of the web space remains unrated and (ⅱ) there is an unacceptable time lag before new websites are rated. In this paper, we show that by leveraging structural and content-based properties of websites, we can reliably and efficiently predict their safety ratings, thereby mitigating both problems. We demonstrate the effectiveness of our approach using four datasets of up to 90,000 websites. We use ratings from Web of Trust (WOT), a popular crowdsourced web reputation system, as ground truth. We propose a novel ensemble classification technique that makes opportunistic use of available structural and content properties of web pages to predict their eventual ratings in two dimensions used by WOT: trustworthiness and child safety. Ours is the first classification system to predict such subjective ratings. The same approach works equally well in identifying malicious websites. Across all datasets, our classification achieves average F_1-score in the 74-90% range.

机译：不安全的网站包括恶意和不适当的网站，例如那些托管有问题或令人反感的内容的网站。网站信誉系统旨在帮助普通用户远离这些不安全的站点。但是，为网站分配安全等级的过程通常涉及人员。因此，这是费时，昂贵且不可扩展的。这导致了两个主要问题：（ⅰ）很大一部分网站空间保持未评级，并且（ⅱ）在对新网站进行评级之前存在不可接受的时间间隔。在本文中，我们表明，通过利用网站的结构和基于内容的属性，我们可以可靠，有效地预测其安全等级，从而缓解这两个问题。我们使用多达90,000个网站的四个数据集证明了我们方法的有效性。我们使用来自流行的众包Web信誉系统Web of Trust（WOT）的评级作为事实。我们提出了一种新颖的集成分类技术，该技术可以利用机会利用网页的可用结构和内容属性来预测WOT使用的两个维度的最终评级：可信赖性和儿童安全。我们是第一个预测此类主观评分的分类系统。同样的方法在识别恶意网站方面同样有效。在所有数据集中，我们的分类均在74-90％的范围内获得了平均F_1得分。

著录项

来源
《International conference on trust and trustworthy computing》|2015年|143-162|共20页
会议地点
作者
Sourav Bhattacharya; Otto Huhta; N. Asokan;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. In-Feed Native Advertising on News Websites: Effects of Advertising Format, Website Reputation, and Product Involvement [J] . Lijie Zhou, Fei Xue Journal of Internet Commerce . 2019,第1a4期

机译：新闻网站上的正在投放的本地广告：广告格式，网站声誉和产品参与度的影响
2. The impact of disposition to privacy, website reputation and website familiarity on information privacy concerns [J] . Yuan Li Decision support systems . 2014,第jana期

机译：处置对隐私，网站声誉和网站熟悉度对信息隐私问题的影响
3. Reputation-based crowdsourced Wi-Fi topology discovery [J] . Frangoudis Pantelis A., Polyzos George C. Computer networks . 2015,第mara14期

机译：基于信誉的众包Wi-Fi拓扑发现
4. LookAhead: Augmenting Crowdsourced Website Reputation Systems with Predictive Modeling [C] . Sourav Bhattacharya, Otto Huhta, N. Asokan International Conference on Trust and Trustworthy Computing . 2015

机译：Lookahead：通过预测建模增强众群网站声誉系统
5. Towards Developing Computational Models to Predict Perceived Visual Aesthetics of Website Interface Design. [D] . Altaboli, Ahamed A. O. 2012

机译：致力于开发计算模型来预测网站界面设计的感知视觉美学。
6. Crowdsourced Traffic Event Detection and Source Reputation Assessment Using Smart Contracts [O] . Jernej Mihelj, Yuan Zhang, Andrej Kos, 2019

机译：使用智能合约进行众包交通事件检测和源信誉评估
7. LookAhead: Augmenting Crowdsourced Website Reputation Systems With Predictive Modeling [O] . Bhattacharya, Sourav, Huhta, Otto, Asokan, N. 2015

机译：Lookahead：增加众包网站信誉系统预测建模

LookAhead: Augmenting Crowdsourced Website Reputation Systems with Predictive Modeling

摘要

著录项

相似文献

相关主题

期刊订阅