Large-Scale Automatic Classification of Phishing Pages


Abstract

Phishing websites, fraudulent sites that impersonate a trusted third party to gain access to private data, continue to cost Internet users over a billion dollars each year. In this paper, we describe the design and performance characteristics of a scalable machine learning classifier we developed to detect phishing websites. We use this classifier to maintain Google's phishing blacklist automatically. Our classifier analyzes millions of pages a day, examining the URL and the contents of a page to determine whether or not a page is phishing. Unlike previous work in this field, we train the classifier on a noisy dataset consisting of millions of samples from previously collected live classification data. Despite the noise in the training data, our classifier learns a robust model for identifying phishing pages which correctly classifies more than 90% of phishing pages several weeks after training concludes.


Bibliographic record

Source
2010 Network and Distributed System Security Symposium | 2010 | pp. 1-14 | 14 pages
Conference location: San Diego, CA (US)
Authors
Colin Whittaker; Brian Ryner; Marria Nazif
Author affiliations
Google Inc. cwhittak@google.com;
Google Inc. bryner@google.com;
Google Inc. marria@google.com
Original format: PDF
Language: English
Chinese Library Classification: Security and confidentiality

