首页> 外文会议>International Symposium on Multidisciplinary Studies and Innovative Technologies >Phish-IRIS: A New Approach for Vision Based Brand Prediction of Phishing Web Pages via Compact Visual Descriptors
【24h】

Phish-IRIS: A New Approach for Vision Based Brand Prediction of Phishing Web Pages via Compact Visual Descriptors

机译:phish-eris:通过紧凑型视觉描述符的网络钓鱼网页的基于视觉品牌预测的一种新方法

获取原文

摘要

Phishing, a continuously growing cyber threat, aims to obtain innocent users' credentials by deceiving them via presenting fake web pages which mimic their legitimate targets. To date, various attempts have been carried out in order to detect phishing pages. In this study, we treat the problem of phishing web page identification as an image classification task and propose a machine learning augmented pure vision based approach which extracts and classifies compact visual features from web page screenshots. For this purpose, we employed several MPEG7 and MPEG7-like compact visual descriptors (SCD, CLD, CEDD, FCTH and JCD) to reveal color and edge based discriminative visual cues. Throughout the feature extraction process we have followed two different schemas working on either whole screenshots in a “holistic” manner or equal sized “patches” constructing a coarse-to-fine “pyramidal” representation. Moreover, for the task of image classification, we have built SVM and Random Forest based machine learning models. In order to assess the performance and generalization capability of the proposed approach, we have collected a mid-sized corpus covering 14 distinct brands and involving 2852 samples. According to the conducted experiments, our approach reaches up to 90.5% F1 score via SCD. As a result, compared to other studies, the suggested approach presents a lightweight schema serving competitive accuracy and superior feature extraction and inferring speed that enables it to be used as a browser plugin.
机译:网络钓鱼,一个不断增长的网络威胁,旨在通过展示模仿其合法目标的假网页来获得无辜的用户的凭据。迄今为止,已经进行了各种尝试以检测网络钓鱼页面。在本研究中,我们将网络钓鱼网页识别的问题视为图像分类任务,并提出了一种基于机器学习的基于纯Vision的方法,该方法从网页截图中提取和分类紧凑的视觉功能。为此目的,我们使用了几个MPEG7和MPEG7,类似于Compact Visual Descriptor(SCD,CLD,CEDD,FCTH和JCD),以显示基于颜色和边缘的鉴别视觉提示。在整个特征提取过程中,我们遵循两种不同的模式,以“整体”方式或平等尺寸的“贴片”构成粗致细“金字塔”表示的整个屏幕截图。此外,对于图像分类的任务,我们建立了SVM和随机林的机器学习模型。为了评估拟议方法的性能和泛化能力,我们收集了一个涵盖了14个不同品牌的中型语料库,涉及2852个样本。根据进行的实验,我们的方法通过SCD达到90.5%的F1得分。结果,与其他研究相比,建议的方法介绍了具有竞争精度和卓越的特征提取和推断速度的轻量级架构,使其能够用作浏览器插件。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号