Preventing False Positives in Content-Based Phishing Detection

Abstract

Content-based phishing detection extracts keywords from a target Web page, uses these keywords to retrieve the corresponding legitimate site, and detects phishing when the domain of the target page does not match that of the retrieved site. However, it often misidentifies a legitimate target site as a phishing site because the extracted keywords do not characterize the legitimate site with sufficient accuracy. Two methods are described for extracting keywords: domain keyword extraction, which extracts keywords not only from the page in the browser but also from pages linked from this page, and time-invariant keyword extraction, which extracts keywords from the page and previous versions of the page. Experiments using 172 legitimate sites demonstrated a reduction in the false detection rate from 14.0% to 7.6%, while experiments using 172 phishing sites demonstrated no change in the rate of overlooking phishing pages.
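
The pipeline described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not say how keywords are ranked, so plain term frequency is assumed here, and the function names (`domain_keywords`, `time_invariant_keywords`, `is_phishing`), the stopword list, and the `search` callable standing in for a search engine are all hypothetical. Domain comparison uses a naive "last two labels" rule, which is also an assumption.

```python
import re
from collections import Counter
from urllib.parse import urlparse

# Tiny illustrative stopword list (assumption, not from the paper).
STOPWORDS = {"the", "and", "for", "with", "your", "you", "from", "this", "that"}


def tokenize(text):
    """Lowercase the text and keep alphabetic words of 3+ letters."""
    return [w for w in re.findall(r"[a-z]{3,}", text.lower())
            if w not in STOPWORDS]


def domain_keywords(page_text, linked_texts, k=5):
    """Assumed form of domain keyword extraction: rank words by their
    total frequency over the page plus the pages it links to."""
    counts = Counter(tokenize(page_text))
    for text in linked_texts:
        counts.update(tokenize(text))
    return [w for w, _ in counts.most_common(k)]


def time_invariant_keywords(current_text, previous_texts, k=5):
    """Assumed form of time-invariant keyword extraction: keep only words
    present in the current page and in every previous version."""
    stable = set(tokenize(current_text))
    for old in previous_texts:
        stable &= set(tokenize(old))
    counts = Counter(w for w in tokenize(current_text) if w in stable)
    return [w for w, _ in counts.most_common(k)]


def registered_domain(url):
    """Naive domain: last two host labels (ignores suffixes like co.uk)."""
    host = urlparse(url).hostname or ""
    return ".".join(host.split(".")[-2:])


def is_phishing(target_url, keywords, search):
    """Flag the target when its domain matches none of the retrieved sites.
    `search` is a hypothetical callable: list of keywords -> list of URLs.
    An empty result list also flags the target in this sketch."""
    retrieved = {registered_domain(u) for u in search(keywords)}
    return registered_domain(target_url) not in retrieved
```

A caller would pass one of the two keyword lists to `is_phishing` together with its own search function. The intuition behind the reported drop in false detections is that words shared across linked pages or across page versions are more likely to identify the site itself than words tied to one page or one moment.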
