首页> 外文学位 >Learning to detect malicious URLs.
【24h】

Learning to detect malicious URLs.

机译:学习检测恶意URL。

获取原文
获取原文并翻译 | 示例

摘要

Malicious Web sites are a cornerstone of Internet criminal activities. They host a variety of unwanted content ranging from spam-advertised products, to phishing sites, to dangerous "drive-by" exploits that infect a visitor's machine with malware. As a result, there has been broad interest in developing systems to prevent the end user from visiting such sites. The most prominent existing approaches to the malicious URL problem are manually-constructed blacklists, as well as client-side systems that analyze the content or behavior of a Web site as it is visited.;The premise of this dissertation is that we should be able to construct a lightweight URL classification system that simultaneously overcomes the challenges that face blacklists (which have manual updates that can quickly become obsolete) and client-side systems (which are difficult to deploy on a large scale because of their high overhead). To this end, our contribution is that we develop a highly effective system for malicious URL detection that (in its final form) leverages large numbers of features and online learning to scalably and adaptively construct an accurate classifier. Because our system exploits large amounts of training data and adapts to day-by-day variations, we are able to classify URLs with up to 99% accuracy.;As part of pursuing malicious URL detection, this dissertation addresses issues that arise from the use of online learning for this application. Thus, our further contributions include advances in understanding the role of uncertainty in online learning, as well as the benefits of exploiting feature correlations in high-dimensional applications such as URL classification. Overall, the contributions of this dissertation make significant advances in improving malicious URL detection and understanding the role of online learning in this application.
机译:恶意网站是互联网犯罪活动的基石。它们托管各种有害内容,从垃圾邮件广告产品,网络钓鱼站点到危险的“偷渡式”漏洞,这些漏洞利用恶意软件感染访问者的计算机。结果,人们对开发防止最终用户访问此类站点的系统产生了广泛的兴趣。现有的解决恶意URL问题的最主要方法是手动构建的黑名单,以及用于分析网站的内容或行为的客户端系统。本文的前提是我们应该能够构建轻量级的URL分类系统,同时克服了黑名单(具有可能会很快变得过时的手动更新)和客户端系统(由于开销大而难以大规模部署)所面临的挑战。为此,我们的贡献在于,我们开发了一种高效的恶意URL检测系统(最终形式),该系统利用大量功能和在线学习来可扩展和自适应地构建准确的分类器。由于我们的系统利用大量的训练数据并适应日常变化,因此我们能够以高达99%的准确度对URL进行分类。作为追求恶意URL检测的一部分,本论文解决了使用过程中出现的问题此应用程序的在线学习。因此,我们的进一步贡献包括在理解不确定性在在线学习中的作用方面取得的进展,以及在诸如URL分类之类的高维度应用程序中利用特征相关性的好处。总体而言,本文的贡献在改进恶意URL检测和理解在线学习在此应用程序中的作用方面取得了重大进展。

著录项

  • 作者

    Ma, Justin Tung.;

  • 作者单位

    University of California, San Diego.;

  • 授予单位 University of California, San Diego.;
  • 学科 Computer Science.
  • 学位 Ph.D.
  • 年度 2010
  • 页码 143 p.
  • 总页数 143
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号