首页> 外文会议>IEEE International Conference on Communications >Limited Dictionary Builder: An approach to select representative tokens for malicious URLs detection
【24h】

Limited Dictionary Builder: An approach to select representative tokens for malicious URLs detection

机译:有限公司字典构建器:一种选择代表性令牌的方法,用于恶意网址检测

获取原文

摘要

Cybercriminals use Malicious Uniform Resource Locators (URLs) as the entry to implement a variety of web attacks, such as phishing, spamming, and malware distribution, which may lead to huge finance and data loss. Thus, malicious URLs should be detected as accurately and quickly as possible. Heuristic-based detection approaches are one of the most popular methods to achieve the above goals. The detection results come from the usage of many heuristic features in this approach. However, tremendous new pages and meaningless tokens lead to the explosion of feature sets, and exhaust memory space finally. In this paper, we try to address the problem by selecting some representative members from the initial feature set, which should have the best predictive ability among the same number of selected features. For each feature, we give an evaluation method of O(1) complexity to measure its predictive ability. Then we make the selection based on all the measured values with linear complexity. Experimental results show that our approach can achieve almost the same false negative rate using only 8.3% features for malicious URLs detection, comparing with prior approaches. Moreover, our approach may work efficiently in the big data era, as it can handle 20 thousand URLs per second in our experiments on average.
机译:网络犯罪分子使用恶意统一资源定位器(URL)作为进入,以实现各种Web攻击,例如网络钓鱼,垃圾邮件和恶意软件分布,这可能导致巨额金融和数据丢失。因此,应尽可能准确地检测恶意URL。基于启发式的检测方法是实现上述目标最受欢迎的方法之一。检测结果来自这种方法中许多启发式特征的使用。然而,巨大的新页面和无意义的令牌导致功能集的爆炸,最后是排气存储空间。在本文中,我们尝试通过从初始功能集中选择一些代表成员来解决问题,这应该在相同数量的所选功能之间具有最佳的预测能力。对于每个特征,我们提供O(1)复杂性的评估方法来测量其预测能力。然后,我们基于具有线性复杂度的所有测量值进行选择。实验结果表明,我们的方法可以使用仅用于恶意URL检测的8.3%的特征来实现几乎相同的假负率,与现有方法相比。此外,我们的方法可以在大数据时代有效地工作,因为它可以平均处理我们的实验中每秒20万UTL。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号