Limited Dictionary Builder: An approach to select representative tokens for malicious URLs detection

机译：有限公司字典构建器：一种选择代表性令牌的方法，用于恶意网址检测

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Cybercriminals use Malicious Uniform Resource Locators (URLs) as the entry to implement a variety of web attacks, such as phishing, spamming, and malware distribution, which may lead to huge finance and data loss. Thus, malicious URLs should be detected as accurately and quickly as possible. Heuristic-based detection approaches are one of the most popular methods to achieve the above goals. The detection results come from the usage of many heuristic features in this approach. However, tremendous new pages and meaningless tokens lead to the explosion of feature sets, and exhaust memory space finally. In this paper, we try to address the problem by selecting some representative members from the initial feature set, which should have the best predictive ability among the same number of selected features. For each feature, we give an evaluation method of O(1) complexity to measure its predictive ability. Then we make the selection based on all the measured values with linear complexity. Experimental results show that our approach can achieve almost the same false negative rate using only 8.3% features for malicious URLs detection, comparing with prior approaches. Moreover, our approach may work efficiently in the big data era, as it can handle 20 thousand URLs per second in our experiments on average.

机译：网络犯罪分子使用恶意统一资源定位器（URL）作为进入，以实现各种Web攻击，例如网络钓鱼，垃圾邮件和恶意软件分布，这可能导致巨额金融和数据丢失。因此，应尽可能准确地检测恶意URL。基于启发式的检测方法是实现上述目标最受欢迎的方法之一。检测结果来自这种方法中许多启发式特征的使用。然而，巨大的新页面和无意义的令牌导致功能集的爆炸，最后是排气存储空间。在本文中，我们尝试通过从初始功能集中选择一些代表成员来解决问题，这应该在相同数量的所选功能之间具有最佳的预测能力。对于每个特征，我们提供O（1）复杂性的评估方法来测量其预测能力。然后，我们基于具有线性复杂度的所有测量值进行选择。实验结果表明，我们的方法可以使用仅用于恶意URL检测的8.3％的特征来实现几乎相同的假负率，与现有方法相比。此外，我们的方法可以在大数据时代有效地工作，因为它可以平均处理我们的实验中每秒20万UTL。

著录项

来源
《IEEE International Conference on Communications》|2015年||共6页
会议地点
作者
Hongzhou Sha; Zhou Zhou; Qingyun Liu; Tingwen Liu; Chao Zheng;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类通信;
关键词

相似文献

外文文献
中文文献
专利

1. Firstfilter: A cost-sensitive approach to malicious URL detection in large-scale enterprise networks [J] . L. Vu, P. Nguyen, D. Turaga IBM Journal of Research and Development . 2016,第4期

机译：Firstfilter：一种成本敏感的方法，用于大规模企业网络中的恶意URL检测
2. URLdeepDetect: A Deep Learning Approach for Detecting Malicious URLs Using Semantic Vector Models [J] . Sara Afzal, Muhammad Asim, Abdul Rehman Javed, Journal of network and systems management . 2021,第3期

机译：UrldeepDetect：使用语义矢量模型来检测恶意URL的深度学习方法
3. Websites Phishing Detection Using URLs Tokens as a Discriminating Features [J] . Ammar Yahya Daeef, R. Badlishah Ahmad, Yasmin Yacob Journal of Engineering & Applied Sciences . 2017,第3期

机译：网站网络钓鱼检测使用URL令牌作为鉴别功能
4. Limited Dictionary Builder: An approach to select representative tokens for malicious URLs detection [C] . Hongzhou Sha, Zhou Zhou, Qingyun Liu, IEEE International Conference on Communications . 2015

机译：受限词典生成器：一种选择代表令牌进行恶意URL检测的方法
5. Cloud computing based detection of malicious URL attacks on Android Smart phones. [D] . Adas, Husam A. 2013

机译：基于云计算的Android智能手机上恶意URL攻击的检测。
6. Malicious URL Detection Based on Associative Classification [O] . Sandra Kumi, ChaeHo Lim, Sang-Gon Lee 2021

机译：基于关联分类的恶意URL检测
7. An effective cost-sensitive XGBoost method for malicious URLs detection in imbalanced dataset [O] . Shen He, Bangling Li, Huaxi Peng, 2021

机译：一种有效的成本敏感的XGBoost方法，用于恶意数据集中的恶意URL检测

Limited Dictionary Builder: An approach to select representative tokens for malicious URLs detection

摘要

著录项

相似文献

相关主题

期刊订阅