A new machine learning-based method for android malware detection on imbalanced dataset

Dehkordy Diyana Tehrany; Rasoolzadegan Abbas

首页> 外文期刊>Multimedia Tools and Applications >A new machine learning-based method for android malware detection on imbalanced dataset

【24h】

A new machine learning-based method for android malware detection on imbalanced dataset

机译：基于机器学习的基于机器学习的Android Malware检测方法，用于基于Inbalanced DataSet

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Nowadays, malware applications are dangerous threats to Android devices, users, developers, and application stores. Researchers are trying to discover new methods for malware detection because the complexity of malwares, their continuous changes, and damages caused by their attacks have increased. One of the most important challenges in detecting malware is to have a balanced dataset. In this paper, a detection method is proposed to identify malware to improve accuracy and reduce error rates by preprocessing the used dataset and balancing it. To attain these purposes, the static analysis is used to extract features of the applications. The ranking methods of features are used to preprocess the feature set and the low-effective features are removed. The proposed method also balances the dataset by using the techniques of undersampling, the Synthetic Minority Oversampling Technique (SMOTE), and a combination of both methods, which have not yet been studied among detection methods. Then, the classifiers of K-Nearest Neighbor (KNN), Support Vector Machine, and Iterative Dichotomiser 3 are used to create the detection model. The performance of KNN with SMOTE is better than the performance of the other classifiers. The obtained results indicate that the criteria of precision, recall, accuracy, F-measure, and Matthews Correlation Coefficient are over 97%. The proposed method is effective in detecting 99.49% of the malware's existing in the used dataset and new malware.

机译：如今，恶意软件的应用是Android设备，用户，开发者和应用商店危险的威胁。研究人员正在试图发现的恶意软件检测新方法，因为恶意软件，他们的连续变化，并引起他们的攻击破坏的复杂性也随之增加。一个在检测恶意软件的最重要的挑战之一是有一个平衡的数据集。在本文中，检测方法提出了识别恶意软件，以提高精度和通过预处理使用的数据集和平衡它降低错误率。为了达到这些目的，静态分析用于应用程序的特征提取。的特征的排序方法用来预处理功能集和低有效特征被去除。所提出的方法还通过使用欠采样技术平衡数据集，合成少数过采样技术（SMOTE），和这两种方法，这还没有被检测方法中研究的组合。然后，K最近邻的（KNN），支持向量机，以及迭代Dichotomiser 3的分类器用于创建检测模型。 KNN与击打性能比其他分类器的性能更好。将所得到的结果表明，精度，召回，准确度，F值，和马修斯相关系数的范围是97％以上。所提出的方法可有效地检测存在于所使用的数据集和新的恶意软件恶意软件的的99.49％。

著录项

来源
《Multimedia Tools and Applications》 |2021年第16期|24533-24554|共22页
作者
Dehkordy Diyana Tehrany; Rasoolzadegan Abbas;
展开▼
作者单位

Ferdowsi Univ Mashhad Fac Engn Dept Comp Engn Mashhad Razavi Khorasan Iran;

Ferdowsi Univ Mashhad Fac Engn Dept Comp Engn Mashhad Razavi Khorasan Iran;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Malware detection; Android applications classification; Dataset balancing; SMOTE balancing;

机译：恶意软件检测;Android应用程序分类;数据集平衡;Smote Balancing;

相似文献

外文文献
中文文献
专利

1. Fuzzy–synthetic minority oversampling technique: Oversampling based on fuzzy set theory for Android malware detection in imbalanced datasets [J] . Yanping Xu, Chunhua Wu, Kangfeng Zheng, International Journal of Distributed Sensor Networks . 2017,第4期

机译：模糊综合少数群体过采样技术：基于模糊集理论的过采样用于不平衡数据集中的Android恶意软件检测
2. Empirical assessment of machine learning-based malware detectors for Android Measuring the gap between in-the-lab and in-the-wild validation scenarios [J] . Allix Kevin, Bissyande Tegawende F., Jerome Quentin, Empirical Software Engineering . 2016,第1期

机译：基于Android的基于机器学习的恶意软件检测器的经验评估衡量实验室和野外验证方案之间的差距
3. Imbalanced learning based on adaptive weighting and Gaussian function synthesizing with an application on Android malware detection [J] . Pang Ying, Peng Lizhi, Chen Zhenxiang, Information Sciences: An International Journal . 2019,第期

机译：基于Android恶意软件检测应用程序合成自适应加权和高斯功能的不平衡学习
4. Are Your Training Datasets Yet Relevant? An Investigation into the Importance of Timeline in Machine Learning-Based Malware Detection [C] . Kevin Allix, Tegawende F. Bissyande, Jacques Klein, International symposium on engineering secure software and systems . 2015

机译：您的训练数据集还相关吗？时间轴在基于机器学习的恶意软件检测中的重要性的调查
5. Active learning with support vector machines for imbalanced datasets and a method for stopping active learning based on stabilizing predictions. [D] . Bloodgood, Michael. 2009

机译：支持向量机用于不平衡数据集的主动学习，以及一种基于稳定预测的主动学习停止方法。
6. DeepDetectNet vs RLAttackNet: An adversarial method to improve deep learning-based static malware detection model [O] . Yong Fang, Yuetian Zeng, Beibei Li, 2020

机译：DeepDetectNet VS RLATTACKNET：一种改进基于深度学习的静态恶意软件检测模型的对手方法
7. Are Your Training Datasets Yet Relevant? - An Investigation into the Importance of Timeline in Machine Learning-Based Malware Detection [O] . Allix, Kevin, Bissyande, Tegawendé François D Assise, Klein, Jacques, 2015

机译：您的培训数据集是否相关？ - 基于机器学习的恶意软件检测中时间线重要性的研究
8. Methods to Address Extreme Class Imbalance in Machine Learning Based Network Intrusion Detection Systems. [R] . Walter, R. W. 2016

机译：解决基于机器学习的网络入侵检测系统中极端类不平衡的方法。

A new machine learning-based method for android malware detection on imbalanced dataset

摘要

著录项

相似文献

相关主题

期刊订阅