首页> 外文期刊>Multimedia Tools and Applications >A new machine learning-based method for android malware detection on imbalanced dataset
【24h】

A new machine learning-based method for android malware detection on imbalanced dataset

机译:基于机器学习的基于机器学习的Android Malware检测方法,用于基于Inbalanced DataSet

获取原文
获取原文并翻译 | 示例
       

摘要

Nowadays, malware applications are dangerous threats to Android devices, users, developers, and application stores. Researchers are trying to discover new methods for malware detection because the complexity of malwares, their continuous changes, and damages caused by their attacks have increased. One of the most important challenges in detecting malware is to have a balanced dataset. In this paper, a detection method is proposed to identify malware to improve accuracy and reduce error rates by preprocessing the used dataset and balancing it. To attain these purposes, the static analysis is used to extract features of the applications. The ranking methods of features are used to preprocess the feature set and the low-effective features are removed. The proposed method also balances the dataset by using the techniques of undersampling, the Synthetic Minority Oversampling Technique (SMOTE), and a combination of both methods, which have not yet been studied among detection methods. Then, the classifiers of K-Nearest Neighbor (KNN), Support Vector Machine, and Iterative Dichotomiser 3 are used to create the detection model. The performance of KNN with SMOTE is better than the performance of the other classifiers. The obtained results indicate that the criteria of precision, recall, accuracy, F-measure, and Matthews Correlation Coefficient are over 97%. The proposed method is effective in detecting 99.49% of the malware's existing in the used dataset and new malware.
机译:如今,恶意软件的应用是Android设备,用户,开发者和应用商店危险的威胁。研究人员正在试图发现的恶意软件检测新方法,因为恶意软件,他们的连续变化,并引起他们的攻击破坏的复杂性也随之增加。一个在检测恶意软件的最重要的挑战之一是有一个平衡的数据集。在本文中,检测方法提出了识别恶意软件,以提高精度和通过预处理使用的数据集和平衡它降低错误率。为了达到这些目的,静态分析用于应用程序的特征提取。的特征的排序方法用来预处理功能集和低有效特征被去除。所提出的方法还通过使用欠采样技术平衡数据集,合成少数过采样技术(SMOTE),和这两种方法,这还没有被检测方法中研究的组合。然后,K最近邻的(KNN),支持向量机,以及迭代Dichotomiser 3的分类器用于创建检测模型。 KNN与击打性能比其他分类器的性能更好。将所得到的结果表明,精度,召回,准确度,F值,和马修斯相关系数的范围是97%以上。所提出的方法可有效地检测存在于所使用的数据集和新的恶意软件恶意软件的的99.49%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号