...
首页> 外文期刊>Journal of computer sciences >Machine Learning and Deep Learning for Phishing Email Classification using One-Hot Encoding
【24h】

Machine Learning and Deep Learning for Phishing Email Classification using One-Hot Encoding

机译:使用单热编码的机器学习和深度学习,用于网络钓鱼电子邮件分类

获取原文
   

获取外文期刊封面封底 >>

       

摘要

Representation of text is a significant task in Natural Language Processing (NLP) and in recent years Deep Learning (DL) and Machine Learning (ML) have been widely used in various NLP tasks like topic classification, sentiment analysis and language translation. Until very recently, little work has been devoted to semantic analysis in phishing detection or phishing email detection. The novelty of this study is in using deep semantic analysis to capture inherent characteristics of the text body. One-hot encoding was used with DL and ML techniques to classify emails as phishing or non-phishing. A comparison of various parameters and hyperparameters was performed for DL. The results of various ML models, Na?ve Bayes, SVM, Decision Tree, as well as DL models, Convolutional Neural Networks (CNN) and Long Short Term Memory (LSTM), were presented. The DL models performed better than the ML models in terms of accuracy, but the ML models performed better than the DL models in terms of computation time. CNN with Word Embedding performed the best in terms of accuracy (96.34%), demonstrating the effectiveness of semantic analysis in phishing email detection.
机译:文本的表示是自然语言处理(NLP)的重要任务,近年来,深度学习(DL)和机器学习(ML)已被广泛应用于各种NLP任务,如主题分类,情感分析和语言翻译。直到最近,在网络钓鱼检测或网络钓鱼电子邮件检测中,致力于语义分析。本研究的新颖性正在利用深度语义分析来捕捉文本体的固有特征。单热编码与DL和ML技术一起使用,以将电子邮件分类为网络钓鱼或非网络钓鱼。对DL进行各种参数和封闭参数的比较。展示了各种ML模型,Na of贝叶斯,SVM,决策树以及DL模型,卷积神经网络(CNN)和长短期内存(LSTM)的结果。在精度方面比ML模型更好地执行DL型号,但在计算时间方面比DL模型更好地执行ML模型。有关单词嵌入的CNN在准确性(96.34%)方面表现了最佳,展示了语义分析在网络钓鱼电子邮件检测中的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号