A comparative study of support vector machine and neural networks for file type identification using n-gram analysis

Sester Joachim; Hayes Darren; Scanlon Mark; Nhien-An Le-Khac

首页> 外文期刊>Digital investigation >A comparative study of support vector machine and neural networks for file type identification using n-gram analysis

【24h】

A comparative study of support vector machine and neural networks for file type identification using n-gram analysis

机译：使用N-GRAM分析对锉刀型识别的支持向量机和神经网络的比较研究

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

File type identification (FTI) has become a major discipline for anti-virus developers, firewall designers and for forensic cybercrime investigators. Over the past few years, research has seen the introduction of several classifiers and features. One of these advances is the so-called n-grams analysis, which is an interpretation of statistical counting in classified fragments. Recently, n-grams based approaches were already successfully combined with computational intelligence classifiers. However, the academic body of literature is scant when it comes to a comprehensive explanation of machine learning based approaches such as neural networks (NN) or support vector machines (SVM). For example, how the input parameters, including learning rate, different values of n for n-grams, etc. influence the results. In addition, very few studies have compared the scalability of NN vs. SVM approaches. Therefore, a systematic research in comparing different approaches is needed to address these questions. Hence, this paper investigates this type of comparison, by focusing on the n-gram analysis as a feature for the two different classifiers: SVMs and NNs. This paper details our experiments with two NNs and four SVMs, using linear kernels and RBF kernels on RealDC datasets. In general, we found that SVM-based approaches performed better than the NN, but their scalability is still a challenge. (c) 2021 The Authors. Published by Elsevier Ltd.

机译：文件类型识别（FTI）已成为防病毒开发人员，防火墙设计师和法医网络犯罪调查人员的主要学科。在过去的几年里，研究已经看到了几种分类器和特征的引入。其中一个进步是所谓的n-grams分析，这是对分类片段中统计计数的解释。最近，基于N-GRAMS的方法已经成功地与计算智能分类器结合。然而，在基于机器学习的方法（如神经网络（NN）或支持向量机（SVM）的方法中，文学的学术态度是令人勉强的。例如，输入参数，包括学习率，n-grams的n个不同值的方式如何影响结果。此外，很少有研究比较了NN与SVM方法的可扩展性。因此，需要进行对比较不同方法的系统研究来解决这些问题。因此，本文通过将N-GRAM分析专注于两个不同分类器的特征来研究这种比较：SVM和NNS。本文使用REALDC数据集中的线性内核和RBF内核详细说明了我们的两位NNS和四个SVM的实验。通常，我们发现基于SVM的方法比NN更好地表现，但它们的可扩展性仍然是一个挑战。（c）2021作者。 elsevier有限公司出版

著录项

来源
《Digital investigation》 |2021年第3期|301121.1-301121.10|共10页
作者
Sester Joachim; Hayes Darren; Scanlon Mark; Nhien-An Le-Khac;
展开▼
作者单位

Bundesminist Inneren BMI Nrw Germany;

Pace Univ New York NY 10038 USA;

Univ Coll Dublin Dublin 4 Ireland;

Univ Coll Dublin Dublin 4 Ireland;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
File type identification; n-grams analysis; Forensic analysis; Neural networks; Support vector machine;

机译：文件类型识别;n-grams分析;法医分析;神经网络;支持向量机;

相似文献

外文文献
中文文献
专利

1. Meta-analysis of deep neural networks in remote sensing: A comparative study of mono-temporal classification to support vector machines [J] . Heydari Shahriar S., Mountrakis Giorgos ISPRS Journal of Photogrammetry and Remote Sensing . 2019,第JUNa期

机译：深度神经网络在遥感中的荟萃分析：单时分类支持向量机的比较研究
2. Meta-analysis of deep neural networks in remote sensing: A comparative study of mono-temporal classification to support vector machines [J] . Heydari Shahriar S., Mountrakis Giorgos ISPRS Journal of Photogrammetry and Remote Sensing . 2019,第Juna期

机译：遥感中深度神经网络的META分析：单时刻分类对支持向量机的比较研究
3. Credit rating analysis with support vector machines and neural networks: a market comparative study [J] . Zan Huang, Hsinchun Chen, Chia-Jung Hsu, Decision support systems . 2004,第4期

机译：支持向量机和神经网络的信用评级分析：市场比较研究
4. A Comparative Study of Extreme Learning Machine, Least Squares Support Vector Machine, Back Propagation Neural Network for Outlet Total Phosphorus Prediction [C] . Tingting Yu, Yun Bai Prognostics and System Health Management Conference . 2018

机译：极限学习机，最小二乘支持向量机，反向传播神经网络用于出口总磷预测的比较研究
5. Brainprint: Identifying unique features of neural activity using cross-correlation, support vector machines and neural networks to evaluate its use as an authentication method. [D] . Ruiz Blondet, Maria Virginia. 2014

机译：大脑印记：使用互相关，支持向量机和神经网络来识别神经活动的独特特征，以评估其作为身份验证方法的用途。
6. Development of support vector machine-based model and comparative analysis with artificial neural network for modeling the plant tissue culture procedures: effect of plant growth regulators on somatic embryogenesis of chrysanthemum as a case study [O] . Mohsen Hesami, Roohangiz Naderi, Masoud Tohidfar, 2020

机译：基于支持向量机的模型的发展与人工神经网络对植物组织培养程序建模的比较分析：植物生长调节剂对菊花体细胞胚胎发生的影响为例
7. A study and identification of COVID-19 viruses using N-grams with Naïve Bayes, K-Nearest Neighbors, Artificial Neural Networks, Decision tree and Support Vector Machine [O] . Mohamed El Boujnouni 2020

机译：利用Naïve贝叶斯，K-CORLED邻居，人工神经网络，决策树和支持向量机的研究和鉴定Covid-19病毒

A comparative study of support vector machine and neural networks for file type identification using n-gram analysis

摘要

著录项

相似文献

相关主题

期刊订阅