A Comparative Study of Parametric Versus Non-Parametric Text Classification Algorithms

机译：参数与非参数文本分类算法的比较研究

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Evolution of modern technologies allowed to store the text in various digital formats such as e-mails, e-documents, libraries, etc. The amount of text data that is produced daily is increasing dramatically. Discovering useful patterns in text that can be represented in unstructured, semi-structured or structured format is a difficult task that requires a good understanding of machine learning algorithms. Finding a suitable algorithm for text mining tasks such as classification, clustering or natural language processing is a demanding situation that tests researchers’ abilities. This paper provides an overview of the text mining process also, presents a comparison of the performance and limitations of two predictive models generated using the parametric Naïve Bayes algorithm and nonparametric Deep Learning neural network. RapidMiner data science software platform has been used for models’ implementations and e-mail classification.

机译：现代技术的发展允许以各种数字格式存储文本，例如电子邮件，电子文档，图书馆等。每天产生的文本数据量急剧增加。在文本中发现可用非结构化，半结构化或结构化格式表示的有用模式是一项艰巨的任务，需要对机器学习算法有充分的了解。为诸如分类，聚类或自然语言处理之类的文本挖掘任务找到合适的算法，是测试研究人员能力的一种苛刻要求。本文还概述了文本挖掘过程，并对使用参数朴素贝叶斯算法和非参数深度学习神经网络生成的两个预测模型的性能和局限性进行了比较。 RapidMiner数据科学软件平台已用于模型的实现和电子邮件分类。

著录项

来源
《International Conference on Development and Application Systems》|2020年|208-213|共6页
会议地点
作者
Mihaela Chistol;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
text classification; text mining; machine learning; Naïve Bayes; neural network; performance evaluation.;

机译：文本分类;文本挖掘;机器学习;朴素贝叶斯;神经网络;性能评估;

相似文献

外文文献
中文文献
专利

1. Separability Analysis of Atlantic Forest Patches by Comparing Parametric and Non-Parametric Image Classification Algorithms [J] . Marcos Roberto Martines, Mariana de Paula Garcia Lúcio, Alexandre D. M. Cavagis, Journal of Geographic Information System . 2019,第5期

机译：通过比较参数和非参数图像分类算法对大西洋森林斑块的可分性分析
2. Supervised and semi-supervised learning in text classification using enhanced KNN algorithm: a comparative study of supervised and semi-supervised classification in text categorisation [J] . M. A. Wajeed, T. Adilakshmi International Journal of Intelligent Systems Technologies and Applications . 2012,第3a4期

机译：使用增强型KNN算法的文本分类中的有监督和半监督学习：文本分类中有监督和半监督分类的比较研究
3. Converting Non-parametric Distance-based Classification To Anytime Algorithms [J] . Xiaopeng Xi, Ken Ueno, Eamonn Keogh Dah-Jye Lee Pattern Analysis and Applications . 2008,第3a4期

机译：将非参数基于距离的分类转换为随时算法
4. A Comparative Study of Parametric Versus Non-Parametric Text Classification Algorithms [C] . Mihaela Chistol International Conference on Development and Application Systems . 2020

机译：参数与非参数文本分类算法的比较研究
5. A Study of Applying Machine Learning Algorithms in Application of Text Classification [D] . Lalluvadia, Megha. 2017

机译：机器学习算法在文本分类中的应用研究
6. Evaluation of the performance of traditional machine learning algorithms convolutional neural network and AutoML Vision in ultrasound breast lesions classification: a comparative study [O] . Ka Wing Wan, Chun Hoi Wong, Ho Fung Ip, 2021

机译：超声乳房病变中传统机器学习算法卷积神经网络和自动视力的性能评价：比较研究
7. Combining Parametric and Non-parametric Algorithms for a Partially Unsupervised Classification of Multitemporal Remote-Sensing Images [O] . Bruzzone Lorenzo, Cossu Roberto, Vernazza Gianni 2002

机译：结合参数和非参数算法的多时相遥感影像的部分无监督分类

A Comparative Study of Parametric Versus Non-Parametric Text Classification Algorithms

摘要

著录项

相似文献

相关主题

期刊订阅