首页> 外文会议>International Conference on Communication, Computing and Electronics Systems >News Topic Classification Using Machine Learning Techniques
【24h】

News Topic Classification Using Machine Learning Techniques

机译:新闻主题分类使用机器学习技术

获取原文

摘要

News topic classification is a method of classifying news articles available in text data into some predefined classes or labels. This is one of the applications of text classification. Text classification can be applied in the fields of spam filtering, language recognition, segmenting customer feedbacks, segregating technical documents, etc. This paper discusses news topic classification on AG's News Topic Classification Dataset using machine learning algorithms such as linear support vector machine, multinomial Naive Bayesian classifier, K-Nearest Neighbor, Rocchio, bagging, and boosting. This paper discusses three steps for classification, namely pre-processing of text, then applying feature extraction techniques, and finally implementing machine learning algorithms. These algorithms are compared using evaluation metrics like Accuracy, Recall, Precision, and F1 Score.
机译:新闻主题分类是一种将文本数据中可用的新闻文章分类为某些预定义的类或标签的方法。 这是文本分类的应用之一。 文本分类可以应用于垃圾邮件过滤,语言识别,分段客户反馈,隔离技术文件等领域。本文讨论了使用机器学习算法(如线性支持向量机,多项式Naive等机器学习算法)上的新闻主题分类数据集 贝叶斯分类器,k最近邻居,rocchio,bagging和boosting。 本文讨论了分类三个步骤,即文本的预处理,然后应用特征提取技术,最后实现机器学习算法。 使用评估度量,如准确度,召回,精度和F1分数进行比较这些算法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号