首页> 外文会议>International Conference on Information Technology, Information Systems and Electrical Engineering >Trending Topic Classification for Single-Label Using Multinomial Naive Bayes (MNB) and Multi-Label Using K-Nearest Neighbors (KNN)
【24h】

Trending Topic Classification for Single-Label Using Multinomial Naive Bayes (MNB) and Multi-Label Using K-Nearest Neighbors (KNN)

机译:使用多项式朴素贝叶斯(MNB)的单标签和使用K最近邻(KNN)的多标签的趋势主题分类

获取原文

摘要

Trending Topic is one of the features found of Twitter in a short text. However, the short text used as a trending topic on Twitter sometimes confuses its users, so they need to be classified into several labels, but one tweet can have more than one label called multilable. The lables are politics, sports, entertainment, tourism, business, and other news. Another problem is the multi-labeling of classifications. Single-label will classify a trending topic into one label, while multi-label classifies into more than one label. This paper aimed to classify Twitter's trending topic using Multinomial Naive Bayes (MNB) for single-label data and K-Nearest Neighbor (KNN) for multi-label data. The steps were to collect trending topic data along with their tweets, labeling and text preprocessing, weighting TF-IDF, single-label classification using MNB and multi-label classification using KNN with the Binary Relevance approach, finally evaluation and analysis of results. By using K=3, the results show that KNN have 88.05% accuracy for multi-label data, whilst, MNB has a good result for single-label data 82.53% accuracy.
机译:趋势主题是Twitter的简短功能之一。但是,在Twitter上用作趋势主题的短文本有时会混淆其用户,因此需要将其分类为几个标签,但是一条推文可以包含一个以上的标签,称为multilable。标签是政治,体育,娱乐,旅游,商业和其他新闻。另一个问题是分类的多标签。单标签将趋势主题分类为一个标签,而多标签则分类为多个标签。本文旨在使用单标签数据的多项朴素贝叶斯(MNB)和多标签数据的K最近邻(KNN)对Twitter的趋势主题进行分类。这些步骤包括收集趋势主题数据及其推文,标签和文本预处理,加权TF-IDF,使用MNB的单标签分类以及使用Binary Relevance方法使用KNN的多标签分类,最后评估和分析结果。通过使用K = 3,结果表明KNN对多标签数据的准确性为88.05%,而MNB对单标签数据的准确性为82.53%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号