首页> 外文会议>International Conference on Interactive Digital Media >Hoax Information Detection System Using Apriori Algorithm and Random Forest Algorithm in Twitter
【24h】

Hoax Information Detection System Using Apriori Algorithm and Random Forest Algorithm in Twitter

机译:Hoax信息检测系统在推特中使用APRIORI算法和随机林算法

获取原文

摘要

This research is based on the disturbance faced by Twitter users, related to the distribution of hoax information in the text form. One of the efforts to overcome such problem is by building a system to detect hoax news in Twitter application. A set of information in text from on social media, can capture the use of language in written or verbal (corpus) form. Based on the advantages of Apriori algorithm which it is able to mine the text data from many used datasets and able to find the relation pattern or itemset combination in a database as a recommendation of the raised pattern. Moreover, the detection system also needs a method to classifying information based on the classes of hoax and non-hoax. One of the algorithms used is Random Forest algorithm, which it is able to combine several models of decision trees to eliminate the problems of overfitting. The purpose of this research, apart from implementing and integrating the Apriori algorithm and the Random Forest algorithm, is to make it easier for researchers to analyze and evaluate the system’s results to detect hoax information most optimal level of accuracy. The results show that the system can detect hoax and non-hoax news, whose data is integrated directly with the Twitter application. Accuracy level, precision and recall from the built system in detecting hoax news information reached 100% with minimum support value of 23.
机译:该研究基于Twitter用户面临的干扰,与文本形式的欺骗信息分发有关。克服此类问题的一项努力是通过构建一个系统来检测Twitter应用程序的恶作剧。来自社交媒体上的文本中的一组信息,可以以书面或口头(语料库)形式捕获语言的使用。基于APRiori算法的优点,它能够从许多使用的数据集中挖掘文本数据,并能够在数据库中找到关系模式或项目集合作为提出的模式的推荐。此外,检测系统还需要一种基于HOAX和非HOAX的类对信息进行分类的方法。使用的算法之一是随机森林算法,它能够组合多种决策树模型来消除过度装备的问题。该研究的目的是,除了实施和集成APRIORI算法和随机林算法之外,研究人员可以更轻松地分析和评估系统的结果,以检测HOAX信息最佳的准确度水平最佳。结果表明,该系统可以检测HOAX和非HOAX新闻,其数据与Twitter应用程序直接集成。在检测Hoax新闻信息中,精度等级,精度和召回从构建系统中达到100%,最小支持值为23。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号