首页> 外文会议>Annual IEEE International Systems Conference >An Ensemble Learning for Detecting Situational Awareness Tweets during Environmental Hazards
【24h】

An Ensemble Learning for Detecting Situational Awareness Tweets during Environmental Hazards

机译:一种在环境危害期间检测情境意识推文的集合学习

获取原文

摘要

The shift to social media platforms like Twitter during environmental hazards and emergencies has expanded recently. Yet, the classification of situational awareness tweet based on people post is a complicated process due to the high dimensionality of features. In this empirical study, A framework using machine learning and Natural Language Processing techniques was developed for two-stage binary classification of Twitter data. The First stage consists of four models: Random Forest, Support Vector Machine, Naive Bayes and Decision Trees. Whereas, the second stage includes an ensemble learning approach. Text features - TFIDF (term frequency, inverse document frequency), psychometric, and linguistic - were analyzed as predictors of binary classification to categorize each tweet as situational relevant or irrelevant automatically. A manually built and labeled dataset of 4,000 tweets were analyzed for situational awareness of environmental health hazards in Barbados from water, mosquito-borne diseases, and sewage during the period 2014 - 2018. Based on the experiment, our model was able to achieve over 85% accuracy on classifying tweets that contribute to situational awareness. Furthermore, the results indicate that applying ensemble learning in the second stage showed superior results compared to the combined features-based classification models.
机译:在环境危险期间,在环境危害和紧急情况下的推特等社交媒体平台的转变已经扩大。然而,基于人们职位的情境意识推文的分类是由于特征的高度的高度复杂的过程。在该实证研究中,开发了一种用于推特数据的两级二进制分类的机器学习和自然语言处理技术的框架。第一阶段由四种型号组成:随机森林,支持向量机,天真贝叶斯和决策树。虽然,第二阶段包括集合学习方法。文本特征 - TFIDF(术语频率,逆文档频率),心理学和语言 - 被分析为二进制分类的预测因子,以便自动将每个推文对每个推文进行分类或自动无关。在2014年至2018年期间,对巴巴多斯的巴巴多斯环境健康危害的情况进行了4,000条推文的一个手动制造和标记的数据集。根据实验,我们的模型能够实现超过85对促进态势意识的推文的%准确性。此外,结果表明,与基于特征的分类模型相比,在第二阶段应用集合学习显示出优异的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号