首页> 外文期刊>International journal of information systems for crisis response and management >Domain Adaptation for Crisis Data Using Correlation Alignment and Self-Training
【24h】

Domain Adaptation for Crisis Data Using Correlation Alignment and Self-Training

机译:使用关联对齐和自训练对危机数据进行域自适应

获取原文
获取原文并翻译 | 示例
       

摘要

Domain adaptation methods have been introduced for auto-filtering disaster tweets to address the issue of lacking labeled data for an emerging disaster. In this article, the authors present and compare two simple, yet effective approaches for the task of classifying disaster-related tweets. The first approach leverages the unlabeled target disaster data to align the source disaster distribution to the target distribution, and, subsequently, learns a supervised classifier from the modified source data. The second approach uses the strategy of self-training to iteratively label the available unlabeled target data, and then builds a classifier as a weighted combination of source and target-specific classifiers. Experimental results using Naive Bayes as the base classifier show that both approaches generally improve performance as compared to baseline. Overall, the self-training approach gives better results than the alignment-based approach. Furthermore, combining correlation alignment with self-training leads to better result, but the results of self-training are still better.
机译:已经引入了域自适应方法来自动过滤灾难消息,以解决针对正在出现的灾难而缺少标记数据的问题。在本文中,作者介绍并比较了两种简单而有效的方法来对与灾难相关的推文进行分类。第一种方法利用未标记的目标灾难数据将源灾难分布与目标分布对齐,然后从修改后的源数据中学习监督分类器。第二种方法使用自我训练的策略来迭代标记可用的未标记目标数据,然后将分类器构建为源分类器和目标特定分类器的加权组合。使用朴素贝叶斯作为基础分类器的实验结果表明,与基线相比,这两种方法通常都会提高性能。总的来说,自训练方法比基于对齐的方法提供更好的结果。此外,将相关对齐与自我训练相结合可获得更好的结果,但自我训练的结果仍然更好。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号