...
首页> 外文期刊>Web Intelligence and Agent Systems >Life aspect inference of tweets based on probability distribution
【24h】

Life aspect inference of tweets based on probability distribution

机译:基于概率分布的推文生活方面推断

获取原文
获取原文并翻译 | 示例
           

摘要

Many people share their daily events and opinions on Twitter. Some tweets are beneficial and others are related to such aspects of a user's real-life as eating, traffic conditions, and weather. In this paper, we propose an inference method of the real-life aspect distribution of tweets using labeled tweets. Our method infers the aspect probability distributions by a hierarchical estimation framework (HEF), which is hierarchically composed of both unsupervised and supervised machine learning methods. In the first phase, it extracts topics from a sea of tweets using Latent Dirichlet Allocation (LDA). In the second phase, it builds associations between topics and real-life aspects using a small set of labeled tweets. The probability distribution of aspects is inferred using the associations based on the bag of terms extracted from unknown tweets. Our sophisticated experimental evaluations with a large amount of actual tweets demonstrate the high efficiency and robustness of our inference method. Especially in the case of single-label training, HEF showed significantly lower JSD values than other baseline methods, such as Naive Bayes, SVM, and L-LDA.
机译:许多人在Twitter上分享他们的日常事件和意见。一些推文是有益的,而其他推文与用户的现实生活中的饮食,交通状况和天气等方面有关。在本文中,我们提出了一种使用标签推文推论推文的实际方面分布的推断方法。我们的方法通过层次估计框架(HEF)来推断方面概率分布,该框架由无监督和有监督的机器学习方法分层组成。在第一阶段,它使用潜在狄利克雷分配(LDA)从大量推文中提取主题。在第二阶段,它使用一小组带标签的推文在主题​​和现实生活之间建立关联。使用基于从未知推文中提取的术语包的关联来推断方面的概率分布。我们对大量实际推文的复杂实验评估表明,我们的推理方法具有很高的效率和鲁棒性。特别是在单标签训练的情况下,HEF显示出的JSD值明显低于其他基准方法,例如Naive Bayes,SVM和L-LDA。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号