首页> 外文会议>Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies >Political Issue Extraction Model: A Novel Hierarchical Topic Model That Uses Tweets By Political And Non-Political Authors
【24h】

Political Issue Extraction Model: A Novel Hierarchical Topic Model That Uses Tweets By Political And Non-Political Authors

机译:政治发行提取模型:一种使用政治和非政治作者推文的新型等级主题模型

获取原文

摘要

People often use social media to discuss opinions, including political ones. We refer to relevant topics in these discussions as political issues, and the alternate stands towards these topics as political positions. We present a Political Issue Extraction (PIE) model that is capable of discovering political issues and positions from an unlabeled dataset of tweets. A strength of this model is that it uses twitter timelines of political and non-political authors, and affiliation information of only political authors. The model estimates word-specific distributions (that denote political issues and positions) and hierarchical author/group-specific distributions (that show how these issues divide people). Our experiments using a dataset of 2.4 million tweets from the US show that this model effectively captures the desired properties (with respect to words and groups) of political discussions. We also evaluate the two components of the model by experimenting with: (a) Use to alternate strategies to classify words, and (b) Value addition due to incorporation of group membership information. Estimated distributions are then used to predict political affiliation with 68% accuracy.
机译:人们经常使用社交媒体讨论的意见,包括政治因素。我们把相关主题在这些讨论中的政治问题,且替代代表对这些议题的政治立场。我们提出了一个政治问题,提取(PIE)模型,该模型能够从鸣叫的未标记的数据集中发现政治问题和立场。这种模式的优点是,它使用了政治和非政治作者叽叽喳喳的时间表,只有政治作家的联系信息。该模型估计特定单词的版本(即分别表示政治问题和位置)和分层撰文/组特定分布(即展示这些问题如何划分人)。使用来自美国的240万个推特的数据集我们的实验表明,该模型有效地捕捉所需的性能(相对于词和组)政治讨论的。 (a)中使用,以替代的策略进行分类的话,和(b)值除了由于组播组成员信息掺入:我们还通过与实验评估模型的两个分量。估计分布随后被用来预测有68%的准确性政治派别。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号