首页> 外文会议>Workshop on natural language processing and computational social sciences >#WhoAmI in 160 Characters? Classifying Social Identities Based on Twitter Profile Descriptions
【24h】

#WhoAmI in 160 Characters? Classifying Social Identities Based on Twitter Profile Descriptions

机译:#WhoAmI中的160个字符?根据Twitter个人资料描述对社会身份进行分类

获取原文
获取原文并翻译 | 示例

摘要

We combine social theory and NLP methods to classify English-speaking Twitter users' online social identity in profile descriptions. We conduct two text classification experiments. In Experiment 1 we use a 5-category online social identity classification based on identity and self-categorization theories. While we are able to automatically classify two identity categories (Relational and Occupational), automatic classification of the other three identities (Political, Ethnic/religious and Stigmatized) is challenging. In Experiment 2 we test a merger of such identities based on theoretical arguments. We find that by combining these identities we can improve the predictive performance of the classifiers in the experiment. Our study shows how social theory can be used to guide NLP methods, and how such methods provide input to revisit traditional social theory that is strongly consolidated in offline settings.
机译:我们将社会理论和NLP方法结合起来,以在个人资料描述中对讲英语的Twitter用户的在线社会身份进行分类。我们进行了两个文本分类实验。在实验1中,我们使用基于身份和自我分类理论的5类在线社会身份分类。虽然我们能够自动对两个身份类别(关系和职业)进行分类,但对其他三个身份(政治,种族/宗教和污名化)的自动分类却具有挑战性。在实验2中,我们基于理论论证测试了这些身份的合并。我们发现,通过组合这些标识,可以提高实验中分类器的预测性能。我们的研究表明,社会理论可用于指导NLP方法,以及这些方法如何为重新审视在离线环境中得到巩固的传统社会理论提供输入。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号