首页> 外文会议>Conference on empirical methods in natural language processing >#WhoAmI in 160 Characters? Classifying Social Identities Based on Twitter Profile Descriptions
【24h】

#WhoAmI in 160 Characters? Classifying Social Identities Based on Twitter Profile Descriptions

机译:#whoami是160个字符?根据Twitter配置文件描述进行分类社会身份

获取原文

摘要

We combine social theory and NLP methods to classify English-speaking Twitter users' online social identity in profile descriptions. We conduct two text classification experiments. In Experiment 1 we use a 5-category online social identity classification based on identity and self-categorization theories. While we are able to automatically classify two identity categories (Relational and Occupational), automatic classification of the other three identities (Political, Ethnic/religious and Stigmatized) is challenging. In Experiment 2 we test a merger of such identities based on theoretical arguments. We find that by combining these identities we can improve the predictive performance of the classifiers in the experiment. Our study shows how social theory can be used to guide NLP methods, and how such methods provide input to revisit traditional social theory that is strongly consolidated in offline settings.
机译:我们将社会理论和NLP方法结合在一起,在简介描述中对英语的Twitter用户的在线社会身份进行分类。我们进行两个文本分类实验。在实验1中,我们根据身份和自我分类理论使用5类在线社会形式分类。虽然我们能够自动分类两种身份类别(关系和职业),但自动分类其他三个身份(政治,种族/宗教和侮辱)是挑战性的。在实验2中,我们根据理论参数测试这些身份的合并。我们发现,通过结合这些身份,我们可以提高实验中分类器的预测性能。我们的研究表明社会理论如何用于指导NLP方法,以及这些方法如何提供输入以重新审视在离线设置中强烈合并的传统社会理论。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号