首页> 外文会议>Annual meeting of the Association for Computational Linguistics >Weakly Supervised User Profile Extraction from Twitter
【24h】

Weakly Supervised User Profile Extraction from Twitter

机译:从Twitter弱监督的用户配置文件提取

获取原文

摘要

While user attribute extraction on social media has received considerable attention, existing approaches, mostly supervised, encounter great difficulty in obtaining gold standard data and are therefore limited to predicting unary predicates (e.g., gender). In this paper, we present a weakly-supervised approach to user profile extraction from Twitter. Users' profiles from social media websites such as Facebook or Google Plus are used as a distant source of supervision for extraction of their attributes from user-generated text. In addition to traditional linguistic features used in distant supervision for information extraction, our approach also takes into account network information, a unique opportunity offered by social media. We test our algorithm on three attribute domains: spouse, education and job; experimental results demonstrate our approach is able to make accurate predictions for users' attributes based on their tweets.
机译:尽管社交媒体上的用户属性提取已受到相当大的关注,但是大多数方法都受到监督的现有方法在获取黄金标准数据时遇到了很大的困难,因此仅限于预测一元谓词(例如性别)。在本文中,我们提出了一种从Twitter提取用户档案的弱监督方法。来自社交媒体网站(如Facebook或Google Plus)的用户个人资料被用作远程监管来源,用于从用户生成的文本中提取其属性。除了用于远程监管以提取信息的传统语言功能外,我们的方法还考虑了网络信息,这是社交媒体提供的独特机会。我们在三个属性域上测试我们的算法:配偶,教育和工作;实验结果表明,我们的方法能够根据用户的推文对用户的属性做出准确的预测。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号