首页> 外文会议>Annual meeting of the Association for Computational Linguistics >Weakly Supervised User Profile Extraction from Twitter
【24h】

Weakly Supervised User Profile Extraction from Twitter

机译:从Twitter中弱监督的用户配置文件提取

获取原文

摘要

While user attribute extraction on social media has received considerable attention, existing approaches, mostly supervised, encounter great difficulty in obtaining gold standard data and are therefore limited to predicting unary predicates (e.g., gender). In this paper, we present a weakly-supervised approach to user profile extraction from Twitter. Users' profiles from social media websites such as Facebook or Google Plus are used as a distant source of supervision for extraction of their attributes from user-generated text. In addition to traditional linguistic features used in distant supervision for information extraction, our approach also takes into account network information, a unique opportunity offered by social media. We test our algorithm on three attribute domains: spouse, education and job; experimental results demonstrate our approach is able to make accurate predictions for users' attributes based on their tweets.
机译:虽然在社交媒体上的用户属性提取获得了相当大的关注,但现有的方法主要监督,遇到难以获得Gold标准数据的很大困难,因此仅限于预测一元谓词(例如,性别)。在本文中,我们提出了一种从Twitter提取的弱监督方法。来自Facebook或Google Plus等社交媒体网站的用户的配置文件被用作从用户生成的文本提取其属性的遥远的监督源。除了用于信息提取的遥远监督的传统语言特征外,我们的方法还考虑了网络信息,是社交媒体提供的独特机会。我们在三个属性域上测试我们的算法:配偶,教育和工作;实验结果表明,我们的方法能够基于推文对​​用户的属性进行准确的预测。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号