Weakly Supervised User Profile Extraction from Twitter

机译：从Twitter中弱监督的用户配置文件提取

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

While user attribute extraction on social media has received considerable attention, existing approaches, mostly supervised, encounter great difficulty in obtaining gold standard data and are therefore limited to predicting unary predicates (e.g., gender). In this paper, we present a weakly-supervised approach to user profile extraction from Twitter. Users' profiles from social media websites such as Facebook or Google Plus are used as a distant source of supervision for extraction of their attributes from user-generated text. In addition to traditional linguistic features used in distant supervision for information extraction, our approach also takes into account network information, a unique opportunity offered by social media. We test our algorithm on three attribute domains: spouse, education and job; experimental results demonstrate our approach is able to make accurate predictions for users' attributes based on their tweets.

机译：虽然在社交媒体上的用户属性提取获得了相当大的关注，但现有的方法主要监督，遇到难以获得Gold标准数据的很大困难，因此仅限于预测一元谓词（例如，性别）。在本文中，我们提出了一种从Twitter提取的弱监督方法。来自Facebook或Google Plus等社交媒体网站的用户的配置文件被用作从用户生成的文本提取其属性的遥远的监督源。除了用于信息提取的遥远监督的传统语言特征外，我们的方法还考虑了网络信息，是社交媒体提供的独特机会。我们在三个属性域上测试我们的算法：配偶，教育和工作;实验结果表明，我们的方法能够基于推文对用户的属性进行准确的预测。

著录项

来源
《Annual meeting of the Association for Computational Linguistics》|2014年||共10页
会议地点
作者
Jiwei Li; Alan Ritter; Eduard Hovy;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类计算机软件;
关键词

相似文献

外文文献
中文文献
专利

1. 'Less is more': Mining useful features from Twitter user profiles for Twitter user classification in the public health domain [J] . Online Information Review . 2020,第1期

机译：“少即是多”：从Twitter用户配置文件中挖掘有用的功能，以在公共卫生领域中对Twitter用户进行分类
2. An enhanced graph-based semi-supervised learning algorithm to detect fake users on Twitter [J] . BalaAnand M., Karthikeyan N., Karthik S., Journal of supercomputing . 2019,第9期

机译：一种基于图的增强型半监督学习算法，可检测Twitter上的虚假用户
3. Who is mentally healthy? Mental health profiles of Japanese social networking service users with a focus on LINE, Facebook, Twitter, and Instagram [J] . Ryota Sakurai, Yuta Nemoto, Hiroko Mastunaga, PLoS One . 2021,第3期

机译：谁在精神上健康？日本社交网络服务用户的心理健康概况，专注于线路，Facebook，Twitter和Instagram
4. Weakly Supervised User Profile Extraction from Twitter [C] . Jiwei Li, Alan Ritter, Eduard Hovy Annual meeting of the Association for Computational Linguistics . 2014

机译：从Twitter弱监督的用户配置文件提取
5. Entity Extraction and Disambiguation in Short Text Using Wikipedia and Semantic User Profiles. [D] . Zendejas, Ignacio. 2014

机译：使用Wikipedia和语义用户配置文件在短文本中提取和消除歧义。
6. Who is mentally healthy? Mental health profiles of Japanese social networking service users with a focus on LINE Facebook Twitter and Instagram [O] . Ryota Sakurai, Yuta Nemoto, Hiroko Mastunaga, 2021

机译：谁是精神健康的？日本社交网络服务用户的心理健康概况专注于线路FacebookTwitter和Instagram
7. Weakly Supervised User Profile Extraction from Twitter [O] . Jiwei Li, Alan Ritter, Eduard Hovy 2015

机译：来自Twitter的弱监督用户档案提取
8. Relation Extraction with Weak Supervision and Distributional Semantics. [R] . Min, B. 2013

机译：弱监督与分布语义的关系抽取。

Weakly Supervised User Profile Extraction from Twitter

摘要

著录项

相似文献

相关主题

期刊订阅