Weakly Supervised User Profile Extraction from Twitter

机译：从Twitter弱监督的用户配置文件提取

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

While user attribute extraction on social media has received considerable attention, existing approaches, mostly supervised, encounter great difficulty in obtaining gold standard data and are therefore limited to predicting unary predicates (e.g., gender). In this paper, we present a weakly-supervised approach to user profile extraction from Twitter. Users' profiles from social media websites such as Facebook or Google Plus are used as a distant source of supervision for extraction of their attributes from user-generated text. In addition to traditional linguistic features used in distant supervision for information extraction, our approach also takes into account network information, a unique opportunity offered by social media. We test our algorithm on three attribute domains: spouse, education and job; experimental results demonstrate our approach is able to make accurate predictions for users' attributes based on their tweets.

机译：尽管社交媒体上的用户属性提取已受到相当大的关注，但是大多数方法都受到监督的现有方法在获取黄金标准数据时遇到了很大的困难，因此仅限于预测一元谓词（例如性别）。在本文中，我们提出了一种从Twitter提取用户档案的弱监督方法。来自社交媒体网站（如Facebook或Google Plus）的用户个人资料被用作远程监管来源，用于从用户生成的文本中提取其属性。除了用于远程监管以提取信息的传统语言功能外，我们的方法还考虑了网络信息，这是社交媒体提供的独特机会。我们在三个属性域上测试我们的算法：配偶，教育和工作;实验结果表明，我们的方法能够根据用户的推文对用户的属性做出准确的预测。

著录项

来源
《Annual meeting of the Association for Computational Linguistics》|2014年|165-174|共10页
会议地点
作者
Jiwei Li; Alan Ritter; Eduard Hovy;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. 'Less is more': Mining useful features from Twitter user profiles for Twitter user classification in the public health domain [J] . Online Information Review . 2020,第1期

机译：“少即是多”：从Twitter用户配置文件中挖掘有用的功能，以在公共卫生领域中对Twitter用户进行分类
2. An enhanced graph-based semi-supervised learning algorithm to detect fake users on Twitter [J] . BalaAnand M., Karthikeyan N., Karthik S., Journal of supercomputing . 2019,第9期

机译：一种基于图的增强型半监督学习算法，可检测Twitter上的虚假用户
3. Who is mentally healthy? Mental health profiles of Japanese social networking service users with a focus on LINE, Facebook, Twitter, and Instagram [J] . Ryota Sakurai, Yuta Nemoto, Hiroko Mastunaga, PLoS One . 2021,第3期

机译：谁在精神上健康？日本社交网络服务用户的心理健康概况，专注于线路，Facebook，Twitter和Instagram
4. Weakly Supervised User Profile Extraction from Twitter [C] . Jiwei Li, Alan Ritter, Eduard Hovy Annual meeting of the Association for Computational Linguistics . 2014

机译：从Twitter中弱监督的用户配置文件提取
5. Entity Extraction and Disambiguation in Short Text Using Wikipedia and Semantic User Profiles. [D] . Zendejas, Ignacio. 2014

机译：使用Wikipedia和语义用户配置文件在短文本中提取和消除歧义。
6. Who is mentally healthy? Mental health profiles of Japanese social networking service users with a focus on LINE Facebook Twitter and Instagram [O] . Ryota Sakurai, Yuta Nemoto, Hiroko Mastunaga, 2021

机译：谁是精神健康的？日本社交网络服务用户的心理健康概况专注于线路FacebookTwitter和Instagram
7. Weakly Supervised User Profile Extraction from Twitter [O] . Jiwei Li, Alan Ritter, Eduard Hovy 2015

机译：来自Twitter的弱监督用户档案提取
8. Relation Extraction with Weak Supervision and Distributional Semantics. [R] . Min, B. 2013

机译：弱监督与分布语义的关系抽取。

Weakly Supervised User Profile Extraction from Twitter

摘要

著录项

相似文献

相关主题

期刊订阅