Why Gender and Age Prediction from Tweets is Hard: Lessons from a Crowdsourcing Experiment

Abstract

There is a growing interest in automatically predicting the gender and age of authors from texts. However, most research so far ignores that language use is related to the social identity of speakers, which may be different from their biological identity. In this paper, we combine insights from sociolinguistics with data collected through an online game, to underline the importance of approaching age and gender as social variables rather than static biological variables. In our game, thousands of players guessed the gender and age of Twitter users based on tweets alone. We show that more than 10% of the Twitter users do not employ language that the crowd associates with their biological sex. It is also shown that older Twitter users are often perceived to be younger. Our findings highlight the limitations of current approaches to gender and age prediction from texts.

Bibliographic record

Source
International Conference on Computational Linguistics | 2014 | 12 pages
Authors
Dong Nguyen; Dolf Trieschnigg; A. Seza Doğruöz; Rilana Gravel; Mariët Theune; Theo Meder; Franciska de Jong
Original format: PDF
Chinese Library Classification: program design, software engineering