Why Gender and Age Prediction from Tweets is Hard: Lessons from a Crowdsourcing Experiment

机译：为什么很难通过推文进行性别和年龄预测：众包实验的经验教训

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

There is a growing interest in automatically predicting the gender and age of authors from texts. However, most research so far ignores that language use is related to the social identity of speakers, which may be different from their biological identity. In this paper, we combine insights from sociolinguistics with data collected through an online game, to underline the importance of approaching age and gender as social variables rather than static biological variables. In our game, thousands of players guessed the gender and age of Twitter users based on tweets alone. We show that more than 10% of the Twitter users do not employ language that the crowd associates with their biological sex. It is also shown that older Twitter users are often perceived to be younger. Our findings highlight the limitations of current approaches to gender and age prediction from texts.

机译：从文本自动预测作者的性别和年龄的兴趣日益浓厚。但是，到目前为止，大多数研究都忽略了语言的使用与说话者的社会身份有关，这可能与他们的生物学身份有所不同。在本文中，我们将社会语言学的见识与通过在线游戏收集的数据相结合，以强调将年龄和性别作为社会变量而非静态生物变量的重要性。在我们的游戏中，成千上万的玩家仅凭推文就猜出了Twitter用户的性别和年龄。我们显示，超过10％的Twitter用户没有使用人群与其性生活相关联的语言。还表明，老的Twitter用户通常被认为是年轻的。我们的发现凸显了当前通过文本预测性别和年龄的方法的局限性。

著录项

来源
《International conference on computational linguistics》|2014年|1950-1961|共12页
会议地点
作者
Dong Nguyen; Dolf Trieschnigg; A. Seza Dogruoez; Rilana Gravel; Marieet Theune; Theo Meder; Franciska de Jong;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. TED's Hard-Learned Lessons in Crowdsourcing [J] . Nilofer Merchant Harvard Business Review . 2013,第6期

机译：TED在众包方面的经验教训
2. Molecular Muscle Experiment: Hardware and Operational Lessons for Future Astrobiology Space Experiments [J] . Pollard Amelia K., Gaffney Christopher J., Deane Colleen S., Astrobiology . 2020,第8期

机译：分子肌实验：硬件和经营课程未来天体毒液空间实验
3. To Tweet or Not to Tweet? Lessons in Careful Use of Social Media [J] . Carmen Lawrence, Richard Walker, Alana L. Griffin, Intellectual property & technology law journal . 2019,第2期

机译：发还是不发？谨慎使用社交媒体的经验教训
4. Why Gender and Age Prediction from Tweets is Hard: Lessons from a Crowdsourcing Experiment [C] . Dong Nguyen, Dolf Trieschnigg, A. Seza Dogruoez, International conference on computational linguistics . 2014

机译：为什么Tweets的性别和年龄预测很难：来自众群实验的课程
5. A Natural Experiment about the Effects of Urbanization on Elders' Mental Well-Being and Chronic Disease Management: Lessons from China's Passive Urbanization [D] . Liang, Di 2017

机译：关于城市化对老年人心理健康和慢性病管理影响的自然实验：中国被动城市化的经验教训
6. Molecular Muscle Experiment: Hardware and Operational Lessons for Future Astrobiology Space Experiments [O] . Amelia K. Pollard, Christopher J. Gaffney, Colleen S. Deane, -1

机译：分子肌实验：硬件和经营课程未来天体毒液空间实验
7. Why gender and age prediction from tweets is hard: lessons from a crowdsourcing experiment [O] . Nguyen Dong, Trieschnigg Dolf, Dogruoz A. Seza, 2014

机译：为什么推文中的性别和年龄预测很难：众包实验的经验教训

Why Gender and Age Prediction from Tweets is Hard: Lessons from a Crowdsourcing Experiment

摘要

著录项

相似文献

相关主题

期刊订阅