首页> 外文会议>AAAI Conference on Artificial Intelligence >Predicting the Demographics of Twitter Users from Website Traffic Data
【24h】

Predicting the Demographics of Twitter Users from Website Traffic Data

机译:预测网站流量数据的推特用户的人口统计数据

获取原文

摘要

Understanding the demographics of users of online social networks has important applications for health, marketing, and public messaging. In this paper, we predict the demographics of Twitter users based on whom they follow. Whereas most prior approaches rely on a supervised learning approach, in which individual users are labeled with demographics, we instead create a distantly labeled dataset by collecting audience measurement data for 1,500 websites (e.g., 50% of visitors to gizmodo.com are estimated to have a bachelor's degree). We then fit a regression model to predict these demographics using information about the followers of each website on Twitter. The resulting average held-out correlation is .77 across six different variables (gender, age, ethnicity, education, income, and child status). We additionally validate the model on a smaller set of Twitter users labeled individually for ethnicity and gender, finding performance that is surprisingly competitive with a fully supervised approach.
机译:了解在线社交网络用户的人口对健康,市场营销和公共通讯重要的应用。在本文中,我们预测基础上,他们遵循人的Twitter用户的人口统计数据。虽然大多数现有的方法依赖于监督学习的方法,其中个人用户都标有人口统计,我们不是通过收集受众测量数据为1500级的网站(创建一个远亲标记数据集例如,以gizmodo.com游客的50%,估计有大学本科学历)。然后,我们拟合回归模型来预测使用约在Twitter上每个网站的追随者这些信息人口统计数据。所得到的平均持有了相关的0.77跨越六种不同的变量(性别,年龄,种族,教育程度,收入,子女状况)。我们还验证在一个较小的一套单独标记为种族和性别Twitter用户的模型,发现性能是一个完全监控的做法令人惊讶的竞争力。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号