首页> 外文期刊>Journal of Big Data Analytics in Transportation >Inferring Twitters' Socio-demographics to Correct Sampling Bias of Social Media Data for Augmenting Travel Behavior Analysis
【24h】

Inferring Twitters' Socio-demographics to Correct Sampling Bias of Social Media Data for Augmenting Travel Behavior Analysis

机译:推断嫁偶的社会人口统计数据,以纠正社交媒体数据的抽样偏见,以增加旅行行为分析

获取原文
获取原文并翻译 | 示例
           

摘要

Many studies demonstrated that social media data, especially Twitter data, have significant potentials to develop models for estimating travel demand, managing operation, and conducting long-term planning purposes. However, it is well known that research with social media data is facing a looming challenge in sampling bias. The Twitter user's population has huge discrepancies compared with the overall population. Therefore, social media data, when it is directly used for travel behavior analysis, contains biases and errors to some degree. The objective of this study is to correct sampling bias of Twitter data for travel behavior analysis by inferring Twitter users' socio-demographics. This study first links travelers' Twitter account with their Facebook account, and verifies their socio-demographics by Facebook data, assuming that one's Facebook information is real. Second, several models are proposed for predicting socio-demographics, including gender, age, ethnicity, and education levels. Afterward, this paper resamples social media data and compares it to the 2009 California Household Travel Survey data. The resampled data show comparable characteristics to the survey data. This research shed light on tackling sampling bias issues when social media data are incorporated for augmenting travel behavior analysis and urban planning.
机译:许多研究表明,社交媒体数据,尤其是Twitter数据,具有开发用于估算旅行需求,管理操作和进行长期规划目的的模型的潜力。然而,众所周知,与社交媒体数据的研究在采样偏见方面面临着迫在眉睫的挑战。与整体人口相比,Twitter用户的人口具有巨大的差异。因此,社交媒体数据,当它直接用于旅行行为分析时,在某种程度上包含偏差和错误。本研究的目的是通过推断Twitter用户的社会人口统计数据来纠正Twitter数据的抽样偏差。本研究首先将旅行者的Twitter帐户与他们的Facebook帐户联系起来,并通过Facebook数据验证他们的社会人口统计数据,假设一个人的Facebook信息是真实的。其次,提出了几种模型,用于预测社会人口统计数据,包括性别,年龄,种族和教育水平。之后,本文将社交媒体数据重新列出,并将其与2009年加州家庭旅游调查数据进行比较。重采样数据显示调查数据的可比特征。这项研究揭示了在社交媒体数据纳入增强旅行行为分析和城市规划时解决采样偏置问题。

著录项

  • 来源
  • 作者

    Yu Cui; Qing He;

  • 作者单位

    Department of Civil Structural and Environmental Engineering University at Buffalo The State University of New York Buffalo NY 14260 USA;

    Department of Civil Structural and Environmental Engineering University at Buffalo The State University of New York Buffalo NY 14260 USA Department of Industrial and Systems Engineering University at Buffalo The State University of New York Buffalo NY 14260 USA Key Laboratory of High-speed Railway Engineering of the Ministry of Education School of Civil Engineering Southwest Jiaotong University Chengdu 610031 China;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Social media data; Twitter; Socio-demographics; Sampling bias correction; Travel behavior;

    机译:社交媒体数据;推特;社会人口统计学;抽样偏压校正;旅行行为;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号