首页> 外文会议>IEEE International Conference on Intelligent Transportation Systems >Complementing Travel Diary Surveys with Twitter Data: Application of Text Mining Techniques on Activity Location, Type and Time
【24h】

Complementing Travel Diary Surveys with Twitter Data: Application of Text Mining Techniques on Activity Location, Type and Time

机译:用Twitter数据补充旅行日记调查:文本挖掘技术在活动位置,类型和时间上的应用

获取原文

摘要

A growing body of literature in social science has been devoted to extracting new information from social media to assist authorities in manage crowd projects. In this paper geolocation (or spatial) based information provided in social media is investigated to utilize intelligent transportation services. Further, the general trend of travel activities during weekdays is studied. For this purpose, a dataset consisting of more than 40,000 tweets in south and west part of the Sydney metropolitan area is utilized. After a data processing effort, the tweets are clustered into seven main categories using text mining techniques, where each category represents a type of activity including shopping, recreation, and work. Unlike the previous studies in this area, the focus of this work is on the content of the tweets rather than only using geotagged data or sentiment analysis. Beside activity type, temporal and spatial distributions of activities are used in the classification exercise. Categories are mapped to the identified regions within the city of Sydney across four time slots (two peak periods and two off-peak periods). Each time slot is used to construct a network with nodes representing people, activities and locations and edges reflecting the association between the nodes. The constructed networks are used to study the trend of activities/locations in a typical working day.
机译:越来越多的社会科学文献致力于从社交媒体中提取新信息,以协助当局管理人群项目。本文研究了社交媒体中基于地理位置(或空间)的信息,以利用智能交通服务。此外,研究了平日旅行活动的总体趋势。为此,使用了一个数据集,该数据集包含悉尼都会区南部和西部的40,000多条推文。经过数据处理之后,这些推文使用文本挖掘技术被分为七个主要类别,其中每个类别代表一种活动类型,包括购物,娱乐和工作。与以前在该领域的研究不同,这项工作的重点是推文的内容,而不是仅使用地理标记的数据或情感分析。除活动类型外,分类练习还使用活动的时间和空间分布。在四个时间段(两个高峰时段和两个非高峰时段)中,将类别映射到悉尼市内已识别的区域。每个时隙用于构建一个网络,其中的节点代表人员,活动,位置以及反映节点之间关联的边。构建的网络用于研究典型工作日活动/位置的趋势。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号