...
首页> 外文期刊>IEEE Transactions on Intelligent Transportation Systems >A Novel Supervised Clustering Algorithm for Transportation System Applications
【24h】

A Novel Supervised Clustering Algorithm for Transportation System Applications

机译:运输系统应用中一种新型监督聚类算法

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

This paper proposes a novel supervised clustering algorithm to analyze large datasets. The proposed clustering algorithm models the problem as a matching problem between two disjoint sets of agents, namely, centroids and data points. This novel view of the clustering problem allows the proposed algorithm to be multi-objective, where each agent may have its own objective function. The proposed algorithm is used to maximize the purity and similarity in each cluster simultaneously. Our algorithm shows promising performance when tested using two different transportation datasets. The first dataset includes speed measurements along a section of Interstate 64 in the state of Virginia, while the second dataset includes the bike station status of a bike sharing system (BSS) in the San Francisco Bay Area. We clustered each dataset separately to examine how traffic and bike patterns change within clusters and then determined when and where the system would be congested or imbalanced, respectively. Using a spatial analysis of these congestion states or imbalance points, we propose potential solutions for decision makers and agencies to improve the operations of I-64 and the BSS. We demonstrate that the proposed algorithm produces better results than classical $k$ -means clustering algorithms when applied to our datasets with respect to a time event. The contributions of our paper are: 1) we developed a multi-objective clustering algorithm; 2) the algorithm is scalable (polynomial order), fast, and simple; and 3) the algorithm simultaneously identifies a stable number of clusters and clusters the data.
机译:本文提出了一种新颖的监督聚类算法来分析大型数据集。提出的聚类算法将该问题建模为两个不相交的代理集(质心和数据点)之间的匹配问题。对聚类问题的这种新颖观点使所提出的算法具有多目标性,其中每个代理可以具有自己的目标函数。所提出的算法用于同时最大化每个簇的纯度和相似性。当使用两个不同的运输数据集进行测试时,我们的算法显示出令人鼓舞的性能。第一个数据集包括沿弗吉尼亚州64号州际公路沿线的速度测量,而第二个数据集包括旧金山湾区的自行车共享系统(BSS)的自行车站状态。我们分别对每个数据集进行聚类,以检查聚类中的交通和自行车模式如何变化,然后分别确定系统何时以及在何处发生拥塞或不平衡。通过对这些拥塞状态或不平衡点的空间分析,我们为决策者和机构提出了改善I-64和BSS运营的潜在解决方案。我们证明,当将其应用于时间事件方面的数据集时,与传统的$ k $-均值聚类算法相比,该算法产生的结果更好。本文的贡献是:1)我们开发了一种多目标聚类算法; 2)该算法可扩展(多项式阶),快速且简单; 3)该算法同时识别稳定数目的聚类并将数据聚类。

著录项

  • 来源
  • 作者

  • 作者单位

    Virginia Tech Charles E Via Jr Dept Civil & Environm Engn Blacksburg VA 24061 USA|Virginia Tech Transportat Inst Ctr Sustainable Mobil Blacksburg VA 24061 USA|King Saud Univ Civil Engn Dept Riyadh 2890588 Saudi Arabia;

    Queensland Univ Technol Ctr Accid Res & Rd Safety Brisbane Qld 4059 Australia;

    Virginia Tech Charles E Via Jr Dept Civil & Environm Engn Blacksburg VA 24061 USA|Virginia Tech Transportat Inst Ctr Sustainable Mobil Blacksburg VA 24061 USA|Virginia Tech Bradley Dept Elect & Comp Engn Blacksburg VA 24061 USA;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Supervised clustering; high dimensional datasets and traffic operations; bike-sharing systems; urban computing; classification;

    机译:有监督的集群;高维数据集和交通运营;自行车共享系统;城市计算;分类;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号