首页> 外文期刊>International Journal of Modern Physics, B. Condensed Matter Physics, Statistical Physics, Applied Physics >An improved algorithm for clustering uncertain traffic data streams based on Hadoop platform
【24h】

An improved algorithm for clustering uncertain traffic data streams based on Hadoop platform

机译:一种改进基于Hadoop平台的不确定交通数据流的改进算法

获取原文
获取原文并翻译 | 示例
           

摘要

During the development of intelligent transportation systems, traffic data has the characteristics of streaming, high dimension and uncertainty. In order to realize the query of uncertain traffic data streams in a distributed environment, the authors design the algorithm of Uncertain Traffic Data Stream Parallel Continuous Query algorithm (UTD-SPCQ). Firstly, the sliding window mode is applied to realize the data receiving and buffering in the data stream environment, so as to adapt to the MapReduce computing framework of the Hadoop distributed structure. Then, the impact of the high dimensionality and uncertainty of the data on the feature analysis of the dataset is reduced, through the dimension reduction and data rewriting. Finally, a multi-attribute data point RePoint is newly defined, to solve the problem of data dimension increase caused by data rewriting. Experiments show that this algorithm optimizes the traditional density-based clustering algorithm, and make it more adaptable to parallel continuous queries for uncertain traffic data streams, and can fully consider the newly generated streaming traffic data.
机译:在智能运输系统的发展期间,交通数据具有流媒体,高维度和不确定性的特点。为了实现分布式环境中不确定的交通数据流的查询,作者设计了不确定的交通数据流并行连续查询算法(UTD-SPCQ)的算法。首先,应用滑动窗口模式以实现数据流环境中的数据接收和缓冲,以便适应Hadoop分布式结构的MapReduce计算框架。然后,通过尺寸减少和数据重写,减少了高维度和数据对数据集特征分析的影响的影响。最后,新定义了多属性数据点重新调整,以解决数据重写引起的数据维度增加的问题。实验表明,该算法优化了传统的基于密度的聚类算法,使其更适应不确定交通数据流的并行连续查询,并且可以充分考虑新生成的流量数据。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号