首页> 外文会议>2010 12th IEEE International Conference on High Performance Computing and Communications >A Simple and Efficient Parallel Approach to Large-Scale Railway Freight Data Analysis
【24h】

A Simple and Efficient Parallel Approach to Large-Scale Railway Freight Data Analysis

机译:大规模铁路货运数据分析的一种简单有效的并行方法

获取原文

摘要

Analysis of rail freight data is essential to make a better strategy for railway freight traffic organization. However, rail freight data are national-widely stored in data center of railway bureaus. Furthermore, the size of the data is very large and grows rapidly. Although analytical data processing has received much attention these years, little attention is paid to transparently processing data stored in remote data sources, particularly where the data are generated. In this paper, we present a simple and efficient parallel approach, RFDPS, to large-scale rail freight data processing. RFDPS provides a novel virtualization data model to integrate geographically distributed freight data. Besides, it can parallelize and optimize data processing automatically across multiple data center of railway bureaus. We run typical rail freight analysis applications in production environment to evaluate the efficiency of RFDPS. The results show that the performance of RFDPS is improved exponentially compared to the approach Chinaȁ9;s Ministry of Railways adopted before.
机译:铁路货运数据的分析对于为铁路货运组织制定更好的策略至关重要。但是,铁路货运数据在全国范围内存储在铁路局的数据中心中。此外,数据的大小非常大,并且增长迅速。尽管这些年来分析数据处理引起了广泛关注,但很少关注透明处理存储在远程数据源中的数据,尤其是在生成数据的地方。在本文中,我们提出了一种简单有效的并行方法RFDPS,用于大规模铁路货运数据处理。 RFDPS提供了一种新颖的虚拟化数据模型,以集成地理分布的货运数据。此外,它可以跨铁路局的多个数据中心自动并行化和优化数据处理。我们在生产环境中运行典型的铁路货运分析应用程序,以评估RFDPS的效率。结果表明,与中国铁道部以前采用的方法相比,RFDPS的性能成倍提高。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号