首页> 外文会议>International Parallel and Distributed Processing Symposium >A Services Oriented Framework for Next Generation Data Analysis Centers
【24h】

A Services Oriented Framework for Next Generation Data Analysis Centers

机译:以下一代数据分析中心为导向的服务框架

获取原文

摘要

Over the past decade, advances in computational and sensor technology have enabled us to dynamically collect vast amounts of data from observations, health screening tests, simulations, and experiments at an ever-increasing pace. Knowledge discovery and data mining is an iterative process concerned with deriving interesting, non-obvious, and useful patterns and models from such large volumes of data. Although inexpensive storage is conducive to maintaining said data, accessing and managing it for knowledge discovery and data mining becomes a performance issue when datasets are large, dynamic, and distributed. In this work, we present our vision of a software framework consisting of middleware services to support interactive data mining over dynamic data at data analysis centers built on top of heterogeneous clusters. The design of a sampling service for dynamic data, together with initial performance results, are also presented.
机译:在过去的十年中,计算和传感器技术的进步使我们能够以不断增加的速度动态地从观察,健康筛查测试,模拟和实验中收集大量数据。知识发现和数据挖掘是一个迭代过程,有关导出来自如此大量数据的有趣,不明显和有用的模式和模型。虽然廉价的存储是有利于维护所述数据,但是在数据集大型,动态和分布式的数据集时,访问和管理它在知识发现和数据挖掘成为一个性能问题。在这项工作中,我们展示了由中间件服务组成的软件框架的愿景,以支持在异构集群顶部的数据分析中心的动态数据上挖掘交互式数据。还介绍了动态数据的采样服务的设计以及初始性能结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号