【24h】

Scalable Infrastructures for Data in Motion

机译:运动数据的可扩展基础架构

获取原文

摘要

Analytics applications for reporting and human interaction with big data rely upon scalable frameworks for data ingest, storage, and computation. Batch processing of analytic workloads increases latency of results and can perform redundant computation. In real-world applications, new data points are continuously arriving and a suite of algorithms must be updated to reflect the changes. Reducing the latency of re-computation by keeping algorithms online and up-to-date enables fast query, experimentation, and drill-down. In this paper, we share our experiences designing and implementing scalable infrastructure around No SQL databases for social media analytics applications. We propose a new heterogeneous architecture and execution model for streaming data applications that focuses on throughput and modularity.
机译:用于报告和与大数据进行人机交互的分析应用程序依赖于可扩展的框架来进行数据提取,存储和计算。解析工作负载的批处理会增加结果的延迟,并且可以执行冗余计算。在实际应用中,不断有新的数据点到达,必须更新一组算法以反映变化。通过使算法保持在线和最新状态来减少重新计算的等待时间,可以实现快速查询,实验和下钻。在本文中,我们分享了围绕社交媒体分析应用程序设计和实施No SQL数据库的可扩展基础结构的经验。我们为流数据应用程序提出了一种新的异构体系结构和执行模型,该模型着重于吞吐量和模块化。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号