首页> 外文会议>IEEE International Parallel and Distributed Processing Symposium >NEPTUNE: Real Time Stream Processing for Internet of Things and Sensing Environments
【24h】

NEPTUNE: Real Time Stream Processing for Internet of Things and Sensing Environments

机译:海王星:用于物联网和传感环境的实时流处理

获取原文

摘要

Improvements in miniaturization and networking capabilities of sensors have contributed to the proliferation of Internet of Things (IoT) and continuous sensing environments. Data streams generated in such settings must keep pace with generation rates and be processed in real time. Challenges in accomplishing this include: high data arrival rates, buffer overflows, context-switches, and object creation overheads. We propose a holistic framework that addresses the CPU, memory, network, and kernel issues involved in stream processing. Our prototype, Neptune, builds on our Granules cloud runtime. The framework maximizes bandwidth utilization in the presence of small messages via the use of buffering and dynamic compactions of packets based on payload entropy. Our use of thread-pools and batched processing reduces context switches and improves effective CPU utilizations. NEPTUNE alleviates memory pressure that can lead to swapping, page faults, and thrashing through efficient reuse of objects. To cope with buffer overflows we rely on flow control and throttling the preceding stages of a processing pipeline. Our benchmarks demonstrate the suitability of the Neptune and we contrast our performance with Apache Storm, the dominant stream-processing framework developed by Twitter. At a single node, we are able to achieve a processing rate of ~2 million stream packets per-second. In a distributed setup, we achieved a rate of ~100 million packets per-second.
机译:传感器的小型化和网络能力的改进有助于互联网(物联网)和连续传感环境的扩散。在这种设置中生成的数据流必须与生成速率保持速度并实时处理。完成此内容的挑战包括:高数据到达速率,缓冲区溢出,上下文交换机和对象创建开销。我们提出了一个全面框架,解决了流处理中涉及的CPU,内存,网络和内核问题。我们的原型,海王星,在我们的颗粒云运行时构建。框架通过使用基于有效载荷熵的数据包的缓冲和动态作品,可以在存在小消息时最大化带宽利用率。我们使用线程池和批处理处理可减少上下文切换并提高有效的CPU利用率。海王星减轻了可以通过有效再利用物体交换,页面断层和延长的记忆压力。为了应对缓冲区溢出,我们依靠流量控制并限制处理管道的前一级。我们的基准测试展示了海王星的适用性,并将我们与Apache Storm的表现造影,Twitter开发的主要流处理框架。在单个节点,我们能够实现每秒约200万流数据包的处理速率。在分布式设置中,我们实现了每秒〜100万包的速率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号