首页> 外文会议>International conference on web engineering >Distributed Stream Consistency Checking
【24h】

Distributed Stream Consistency Checking

机译:分布式流一致性检查

获取原文

摘要

Dealing with noisy data is one of the big issues in stream processing. While noise has been widely studied in settings where streams have simple schemas, e.g. time series, few solutions focused on streams characterized by complex data structures. This paper studies how to check consistency over large amounts of complex streams. Our proposed methods exploit reasoning to assess if portions of the streams are compliant to a reference conceptual model. To achieve scalability, our methods run on state-of-the-art distributed stream processing platforms, e.g. Apache Storm or Twitter Heron. Our first method computes the closure of Negative Inclusions (NIs) for DL-Lite ontologies and registers the NIs as queries. The second method compiles the ontology into a processing pipeline to evenly distribute the workload. Experiments compares the two methods and show that the second one improves the throughput up to 139% with the LUBM ontology and 330% with the NPD ontology.
机译:处理嘈杂的数据是流处理中的主要问题之一。虽然在流具有简单模式的设置中对噪声进行了广泛研究,例如时间序列上,很少有解决方案专注于以复杂数据结构为特征的流。本文研究了如何检查大量复杂流的一致性。我们提出的方法利用推理来评估部分流是否符合参考概念模型。为了实现可扩展性,我们的方法在最先进的分布式流处理平台上运行,例如Apache Storm或Twitter Heron。我们的第一种方法为DL-Lite本体计算否定包含(NI)的闭合并将其注册为查询。第二种方法将本体编译到处理管道中以平均分配工作负载。实验对这两种方法进行了比较,结果表明,第二种方法使用LUBM本体将吞吐量提高了139%,使用NPD本体将吞吐量提高了330%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号