首页> 外文会议>International conference on very large data bases >An IDEA: An gestion Framework for Data Enrichment in AsterixDB

【24h】

An IDEA: An gestion Framework for Data Enrichment in AsterixDB

机译：IDEA：AsterixDB中用于数据丰富的/ gestation框架

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Big Data today is being generated at an unprecedented rate from various sources such as sensors, applications, and devices, and it often needs to be enriched based on other reference information to support complex analytical queries. Depending on the use case, the enrichment operations can be compiled code, declarative queries, or machine learning models with different complexities. For enrichments that will be frequently used in the future, it can be advantageous to push their computation into the ingestion pipeline so that they can be stored (and queried) together with the data. In some cases, the referenced information may change over time, so the ingestion pipeline should be able to adapt to such changes to guarantee the currency and/or correctness of the enrichment results. In this paper, we present a new data ingestion framework that supports data ingestion at scale, enrichments requiring complex operations, and adaptiveness to reference data changes. We explain how this framework has been built on top of Apache AsterixDB and investigate its performance at scale under various workloads.

机译：如今，大数据正以前所未有的速度从传感器，应用程序和设备等各种来源生成，并且通常需要基于其他参考信息来丰富大数据以支持复杂的分析查询。根据使用情况，扩展操作可以是编译代码，声明性查询或具有不同复杂性的机器学习模型。对于将来将经常使用的浓缩，将其计算推入摄取管道以使它们可以与数据一起存储（和查询）可能是有利的。在某些情况下，参考信息可能会随时间而变化，因此，摄入流水线应该能够适应这种变化，以保证浓缩结果的准确性和/或正确性。在本文中，我们提出了一个新的数据摄取框架，该框架支持大规模的数据摄取，需要复杂操作的扩充以及对参考数据更改的适应性。我们将说明如何在Apache AsterixDB的基础上构建此框架，并在各种工作负载下大规模研究其性能。

著录项

来源
《International conference on very large data bases 》|2019年|1485-1498|共14页
会议地点
作者
Xikui Wang; Michael J. Carey;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Road Data Enrichment Framework Based on Heterogeneous Data Fusion for ITS [J] . Rettore Paulo H. L., Santos Bruno P., Lopes Roberto Rigolin F., IEEE Transactions on Intelligent Transportation Systems . 2020 ,第4期

机译：基于异构数据融合的道路数据丰富框架
2. Enriching linguistic descriptions of data: A framework for composite protoforms [J] . Ramos-Soto A., Martin-Rodilla P. Fuzzy sets and systems . 2021 ,第Mara1期

机译：丰富数据的语言描述：复合原生植物的框架
3. A semantic framework for textual data enrichment [J] . Gutierrez Yoan, Vazquez Sonia, Montoyo Andres Expert Systems with Application . 2016 ,第Sepa期

机译：文本数据丰富化的语义框架
4. An IDEA: An /ngestion Framework for Data Enrichment in AsterixDB [C] . Xikui Wang, Michael J. Carey International conference on very large data bases . 2019

机译：一个想法：AsterixDB中的数据丰富的AN / NOVELION框架
5. Enhancing Apache AsterixDB for Efficient Big Data Search and Analytics [D] . Kim, Taewoo. 2018

机译：增强Apache AsterixDB以进行有效的大数据搜索和分析
6. International Fertility Change: New Data and Insights from the Developmental Idealism Framework [O] . Arland Thornton, Georgina Binstock, Kathryn M. Yount, -1

机译：国际土壤肥力变化：新数据并从发展理想主义框架见解
7. Figure 1 from: Vanderhoeven S, Adriaens T, Desmet P, Strubbe D, Backeljau T, Barbier Y, Brosens D, Cigar J, Coupremanne M, De Troch R, Eggermont H, Heughebaert A, Hostens K, Huybrechts P, Jacquemart A, Lens L, Monty A, Paquet J, Prévot C, Robertson T, Termonia P, Van De Kerchove R, Van Hoey G, Van Schaeybroeck B, Vercayie D, Verleye T, Welby S, Groom Q (2017) Tracking Invasive Alien Species (TrIAS): Building a data-driven framework to inform policy. Research Ideas and Outcomes 3: e13414. https://doi.org/10.3897/rio.3.e13414 [O] . Vanderhoeven, Sonia, Adriaens, Tim, Desmet, Peter, 2017

机译：图1来自：范德霍芬（Vanderhoeven S），阿德里亚恩斯（Adriaens T），迪斯美（Desmet P），斯特鲁贝（Strubbe）D，巴克耶（Backeljau）T，巴比耶（Barbier）Y，布罗森斯（Drosens）D，雪茄（Cougarmanne）M，德Troch R，埃格蒙特（Eggermont H），休格堡（Hughebaert）A，Hostens K，休伊布列茨（Huybrechts）P，雅克马尔（Jacquemart）A， Lens L，Monty A，Paquet J，PrévotC，Robertson T，Termonia P，Van De Kerchove R，Van Hoey G，Van Schaeybroeck B，Vercayie D，Verleye T，Welby S，Groom Q（2017）追踪外来入侵物种（ TrIAS）：建立一个数据驱动的框架来告知政策。研究思路与成果3：e13414。 https://doi.org/10.3897/rio.3.e13414

An IDEA: An gestion Framework for Data Enrichment in AsterixDB

摘要

著录项

相似文献

相关主题

期刊订阅