Learning Effective Query Management Strategies from Big Data

机译：从大数据中学习有效的查询管理策略

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The availability of big data collections, together with powerful hardware and software mechanisms to process them, gives nowadays the possibility to learn useful insights from data, which can be exploited for multiple purposes, including marketing, fault prevention, and so forth. However, it is also possible to learn important metadata that can suggest how data should be manipulated in several advanced operations. In this paper, we show the potentiality of learning from data by focusing on the problem of relaxing the results of database queries, that is, trying to return some approximated answer to a query when a result for it is unavailable in the database, and the system will return an empty answer set, or even worse, erroneous mismatch results. In particular, we introduce a novel approach to rewrite queries that are in disjunctive normal form and contain a mixture of discrete and continuous attributes. The approach preprocesses data collections to discover the implicit relationships that exist among the various domain attributes, and then uses this knowledge to rewrite the constraints from the failing query. In a first step, the approach tries to learn a set of functional dependencies from the data, which are ranked according to special mechanisms that will successively allow to predict the order in which the extracted dependencies have to be used to properly rewrite the failing query. An experimental evaluation of the approach on three real data sets shows its effectiveness in terms of robustness and coverage.

机译：如今，大数据集合的可用性以及处理它们的强大硬件和软件机制使人们有可能从数据中学习有用的见解，这些见解可用于多种目的，包括营销，故障预防等。但是，也可以学习重要的元数据，这些元数据可以建议应如何在几个高级操作中操作数据。在本文中，我们将重点放在放宽数据库查询结果的问题上，从而展示从数据中学习的潜力，即在数据库中没有查询结果的情况下，尝试向查询返回一些近似答案，以及系统将返回一个空的答案集，甚至更糟糕的是，错误的不匹配结果。特别是，我们引入了一种新颖的方法来重写处于析取范式且包含离散和连续属性的混合形式的查询。该方法对数据收集进行预处理，以发现各种域属性之间存在的隐式关系，然后使用此知识来重写失败查询中的约束。在第一步中，该方法尝试从数据中学习一组功能依赖项，这些功能依赖项将根据特殊机制进行排序，这些特殊机制将依次允许预测所提取的依赖项必须用来正确重写失败查询的顺序。在三个真实数据集上对该方法进行的实验评估表明，该方法在鲁棒性和覆盖范围方面均有效。

著录项

来源
《IEEE International Conference on Machine Learning and Applications》|2017年|643-648|共6页
会议地点
作者
Loredana Caruccio; Vincenzo Deufemia; Giuseppe Polese;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Databases; Data mining; Automobiles; Petroleum; Big Data; Metadata;

机译：数据库;数据挖掘;汽车;石油;大数据;元数据;

相似文献

外文文献
中文文献
专利

1. Data Mining for Discovering Effective Time-Series Transition of Learning Strategies on Mutual Viewing-Based Learning [J] . Yuto Omae, Tatsuro Furuya, Kazutaka Mizukoshi, Journal of Advanced Computatioanl Intelligence and Intelligent Informatics . 2018,第7a134期

机译：用于发现基于相互观看学习的有效时间序列转换的数据挖掘
2. Effective Communication as a Change Management Tool in Creating Awareness on Leadership Vision and Strategy: A Focus on Management of Student Academic Records at Institutions of Higher Learning [J] . Esther F. W. Nyagah Journal of Education and Practice . 2017,第26期

机译：有效的沟通作为一种变革管理工具，可以提高对领导力愿景和战略的意识：以管理高校学生的学业成绩为重点
3. Corrigendum: How and Why Do Students Use Learning Strategies? A Mixed Methods Study on Learning Strategies and Desirable Difficulties With Effective Strategy Users [J] . Sanne F. E. Rovers, Jeroen J. G. van Merri?nboer, Hans H. C. M. Savelberg, Frontiers in Psychology . 2020,第a期

机译：逆势：学生如何以及为什么使用学习策略？有效策略用户的学习策略和理想困难的混合方法研究
4. Learning Effective Query Management Strategies from Big Data [C] . Loredana Caruccio, Vincenzo Deufemia, Giuseppe Polese IEEE International Conference on Machine Learning and Applications . 2017

机译：学习大数据的有效查询管理策略
5. Learning effective and robust knowledge for semantic query optimization [D] . Hsu, Chun-Nan 1996

机译：学习有效和强大的知识以进行语义查询优化
6. Corrigendum: How and Why Do Students Use Learning Strategies? A Mixed Methods Study on Learning Strategies and Desirable Difficulties With Effective Strategy Users [O] . Sanne F. E. Rovers, Renée E. Stalmeijer, Jeroen J. G. van Merriënboer, 2020

机译：更正：学生如何以及为什么使用学习策略？有效策略使用者对学习策略和期望困难的混合方法研究
7. A Query Optimization Method Based on Hadoop-HANA Hybrid Data Management Strategy [O] . Jie Wang, Qiao Pan, Yuan Zhang, 2017

机译：基于Hadoop-Hana混合数据管理策略的查询优化方法
8. Learning Strategy Training Program: Paraphrasing Strategy for Effective Learning. [R] . Dansereau, D. F., Long, G. L., McDonald, B. A., 1975

机译：学习策略培训计划：有效学习的释义策略。

Learning Effective Query Management Strategies from Big Data

摘要

著录项

相似文献

相关主题

期刊订阅