首页> 外文学位 >Learning-based Interactive Data Exploration
【24h】

Learning-based Interactive Data Exploration

机译:基于学习的交互式数据探索

获取原文
获取原文并翻译 | 示例

摘要

In this thesis, we propose that database systems should be augmented with an automated data exploration service that methodically steers users through the data in a meaningful way. Such an automated system is crucial for deriving insights from complex datasets found in many big data applications such as scientific and healthcare applications as well as for reducing the human effort of data exploration. Towards this end, we designed AIDE, an Automatic Interactive Data Exploration framework that assists users in discovering new interesting data patterns and eliminate expensive ad-hoc exploratory queries.;AIDE relies on a seamless integration of classification algorithms and data management optimization techniques that collectively strive to accurately learn the user interests based on his relevance feedback on strategically collected samples. We present a number of exploration techniques as well as optimizations that minimize the number of samples presented to the user while offering interactive performance. AIDE can deliver highly accurate query predictions for very common conjunctive queries with small user effort while, given a reasonable number of samples, it can predict with high accuracy complex disjunctive queries. It provides interactive performance as it limits the user wait time per iteration of exploration to less than a few seconds. Our user study also shows that AIDE improves the current state-of-the-art of manual exploration by significantly reducing the user effort and total exploration time.
机译:在本文中,我们建议应使用自动数据探索服务来扩展数据库系统,该服务将以有意义的方式有条理地引导用户浏览数据。这样的自动化系统对于从许多大数据应用程序(例如科学和医疗保健应用程序)中发现的复杂数据集获取洞察力以及减少人工数据探索工作至关重要。为此,我们设计了AIDE,这是一个自动交互式数据探索框架,可帮助用户发现新的有趣数据模式并消除昂贵的临时探索性查询。AIDE依靠分类算法和数据管理优化技术的无缝集成,共同努力根据他对策略性收集的样本的相关性反馈,准确地了解用户的兴趣。我们提供了许多探索技术和优化方法,这些方法和优化方法在提供交互式性能的同时,最大限度地减少了呈现给用户的样本数量。 AIDE可以用很少的工作量为非常常见的联合查询提供高度准确的查询预测,而在给定合理数量的样本的情况下,AIDE可以以高精度进行复杂的析取查询。它提供交互式性能,因为它将每次探索迭代的用户等待时间限制在几秒钟以内。我们的用户研究还表明,AIDE通过显着减少用户的工作量和总的探索时间,改善了当前的手动探索技术。

著录项

  • 作者

    Dimitriadou, Kyriaki.;

  • 作者单位

    Brandeis University.;

  • 授予单位 Brandeis University.;
  • 学科 Computer science.
  • 学位 Ph.D.
  • 年度 2018
  • 页码 128 p.
  • 总页数 128
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号