首页> 外文会议>SIGMOD/PODS >Towards Keyword-Driven Analytical Processing
【24h】

Towards Keyword-Driven Analytical Processing

机译:向关键字驱动的分析处理

获取原文
获取外文期刊封面目录资料

摘要

Gaining business insights from data has recently been the focus of research and product development. OnLine-Analytical Processing(OLAP) tools provide elaborate query languages that allow users to group and aggregate data in various ways, and explore interesting trends and patterns in the data. However, the dynamic nature of today's data along with the overwhelming detail at which data is provided, make it nearly impossible to organize the data in a way that a business analyst needs for thinking about the data. In this paper, we introduce "Keyword-Driven Analytical Processing"( KDAP), which combines intuitive keyword-based search with the power of aggregation in OLAP without having to spend considerable effort in organizing the data in terms that the business analyst understands. Our design point is around a user mentality that we frequently encounter: "users don't know how to specify what they want, but they know it when they see it". We present our complete solution framework, which implements various phases from disambiguating the keyword terms to organizing and ranking the results in dynamic facets, that allow the user to explore efficiently the aggregation space. We address specific issues that analysts encounter, like joins, groupings and aggregations, and we provide efficient and scalable solutions. We show, how KDAP can handle both categorical and numerical data equally well and, finally, we demonstrate the generality and applicability of KDAP to two different aspects of OLAP, namely, finding exceptions or surprises in the data and finding bellwether regions where local aggregates are highly correlated with global aggregates, using various experiments on real data.
机译:最近获得数据的商业洞察力是研究和产品开发的重点。在线分析处理(OLAP)工具提供了详细的查询语言,允许用户以各种方式组成和聚合数据,并探索数据中的有趣趋势和模式。然而,当今数据的动态性质以及提供数据的压倒性细节,使得几乎不可能以商业分析师需要思考数据的方式组织数据。在本文中,我们介绍了“关键字驱动的分析处理”(KDAP),其将基于直观的关键字的搜索与OLAP中的聚合的功率相结合,而无需花费大量努力,以便在业务分析师理解的术语中组织数据。我们的设计点围绕着我们经常遇到的用户心态:“用户不知道如何指定他们想要的内容,但他们在看到它时知道它”。我们展示了我们的完整解决方案框架,它可以实现各种阶段,从消除关键字术语来组织和排列动态方面的结果,允许用户有效地探索聚合空间。我们解决了分析师遇到的特定问题,例如加入,分组和聚合,我们提供有效和可扩展的解决方案。我们展示了KDAP如何同样地处理分类和数值数据,最后,我们展示了KDAP对OLAP的两个不同方面的一般性和适用性,即,在数据中找到异常或惊喜,并找到局部聚集体的领导物区域使用各种实验与实际数据的各种实验与全局聚集体高度相关。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号