首页> 外文会议>SIGMOD/PODS 2007 >Towards Keyword-Driven Analytical Processing
【24h】

Towards Keyword-Driven Analytical Processing

机译:迈向关键字驱动的分析处理

获取原文

摘要

Gaining business insights from data has recently been the focus of research and product development. OnLine-Analytical Processing(OLAP) tools provide elaborate query languages that allow users to group and aggregate data in various ways, and explore interesting trends and patterns in the data. However, the dynamic nature of today's data along with the overwhelming detail at which data is provided, make it nearly impossible to organize the data in a way that a business analyst needs for thinking about the data. In this paper, we introduce "Keyword-Driven Analytical Processing"( KDAP), which combines intuitive keyword-based search with the power of aggregation in OLAP without having to spend considerable effort in organizing the data in terms that the business analyst understands. Our design point is around a user mentality that we frequently encounter: "users don't know how to specify what they want, but they know it when they see it". We present our complete solution framework, which implements various phases from disambiguating the keyword terms to organizing and ranking the results in dynamic facets, that allow the user to explore efficiently the aggregation space. We address specific issues that analysts encounter, like joins, groupings and aggregations, and we provide efficient and scalable solutions. We show, how KDAP can handle both categorical and numerical data equally well and, finally, we demonstrate the generality and applicability of KDAP to two different aspects of OLAP, namely, finding exceptions or surprises in the data and finding bellwether regions where local aggregates are highly correlated with global aggregates, using various experiments on real data.
机译:从数据中获得业务洞察力最近一直是研究和产品开发的重点。在线分析处理(OLAP)工具提供了详尽的查询语言,允许用户以各种方式对数据进行分组和汇总,并探索数据中有趣的趋势和模式。但是,当今数据的动态性质以及提供数据的压倒性细节,使得几乎不可能以业务分析师考虑数据的方式来组织数据。在本文中,我们介绍了“关键字驱动的分析处理”(KDAP),它将基于关键字的直观搜索与OLAP中的聚合功能结合在一起,而无需花费大量精力来组织业务分析师理解的数据。我们的设计要点是围绕着我们经常遇到的用户心态:“用户不知道如何指定他们想要的东西,但是他们在看到它们时就知道了”。我们提供了完整的解决方案框架,该框架实施了从消除关键字词歧义到在动态构面中对结果进行组织和排名的各个阶段,使用户可以有效地探索聚合空间。我们解决了分析师遇到的特定问题,例如联接,分组和聚合,并且我们提供了高效且可扩展的解决方案。我们展示了KDAP如何能够同样好地处理分类数据和数值数据,最后,我们展示了KDAP在OLAP的两个不同方面的普遍性和适用性,即在数据中查找异常或意外情况以及查找局部聚集点所在的领头区域。通过对真实数据进行各种实验,与全球总量高度相关。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号