首页> 外文会议>SIGMOD/PODS >Supporting Ranking and Clustering as Generalized Order-By and Group-By*

【24h】

Supporting Ranking and Clustering as Generalized Order-By and Group-By*

机译：支持排名和群集作为广义order-by和by *

获取原文

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

The Boolean semantics of SQL queries cannot adequately capture the "fuzzy" preferences and "soft" criteria required in non-traditional data retrieval applications. One way to solve this problem is to add a flavor of "information retrieval" into database queries by allowing fuzzy query conditions and exibly supporting grouping and ranking of the query results within the DBMS engine. While ranking is already supported by all major commercial DBMSs natively, support of flexibly grouping is still very limited (I.e., group-by). In this paper, we propose to generalize group-by to enable exible grouping (clustering specifically) of the query results. Different from clustering in data mining applications, our focus is on supporting efficient clustering of Boolean results generated at query time. Moreover, we propose to integrate ranking and clustering with Boolean conditions, forming a new type of ClusterRank query to allow structured data retrieval. Such an integration is nontrivial in terms of both semantics and query processing. We investigate various semantics of this type of queries. To process such queries, a straightforward approach is to simply glue the techniques developed for ranking-only and clustering-only together. This approach is costly since both ranking and clustering are treated as blocking post-processing tasks upon Boolean query results by existing techniques. We propose a summary-based evaluation method that utilizes bitmap index to seamlessly integrate Boolean conditions, clustering, and ranking. Experimental study shows that our approach significantly outperforms the straightforward one and maintains high clustering quality.

机译：SQL查询的布尔语义无法充分捕获非传统数据检索应用中所需的“模糊”首选项和“软”标准。解决此问题的一种方法是通过允许模糊查询条件和令人难以置信地支持DBMS引擎内的查询结果来将“信息检索”的风格添加到数据库查询中。虽然所有主要商业DBMS本地支持排名，但对灵活分组的支持仍然非常有限（即，逐组）。在本文中，我们建议概括组 - 以便启用查询结果的可用分组（具体群集）。与数据挖掘应用程序中的聚类不同，我们的重点是支持在查询时间生成的布尔结果的有效聚类。此外，我们建议将排名和聚类与布尔条件集成，形成新型的ClusterRank查询，以允许结构化数据检索。在语义和查询处理方面，这种集成是不可行的。我们调查这种类型的各种语义。为了处理此类查询，简单的方法是简单地粘合为仅限排名和聚类的技术而开发的技术。这种方法成本高，因为排名和群集都被视为通过现有技术的布尔查询结果阻止后处理任务。我们提出了一种基于摘要的评估方法，该方法利用位图索引来无缝集成布尔条件，聚类和排名。实验研究表明，我们的方法显着优于直接的，并保持高集群质量。

著录项

来源
《SIGMOD/PODS》|2007年||共12页
会议地点
作者
Chengkai Li; Min Wang; Lipyeow Lim; Haixun Wang; Kevin Chen-Chuan Chang;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP311.13;
关键词
retrieval; data exploration; ranking; query processing; clustering; top-k; grouping;

机译：检索;数据探索;排名;查询处理;聚类;top-k;分组;

相似文献

外文文献
中文文献
专利

1. Ranking Z-numbers with an improved ranking method for generalized fuzzy numbers [J] . Jiang Wen, Xie Chunhe, Luo Yu, Journal of intelligent & fuzzy systems: Applications in Engineering and Technology . 2017,第3期

机译：用改进的概括模糊数排列Z号码
2. Featured Clustering and Ranking-Based Bad Cluster Removal for Hyperspectral Band Selection and Classification Using Ensemble of Binary SVM Classifiers [J] . Kalidindi Kishore Raju, Varma Pardha Saradhi G., Davuluri Rajyalakshmi International journal of information technology project management . 2021,第4期

机译：使用二进制SVM分类器的集群选择和基于排序的基于群集和基于排序的群集群集删除
3. Pythagorean Fuzzy Clustering Analysis: A Hierarchical Clustering Algorithm with the Ratio Index-Based Ranking Methods [J] . Zhang Xiaolu International journal of entelligent systems . 2018,第9期

机译：勾股模糊聚类分析：一种基于比率索引的分级聚类算法
4. Supporting ranking and clustering as generalized order-by and group-by [C] . Chengkai Li, Min Wang, Lipyeow Lim, ACM SIGMOD international conference on Management of data . 2007

机译：支持排序和聚类为广义的排序和分组
5. Influence of supports, cluster structure, and cluster composition on hydrogenation reactions catalyzed by oxide-supported metal clusters. [D] . Argo, Andrew Michael. 2001

机译：载体，簇结构和簇组成对氧化物负载的金属簇催化的氢化反应的影响。
6. Peer Reviewed: Cluster Analysis and Cluster Ranking for Asthma Inpatient Hospitalizations Among Children Adolescents and Adults Aged 0 to 19 Years in Cook County Illinois 2011–2014 [O] . Katie Labgold, Amanda C. Bennett, Kristen M. Wells 2020

机译：同行评审：2011-2014年伊利诺伊州库克县儿童青少年和0至19岁成年人哮喘住院治疗的聚类分析和聚类排名
7. Supporting ranking and clustering as generalized order-by and group-by [O] . Chengkai Li, Min Wang, Lipyeow Lim, 2007

机译：支持排名和聚类为广义的order-by和group-by
8. Configurational Energies in Terms of Effective Cluster Interactions in Binary Substitutional Alloys: Connection Between the Embedded Cluster Method and the Generalized Perturbation Method [R] . Turchi, P. E. A. , Gonis, A. , Zhang, X. , 1987

机译：二元取代合金中有效团簇相互作用的构型能：嵌入式团簇法与广义扰动法的联系

Supporting Ranking and Clustering as Generalized Order-By and Group-By*

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅