On High Dimensional Skylines

机译：在高维天际线上

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

In many decision-making applications, the skyline query is frequently used to find a set of dominating data points (called skyline points) in a multidimensional dataset. In a high-dimensional space skyline points no longer offer any interesting insights as there are too many of them. In this paper, we introduce a novel metric, called skyline frequency that compares and ranks the interesting-ness of data points based on how often they are returned in the skyline when different number of dimensions (i.e., subspaces) are considered. Intuitively, a point with a high skyline frequency is more interesting as it can be dominated on fewer combinations of the dimensions. Thus, the problem becomes one of finding top-k frequent skyline points. But the algorithms thus far proposed for skyline computation typically do not scale well with dimensionality. Moreover, frequent skyline computation requires that skylines be computed for each of an exponential number of subsets of the dimensions. We present efficient approximate algorithms to address these twin difficulties. Our extensive performance study shows that our approximate algorithm can run fast and compute the correct result on large data sets in high-dimensional spaces.

机译：在许多决策应用程序中，天际线查询通常用于在多维数据集中查找一组主要数据点（称为天际线点）。在高维空间中，天际点不再提供任何有趣的见解，因为它们太多了。在本文中，我们介绍了一种称为``天际线频率''的新颖度量标准，该指标根据考虑了不同数量的维数（即子空间）时它们在天际线中返回的频率来比较和排列数据点的有趣程度。直观地讲，具有较高天际线频率的点更有趣，因为它可以在较少的尺寸组合上占主导地位。因此，该问题成为查找前k个频繁的天际线点之一。但是，迄今为止提出的用于天际线计算的算法通常无法很好地随维数扩展。此外，频繁的天际线计算要求为每个指数子集的维数计算天际线。我们提出了有效的近似算法来解决这些双重难题。我们广泛的性能研究表明，我们的近似算法可以快速运行并在高维空间中的大型数据集上计算正确的结果。

著录项

来源
《International Conference on Extending Database Technology(EDBT 2006); 20060326-31; Munich(DE)》|2006年|P.478-495|共18页
会议地点 Munich(DE)
作者
Chee-Yong Chan; H.V. Jagadish; Kian-Lee Tan; Anthony K.H. Tung; Zhenjie Zhang;
展开▼
作者单位

National University of Singapore University of Michigan;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类 TP311.13;
关键词
入库时间 2022-08-26 14:11:47

相似文献

外文文献
中文文献
专利

1. Sky Flow: A visual analysis of high-dimensional skylines in time-series [J] . Wooil Kim, Changbeom Shim, Yon Dohn Chung Journal of visualization . 2021,第5期

机译：天空：时间序列高维地平线的视觉分析
2. Parallel k-dominant skyline queries in high-dimensional datasets [J] . Peng Yi-Wen, Chen Wei-Mei Information Sciences: An International Journal . 2019,第期

机译：在高维数据集中的并行k主导地平线查询
3. Spatial skyline query method based on Hilbert R-tree in multi-dimensional space [J] . Li Song, Zhang Liping, Li Shuang, 高技术通讯（英文版） . 2019,第003期

机译：基于希尔伯特R树的多维空间空间天际线查询方法
4. Multi-dimensional Skyline Query to Find Best Shopping Mall for Customers [C] . Md Amiruzzaman, Suphanut Jamonnak International Conference on Devices, Circuits and Systems . 2020

机译：多维天际线查询为客户找到最佳购物中心
5. Efficient algorithms for similarity and skyline summary on multidimensional datasets. [D] . Morse, Michael David. 2007

机译：多维数据集上的相似度和天际线摘要的高效算法。
6. TCA precipitation and ethanol/HCl single-step purification evaluation: One-dimensional gel electrophoresis bradford assays spectrofluorometry and Raman spectroscopy data on HSA Rnase lysozyme - Mascots and Skyline data [O] . Balkis Eddhif, Nadia Guignard, Yann Batonneau, 2018

机译：TCA沉淀和乙醇/ HCl单步纯化评估：HSARnase溶菌酶-吉祥物和天际线数据的一维凝胶电泳布拉德福德测定荧光光谱和拉曼光谱数据
7. Treillis des concepts skylines : Analyse multidimensionnelle des skylines fondée sur les ensembles en accord [O] . Nedjar Sébastien, Pesci Fabien, Lakhal Lotfi, 2010

机译：天际线概念的格：基于调整集的天际线多维分析

On High Dimensional Skylines

摘要

著录项

相似文献

相关主题

期刊订阅