Clustering Methods for Large Databases: From the Past to the Future


Abstract

Rapid technological progress is quickly increasing the amount of information stored in databases. In addition, new applications require the storage and retrieval of complex multimedia objects, which are often represented by high-dimensional feature vectors. Finding the valuable information hidden in those databases is a difficult task. Cluster analysis is one of the basic techniques often applied in analyzing large data sets. Originating in statistics, most cluster analysis algorithms were originally developed for relatively small data sets. In recent years, clustering algorithms have been extended to work efficiently on large data sets, and some of them even allow the clustering of high-dimensional feature vectors. Many such methods use some kind of index structure for efficient retrieval of the required data; other approaches rely on preprocessing to make clustering more efficient.
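To make the idea of an index structure for efficient retrieval concrete, here is a minimal sketch in Python. It is not taken from the paper: it assumes a simple uniform grid whose cell width equals the query radius `eps`, and the names `build_grid` and `range_query` are hypothetical. It shows how a neighborhood query, the basic operation of many density-based clustering methods, can inspect only the cells around the query point instead of scanning every point.

```python
from collections import defaultdict
from itertools import product
from math import dist, floor


def build_grid(points, eps):
    """Hash each point index into the grid cell (of width eps) containing it."""
    grid = defaultdict(list)
    for i, p in enumerate(points):
        grid[tuple(floor(x / eps) for x in p)].append(i)
    return grid


def range_query(points, grid, eps, q):
    """Return sorted indices of points within distance eps of q.

    Because the cell width equals eps, every match lies in the cell of q
    or in one of its directly adjacent cells, so only those are scanned.
    """
    home = tuple(floor(x / eps) for x in q)
    hits = []
    for offset in product((-1, 0, 1), repeat=len(q)):
        key = tuple(h + o for h, o in zip(home, offset))
        for i in grid.get(key, ()):  # .get avoids creating empty cells
            if dist(points[i], q) <= eps:
                hits.append(i)
    return sorted(hits)


# Illustrative data only, not from the paper.
points = [(0.0, 0.0), (0.5, 0.5), (3.0, 3.0), (0.9, 0.0), (-0.5, 0.0)]
grid = build_grid(points, eps=1.0)
neighbors = range_query(points, grid, eps=1.0, q=(0.0, 0.0))
```

The query scans 3^d cells for d-dimensional data, so this particular sketch is only practical in low dimensions; that growth is one reason high-dimensional feature vectors call for different index structures or preprocessing.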


Bibliographic details

Source: ACM SIGMOD International Conference on Management of Data, 1999, 1 page
Authors: Alexander Hinneburg; Daniel A. Keim
Original format: PDF
Chinese Library Classification: TP3-532

