首页> 外文会议>ACM SIGMOD International Conference on Management of Data >Clustering Methods for Large Databases: From the Past to the Future
【24h】

Clustering Methods for Large Databases: From the Past to the Future

机译:大型数据库的聚类方法:从过去到未来

获取原文

摘要

Because of the fast technological progress, the amount of information which is stored in databases is rapidly increasing. In addition, new applications require the storage and retrieval of complex multimedia objects which are often represented by high-dimensional feature vectors. Finding the valuable information hidden in those databases is a difficult task. Cluster analysis is one of the basic techniques which is often applied in analyzing large data sets. Originating from the area of statistics, most cluster analysis algorithms have originally been developed for relatively small data sets. In the recent years, the clustering algorithms have been extended to efficiently work on large data sets, and some of them even allow the clustering of high-dimensional feature vectors. Many such methods use some kind of an index structure for an efficient retrieval of the required data; other approaches are based on preprocessing for a more efficient clustering.
机译:由于技术进步速度快,存储在数据库中的信息量正在迅速增加。此外,新应用程序需要存储和检索复杂的多媒体对象,这些对象通常由高维特征向量表示。找到这些数据库中隐藏的有价值的信息是一项艰巨的任务。集群分析是通常应用于分析大数据集的基本技术之一。源自统计区域,大多数集群分析算法最初是为相对较小的数据集开发的。近年来,群集算法已经扩展以有效地处理大数据集,其中一些甚至允许群集高维特征向量。许多这样的方法使用某种索引结构来有效检索所需数据;其他方法基于预处理进行更有效的聚类。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号