首页>
外国专利>
Method, apparatus and programmed medium for clustering databases with categorical attributes
Method, apparatus and programmed medium for clustering databases with categorical attributes
展开▼
机译:用于对具有分类属性的数据库进行聚类的方法,装置和程序介质
展开▼
页面导航
摘要
著录项
相似文献
摘要
The present invention relates to a computer method, apparatus and programmed medium for clustering databases containing data with categorical attributes. The present invention assigns a pair of points to be neighbors if their similarity exceeds a certain threshold. The similarity value for pairs of points can be based on non-metric information. The present invention determines a total number of links between each cluster and every other cluster bases upon the neighbors of the clusters. A goodness measure between each cluster and every other cluster based upon the total number of links between each cluster and every other cluster and the total number of points within each cluster and every other cluster is then calculated. The present invention merges the two clusters with the best goodness measure. Thus, clustering is performed accurately and efficiently by merging data based on the amount of links between the data to be clustered.
展开▼