Clustering of information unit entity based on the semantic similarity between the distribute information source is the important step of global view construction for information sharing in the virtual organization. This paper orienting the demands of constructing the global unified view of distribute information, using a metadata model based on ontology and the semantic similarity, defines the semantic clustering feature(SCF). And with the definition of SCF, this paper designs a SCF based hybrid hiberarchy clustering algorithm, and presents the analysis of the algorithm from theory and experiment.%根据各分布信息源信息单元实体类的语义相似度,对于信息单元实体类进行聚类,是半自动地进行本体映射、构建分布异构信息资源全局视图的重要步骤.本文面向分布信息资源统一信息视图构建需求,利用基于本体的元数据模型及语义相似度,在其基础上定义了语义聚类特征,基于语义聚类特征设计了一种基于语义特征树的混合层次聚类算法SCFBHCA.从理论和实验两个角度对SCFBHCA算法进行了分析,对比HCA和HCP,该算法具有增量式和扩展性且效率更高.
展开▼