Clustering Deep Web Databases Semantically

机译：语义化群集深度Web数据库

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Deep Web database clustering is a key operation in organizing Deep Web resources. Cosine similarity in Vector Space Model (VSM) is used as the similarity computation in traditional ways. However it cannot denote the semantic similarity between the contents of two databases. In this paper how to cluster Deep Web databases semantically is discussed. Firstly, a fuzzy semantic measure, which integrates ontology and fuzzy set theory to compute semantic similarity between the visible features of two Deep Web forms, is proposed, and then a hybrid Particle Swarm Optimization (PSO) algorithm is provided for Deep Web databases clustering. Finally the clustering results are evaluated according to Average Similarity of Document to the Cluster Centroid (ASDC) and Rand Index (RI). Experiments show that: 1) the hybrid PSO approach has the higher ASDC values than those based on PSO and K-Means approaches. It means the hybrid PSO approach has the higher intra cluster similarity and lowest inter cluster similarity; 2) the clustering results based on fuzzy semantic similarity have higher ASDC values and higher RI values than those based on cosine similarity. It reflects the conclusion that the fuzzy semantic similarity approach can explore latent semantics.

机译：深度Web数据库集群是组织深度Web资源的关键操作。向量空间模型（VSM）中的余弦相似度以传统方式用作相似度计算。但是，它不能表示两个数据库内容之间的语义相似性。本文讨论了如何在语义上对Deep Web数据库进行集群。首先提出了一种模糊语义测度，将本体和模糊集理论相结合，计算了两个Deep Web表单的可见特征之间的语义相似度，然后为Deep Web数据库聚类提供了一种混合粒子群优化算法。最后，根据文档与聚类质心的平均相似度（ASDC）和兰德指数（RI）评估聚类结果。实验表明：1）混合PSO方法比基于PSO和K-Means方法的ASDC值更高。这意味着混合PSO方法具有较高的集群内相似度和最低的集群间相似度。 2）基于模糊语义相似度的聚类结果比基于余弦相似度的聚类结果具有更高的ASDC值和更高的RI值。它反映了模糊语义相似性方法可以探索潜在语义的结论。

著录项

来源
《Information Retrieval Technology》|2008年|P.365-376|共12页
会议地点
作者
Ling Song; Jun Ma; Po Yan; Li Lian; Dongmei Zhang;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类计算机设备安全;
关键词
semantic deep web clustering; fuzzy set; ontology; PSO; k-means;

机译：语义深层网络聚类;模糊集;本体; PSO; k-means;

相似文献

外文文献
中文文献
专利

1. Semantic-JSON: a lightweight web service interface for Semantic Web contents integrating multiple life science databases [J] . Akihiro Matsushima, Manabu Ishii, Norio Kobayashi, Nucleic acids research . 2011,第suppla2期

机译：Semantic-JSON：轻量级Web服务接口，用于集成多个生命科学数据库的语义Web内容
2. Trust estimation of the semantic web using semantic webrnclustering [J] . Shirgahi Hossein, Mohsenzadeh Mehran, Javadi Hamid Haj Seyyed Journal of Experimental and Theoretical Artificial Intelligence . 2017,第3期

机译：使用语义编织的语义网信任度估计
3. A Framework To Convert Relational Database To Ontology For Knowledge Database In Semantic Web [J] . Vishal Jain, Dr. Mayank Singh International Journal of Scientific & Technology Research . 2013,第10期

机译：语义网中知识数据库将关系数据库转换为本体的框架
4. Clustering Deep Web Databases Semantically [C] . Ling Song, Jun Ma, Po Yan, Asia Information Retrieval Symposium . 2008

机译：语义上聚类深网络数据库
5. Integration of Heterogeneous Data for Protein Ontology Database Using Semantic Web Technology [D] . Li, Xiang 2018

机译：使用语义Web技术集成蛋白质本体数据库的异构数据
6. Semantic-JSON: a lightweight web service interface for Semantic Web contents integrating multiple life science databases [O] . Norio Kobayashi, Manabu Ishii, Satoshi Takahashi, 2011

机译：Semantic-JSON：轻型Web服务接口用于集成多个生命科学数据库的语义Web内容
7. Semantic-JSON: a lightweight web service interface for Semantic Web contents integrating multiple life science databases [O] . Kobayashi, Norio, Ishii, Manabu, Takahashi, Satoshi, 2011

机译：Semantic-JSON：轻型Web服务接口，用于集成多个生命科学数据库的语义Web内容

Clustering Deep Web Databases Semantically

摘要

著录项

相似文献

相关主题

期刊订阅