首页> 外文会议>2011 International Conference on Management of e-Commerce and e-Government >Clustering XML Search Results Based on Content and Structure Similarity

【24h】

Clustering XML Search Results Based on Content and Structure Similarity

机译：基于内容和结构相似性的XML搜索结果聚类

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Clustering XML search results is an effective way to improve performance. However, the key problem is how to measure similarity between XML documents. In this paper, we propose a semantic similarity measure method combining content with structure, in which a variety of XML document features, including term element frequency, term inverse element frequency, semantic weight of tag label and level information of the term, are analyzed and applied for computing the similarity between XML documents. In addition, two new performance evaluation methodology, namely Cluster Ratio Relevant and Docu Ratio Relevant, for clustering quality are introduced motivated by the observations of relevant documents distribution and the fact that collection has no classification information. Experiment results show that proposed similarity method(CAS measure)outperforms traditional document clustering(CO measure) in Cluster Ratio Relevant and Docu Ratio Relevant and produces better clustering quality.

机译：群集XML搜索结果是提高性能的有效方法。但是，关键问题是如何测量XML文档之间的相似性。在本文中，我们提出了一种与结构结合内容的语义相似度测量方法，其中分析了各种XML文档特征，包括术语元素频率，术语逆元素频率，标签标签的语义权重和术语的级别信息，应用于计算XML文档之间的相似性。此外，通过有关文件分布的观察和收集没有分类信息，引入了两种新的性能评估方法，即群集比率和相关和文档比率相关的DOCU比率。实验结果表明，所提出的相似性方法（CAS测量）优于传统的文档聚类（CO测量）在群集比中相关和DOCU比率相关，并产生更好的聚类质量。

著录项

来源
《2011 International Conference on Management of e-Commerce and e-Government 》|2011年|p.10-14|共5页
会议地点
作者
Min-Juan Zhong; Chang-Xuan Wan; De-Xi Liu; Xian-Pei Jiao;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类电子贸易、网上贸易 ;
关键词
XML Clustering; node level; relevant cluster ratio; relevant document distribution ratio; tag weight;

机译：XML集群;节点级别;相关集群率;相关文档分配率;标签权重;

相似文献

外文文献
中文文献
专利

1. Clustering XML Documents Using Structure and Content based on a New Similarity Function OverallSimSUX [J] . Damny Magdaleno, Ivett E. Fuentes, María M. García Computacion y Sistemas . 2015 ,第1期

机译：基于结构和内容的XML文档基于新的相似性功能TotalSimSUX的聚类
2. Similarity search for office XML documents based on style and structure data [J] . Yousuke Watanabe, Hidetaka Kamigaito, Haruo Yokota International journal of web information systems . 2013 ,第2期

机译：基于样式和结构数据的Office XML文档的相似性搜索
3. Strategy for XML Integration Using Similarity in Structure and Content [J] . Youn Hee KIM, Byung Gon KIM, Jaeho LEE, IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences . 2004 ,第6期

机译：使用结构和内容相似性的XML集成策略
4. Clustering XML Search Results Based on Content and Structure Similarity [C] . Min-Juan Zhong, Chang-Xuan Wan, De-Xi Liu, International Conference on Management of e-Commerce and e-Goverment . 2011

机译：基于内容和结构相似性群集XML搜索结果
5. A generalized multidimensional index structure for multimedia data to support content-based similarity searches in a collaborative search environment. [D] . Chetterjee, Kasturi. 2010

机译：用于多媒体数据的通用多维索引结构，以在协作搜索环境中支持基于内容的相似性搜索。
6. Omokage search: shape similarity search service for biomolecular structures in both the PDB and EMDB [O] . Hirofumi Suzuki, Takeshi Kawabata, Haruki Nakamura -1

机译：Omokage搜索：PDB和EMDB中生物分子结构的形状相似性搜索服务
7. Combining structure and content similarities for XML document clustering [O] . Tran Tien, Nayak Richi, Bruza Peter D. 2008

机译：结合结构和内容相似性进行XML文档集群

Clustering XML Search Results Based on Content and Structure Similarity

摘要

著录项

相似文献

相关主题

期刊订阅