Clustering XML Documents Using Closed Frequent Subtrees: A Structural Similarity Approach

机译：使用闭合频繁子树进行聚类XML文档：结构相似性方法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper presents the experimental study conducted over the INEX 2007 Document Mining Challenge corpus employing a frequent subtree-based incremental clustering approach. Using the structural information of the XML documents, the closed frequent subtrees are generated. A matrix is then developed representing the closed frequent subtree distribution in documents. This matrix is used to progressively cluster the XML documents. In spite of the large number of documents in INEX 2007 Wikipedia dataset, the proposed frequent subtree-based incremental clustering approach was successful in clustering the documents.

机译：本文介绍了在2007年Inex 2007年挖掘挑战语料库上进行的实验研究，采用频繁的基于子树的增量聚类方法。使用XML文档的结构信息，生成封闭的频繁子树。然后开发矩阵，其代表文档中的封闭式频繁的子树分布。该矩阵用于逐步群集XML文档。尽管Inex 2007 Wikipedia数据集中的大量文档，所提出的频繁的基于子树的增量聚类方法是成功的群集文件。

著录项

来源
《International Workshop of the Initiative for the Evaluation of XML Retrieval》|2008年||共12页
会议地点
作者
Sangeetha Kutty; Tien Tran; Richi Nayak; Yuefeng Li;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP3-53;
关键词
clustering; xml document mining; frequent mining; frequent subtrees; inex; structural mining;

机译：聚类;XML文件挖掘;频繁采矿;频繁的子树;inex;结构挖掘;

相似文献

外文文献
中文文献
专利

1. Using structural similarity for clustering XML documents [J] . Ali Aitelhadj, Mohand Boughanem, Mohamed Mezghiche, Knowledge and information systems . 2012,第1期

机译：使用结构相似性对XML文档进行聚类
2. Using structural similarity for clustering XML documents [J] . Ali Aïtelhadj, Mohand Boughanem, Mohamed Mezghiche, Knowledge and Information Systems . 2012,第1期

机译：使用结构相似性对XML文档进行聚类
3. Clustering XML Documents Using Structure and Content based on a New Similarity Function OverallSimSUX [J] . Damny Magdaleno, Ivett E. Fuentes, María M. García Computacion y Sistemas . 2015,第1期

机译：基于结构和内容的XML文档基于新的相似性功能TotalSimSUX的聚类
4. Clustering XML Documents Using Closed Frequent Subtrees: A Structural Similarity Approach [C] . Sangeetha Kutty, Tien Tran, Richi Nayak, International Workshop of the Initiative for the Evaluation of XML Retrieval . 2008

机译：使用闭合频繁子树进行聚类XML文档：结构相似性方法
5. Mining frequent structural patterns from XML datasets. [D] . Ali, Mohammed Mohsin. 2012

机译：从XML数据集中挖掘频繁的结构模式。
6. Management of Clinical XML Documents: A Pragmatic Approach [O] . R Schweiger, T Buerkle, S Hoelzer, 2000

机译：临床XML文档管理：一种务实的方法
7. Clustering XML Documents Using Closed Frequent Subtrees: A Structural Similarity Approach [O] . Kutty Sangeetha, Tran Tien, Nayak Richi, 2008

机译：使用封闭的频繁子树对XML文档进行聚类：结构相似性方法

Clustering XML Documents Using Closed Frequent Subtrees: A Structural Similarity Approach

摘要

著录项

相似文献

相关主题

期刊订阅