An Effective Data Processing Method for Fast Clustering



Abstract

With the spread of Internet usage, heterogeneous computing platforms, and ubiquitous computing technologies, the volume of Web data, which are usually written in XML, is growing explosively. As Web data grow and clustering them becomes more important, a similarity detection method is needed, because similarity detection is a fundamental technology for efficient document management. In this paper, we introduce a similarity detection method that checks both the semantic and the structural similarity between XML DTDs. For semantic checking we adopt ontology technology; for structural checking we apply the longest common string and longest nesting common string methods. Because the method operates on multi-tag sequences instead of traversing XML schema trees, it produces reasonable results quickly.
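To illustrate the tag-sequence idea described in the abstract, the following Python sketch compares two flattened tag sequences. It is not the authors' implementation. The abstract does not define the "longest common string" or "longest nesting common string" measures, so the sketch makes three assumptions: structural overlap is taken as the longest contiguous run of shared tags, a hypothetical `SYNONYMS` table stands in for an ontology lookup, and the score is normalized by the longer sequence length. All function and variable names are illustrative.

```python
from typing import Dict, List, Sequence

# Hypothetical synonym table standing in for an ontology lookup (assumption).
SYNONYMS: Dict[str, str] = {"writer": "author", "name": "title"}


def normalize(tags: Sequence[str]) -> List[str]:
    """Map each tag to a canonical term so that synonymous tags compare equal."""
    return [SYNONYMS.get(t.lower(), t.lower()) for t in tags]


def longest_common_run(a: Sequence[str], b: Sequence[str]) -> int:
    """Length of the longest contiguous run of tags shared by a and b."""
    best = 0
    prev = [0] * (len(b) + 1)
    for x in a:
        cur = [0] * (len(b) + 1)
        for j, y in enumerate(b, start=1):
            if x == y:
                # Extend the run that ended at the previous tag of both sequences.
                cur[j] = prev[j - 1] + 1
                best = max(best, cur[j])
        prev = cur
    return best


def structural_similarity(a: Sequence[str], b: Sequence[str]) -> float:
    """Longest common run divided by the longer sequence length (assumed scoring)."""
    na, nb = normalize(a), normalize(b)
    if not na or not nb:
        return 0.0
    return longest_common_run(na, nb) / max(len(na), len(nb))


# Two made-up tag sequences, as if flattened from two DTDs.
dtd_a = ["book", "title", "author", "price"]
dtd_b = ["book", "name", "writer", "year"]
score = structural_similarity(dtd_a, dtd_b)
print(score)
```

Working on flat sequences keeps the comparison to a single dynamic-programming pass over two lists, which is the kind of saving the abstract attributes to avoiding schema-tree traversal.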


Bibliographic record

Source: International Conference on Computational Science and Its Applications (ICCSA 2008), 2008, pp. 335-347 (13 pages)
Authors: Hyun-Joo Moon; Sangheon Kim; Jongbae Moon; Eun-Ser Lee
Original format: PDF
Chinese Library Classification: Computing technology, computer technology
Keywords: XML; DTD; similarity detection; ontology; tag sequences

Date indexed: 2022-08-26 14:16:06

Similar documents


1. Diluvian Clustering: A Fast, Effective Algorithm for Clustering Compositional and Other Data [J]. Ritchie, Nicholas W. M. Microscopy and Microanalysis: The Official Journal of Microscopy Society of America, Microbeam Analysis Society, Microscopical Society of Canada. 2015, No. 5

2. Online analytical processing (OLAP): a fast and effective data mining tool for gene expression databases [J]. Alkharouf NW, Jamison DC, Matthews BF. Journal of Biomedicine & Biotechnology. 2005, No. 2

3. An Effective Data Processing Method for Fast Clustering [C]. Hyun-Joo Moon, Sangheon Kim, Jongbae Moon, Eun-Ser Lee. International Conference on Computational Science and Its Applications. 2008

4. Improved methods for NMR data processing, and faster experiments for oligosaccharide structure and mixture analysis [D]. Ridge, Clark D. 2010

5. Online Analytical Processing (OLAP): A Fast and Effective Data Mining Tool for Gene Expression Databases [O]. Nadim W. Alkharouf, D. Curtis Jamison, Benjamin F. Matthews. 2005

6. ClusMAM: fast and effective unsupervised clustering of large complex datasets using metric access methods [O]. Souza, Jessica Andressa de; Cazzolato, Mirela Teixeira; Traina, Agma Juci Machado. 2016
