Two approaches for clustering algorithms with relational-based data

Xavier-Junior Joao C.; Canuto Anne M. P.; Goncalves Luiz M. G.

首页> 外文期刊>Knowledge and information systems >Two approaches for clustering algorithms with relational-based data

【24h】

Two approaches for clustering algorithms with relational-based data

机译：基于关系的数据的聚类算法的两种方法

获取原文

获取原文并翻译 | 示例

获取外文期刊封面目录资料

开具论文收录证明 >>

文献代查 >>

文献数据库（团队版） >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

It is well known that relational databases still play an important role for many companies around the world. For this reason, the use of data mining methods to discover knowledge in large relational databases has become an interesting research issue. In the context of unsupervised data mining, for instance, the conventional clustering algorithms cannot handle the particularities of the relational databases in an efficient way. There are some clustering algorithms for relational datasets proposed in the literature. However, most of these methods apply complex and/or specific procedures to handle the relational nature of data, or the relational-based methods do not capture the relational nature in an efficient way. Aiming to contribute to this important topic, in this paper, we will present two simple and generic approaches to handle relational-based data for clustering algorithms. One of them treats the relational data through the use of a hierarchical structure, while the second approach applies a weight structure based on relationship and attribute information. In presenting these two approaches, we aim to tackle relational-based dataset in a simple and efficient way, improving the efficiency of corporations that handle relational-based in the unsupervised data mining context. In order to evaluate the effectiveness of the presented approaches, a comparative analysis will be conducted, comparing the proposed approaches with some existing approaches and with a baseline approach. In all analyzed approaches, we will use two well-known types of clustering algorithms (agglomerative hierarchical and K-means). In order to perform this analysis, we will use two internal and one external clusters as validity measures.

机译：众所周知，关系数据库仍然对世界各地的许多公司发挥着重要作用。因此，使用数据挖掘方法来发现大型关系数据库中的知识已成为一个有趣的研究问题。例如，在无监督数据挖掘的上下文中，传统的聚类算法不能以有效的方式处理关系数据库的特定。文献中提出的关系数据集有一些聚类算法。然而，大多数方法应用复杂和/或特定程序来处理数据的关系性质，或者基于关系的方法不会以有效的方式捕获关系性质。旨在为这篇重要的主题做出贡献，在本文中，我们将提出两个简单而通用的方法来处理基于关系的群集算法的数据。其中一个通过使用分层结构来处理关系数据，而第二种方法基于关系和属性信息应用权重结构。在提出这两种方法时，我们的目标是以简单有效的方式解决基于关系的数据集，提高了基于无监督数据挖掘上下文的关系的公司的效率。为了评估所提出的方法的有效性，将进行比较分析，比较拟议的方法与一些现有方法和基线方法。在所有分析的方法中，我们将使用两个众所周知的聚类算法（附名分层和K-means）。为了执行此分析，我们将使用两个内部和一个外部集群作为有效度措施。

著录项

来源
《Knowledge and information systems》 |2020年第3期|共25页
作者
Xavier-Junior Joao C.; Canuto Anne M. P.; Goncalves Luiz M. G.;
展开▼
作者单位

Fed Univ RN Digital Metropolis Inst Natal RN Brazil;

Fed Univ RN Informat &

Appl Math Dept Natal RN Brazil;

Fed Univ RN Comp &

Automat Engn Dept Natal RN Brazil;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类自动信息理论;
关键词
Relational database; Relational data clustering approach; Cluster validity measures;

机译：关系数据库;关系数据聚类方法;集群有效性措施;

相似文献

外文文献
中文文献
专利

1. Two approaches for clustering algorithms with relational-based data [J] . Xavier-Junior Joao C., Canuto Anne M. P., Goncalves Luiz M. G. Knowledge and information systems . 2020,第3期

机译：基于关系的数据的聚类算法的两种方法
2. Visual Approaches for Exploratory Data Analysis: A Survey of the Visual Assessment of Clustering Tendency (VAT) Family of Algorithms [J] . Kumar Dheeraj, Bezdek James C. IEEE Systems, Man, and Cybernetics Magazine . 2020,第2期

机译：探索性数据分析的视觉方法：对算法聚类趋势的视觉评估调查（算法
3. A novel clustering algorithm based on data transformation approaches [J] . Azimi Rasool, Ghayekhloo Mohadeseh, Ghofrani Mahmoud, Expert Systems with Application . 2017,第Juna期

机译：一种基于数据转换方法的新型聚类算法
4. A Comparative Study of Density-based Clustering Algorithms on Data Streams: Micro-clustering Approaches [C] . Amineh Amini, Teh Ying Wah Intelligent control and innovative computing . 2011

机译：基于密度的数据流聚类算法比较研究：微聚类方法
5. Novel approaches to clustering, biclustering algorithms based on adaptive resonance theory and intelligent control. [D] . Kim, Sejun. 2016

机译：基于自适应共振理论和智能控制的新型聚类，双聚类算法。
6. Type2 diabetes mellitus prediction using data mining algorithms based on the long-noncoding RNAs expression: a comparison of four data mining approaches [O] . Faranak Kazerouni, Azadeh Bayani, Farkhondeh Asadi, 2020

机译：基于长非编码RNA表达的数据挖掘算法类型2糖尿病预测：四种数据采矿方法的比较
7. Approaches to Partition Medical Data using Clustering Algorithms [O] . P. Kalyani 2013

机译：使用聚类算法划分医学数据的方法
8. A RELATIONAL-BASED DATA MANAGEMENT SYSTEM FOR ENGINEERING AND SCIENTIFIC APPLICATION [R] . Maurice M. Hallum 1980

机译：基于关系的工程和科学应用数据管理系统

Two approaches for clustering algorithms with relational-based data

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅