k-Nearest Neighbour Using Ensemble Clustering Based on Feature Selection Approach to Learning Relational Data

机译：基于专题选择方法的基于特征选择方法，使用集群基于学习关系数据

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Due to the growing amount of data generated and stored in relational databases, relational learning has attracted the interest of researchers in recent years. Many approaches have been developed in order to learn relational data. One of the approaches used to learn relational data is Dynamic Aggregation of Relational Attributes (DARA). The DARA algorithm is designed to summarize relational data with oneto-many relations. However, DARA suffers a major drawback when the cardinalities of attributes are very high because the size of the vector space representation depends on the number of unique values that exist for all attributes in the dataset. A feature selection process can be introduced to overcome this problem. These selected features can be further optimized to achieve a good classification result. Several clustering runs can be performed for different values of k to yield an ensemble of clustering results. This paper proposes a two-layered genetic algorithm-based feature selection in order to improve the classification performance of learning relational database using a k-NN ensemble classifier. The proposed method involves the task of omitting less relevant features but retaining the diversity of the classifiers so as to improve the performance of the k-NN ensemble. The result shows that the proposed k-NN ensemble is able to improve the performance of traditional k-NN classifiers.

机译：由于生成和存储在关系数据库中的数据越来越多，近年来的关系学习吸引了研究人员的兴趣。已经开发了许多方法，以便学习关系数据。用于学习关系数据的方法之一是关系属性的动态聚合（DARA）。 DARA算法旨在将关系数据与ONETO-许多关系汇总。然而，当属性的基数非常高时，Dara遭受了重大缺点，因为矢量空间表示的大小取决于数据集中所有属性的唯一值的数量。可以引入特征选择过程以克服此问题。可以进一步优化这些所选特征以实现良好的分类结果。可以对k的不同值执行几个聚类运行，以产生聚类结果的集合。本文提出了一种基于两层遗传算法的特征选择，以便使用K-NN集合分类器来提高学习关系数据库的分类性能。所提出的方法涉及省略较少相关特征但保留分类器的多样性的任务，以提高K-NN集合的性能。结果表明，所提出的K-NN集合能够改善传统K-NN分类器的性能。

著录项

来源
《International Conference on Advances in Information and Communication Technology》|2017年|xvii 661 pages|共10页
会议地点
作者
Rayner Alfred; Kung Ke Shin; Mohd Shamrie Sainin; Chin Kim On; Paulraj Murugesa Pandiyan; Ag Asri Ag Ibrahim;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TN92-532;
关键词
Relational data mining; k-Nearest Neighbours; Classification; Ensembles; Feature selection; Genetic Algorithm;

机译：关系数据挖掘;k最近邻居;分类;合奏;特征选择;遗传算法;

相似文献

外文文献
中文文献
专利

1. K-nearest neighbour-based feature selection using hyperspectral data [J] . Pal Mahesh, Charan Teja B., Poriya Akshay Remote sensing letters . 2021,第1a3期

机译：基于邻居的基于邻居的特征选择，使用超细数据
2. Enhancing the Classification Accuracy of Noisy Dataset by Fusing Correlation Based Feature Selection With K-Nearest Neighbour [J] . SAMIR KUMAR SINGHA, SYED IMTIAZ HASSAN Oriental Journal of Computer Science and Technology . 2017,第2期

机译：通过将基于相关的特征选择与K最近邻融合来提高噪声数据集的分类精度
3. Enhancing the Classification Accuracy of Noisy Dataset By Fusing Correlation Based Feature Selection with K-Nearest Neighbour [J] . Samir Kumar Singha, Syed Imtiaz Hassan Oriental journal of computer science and technology . 2017,第2期

机译：通过将基于相关的特征选择与K最近邻融合来提高噪声数据集的分类精度
4. k-Nearest Neighbour Using Ensemble Clustering Based on Feature Selection Approach to Learning Relational Data [C] . Rayner Alfred, Kung Ke Shin, Mohd Shamrie Sainin, International Conference on Advances in Information and Communication Technology . 2017

机译：基于学习关系数据的特征选择方法，使用合奏聚类的k - 最近邻居
5. A categorical data clustering approach with expectation maximization and K-nearest neighbour. [D] . Liu, Yu. 2003

机译：一种具有期望最大化和K近邻的分类数据聚类方法。
6. EnRank: An Ensemble Method to Detect Pulmonary Hypertension Biomarkers Based on Feature Selection and Machine Learning Models [O] . Xiangju Liu, Yu Zhang, Chunli Fu, 2021

机译：腹部：基于特征选择和机器学习模型检测肺动脉高压生物标志物的合奏方法
7. ENHANCING THE CLASSIFICATION ACCURACY OF NOISY DATASET BY FUSING CORRELATION BASED FEATURE SELECTION WITH K-NEAREST NEIGHBOUR [O] . Samir Singha, Syed Hassan 2017

机译：通过熔断基于邻居的相关的特征选择来提高噪声数据集的分类准确性

k-Nearest Neighbour Using Ensemble Clustering Based on Feature Selection Approach to Learning Relational Data

摘要

著录项

相似文献

相关主题

期刊订阅