【24h】

A Hybrid Unsupervised Feature Selection Algorithm

机译:混合无监督特征选择算法

获取原文

摘要

Due to the explosion of data, a vast amount of high-dimensional data like images, texts as well as medical microarray data are generated. In addition to exponentially raising measurement storage and processing strain on algorithms & computer hardware, direct processing of high-dimensional data often results in poor performance because of irrelevant, noisy as well as duplicate dimensions. A large number of features present in the dataset used for machine intelligence purposes pose a big threat to researchers. Algorithms that use these large dimension features suffer in terms of computer time taken to make decisions and space required to store them in computer memory. In the proposed work we have developed a hybrid algorithm to select the highly discriminative features present in the dataset. Using the multicluster feature rank score and unsupervised discriminative feature ranking methods in selecting the most discriminative features, on some well-documented datasets like the ORL, we have carried out comprehensive experiments. Our experimental results have proven the superiority of our algorithms in comparison to some state-of-the-art algorithms.
机译:由于数据爆炸,产生了大量的高维数据,如图像,文本以及医疗微阵列数据。除了在算法和计算机硬件上提高测量存储和加工应变之外,直接处理高维数据通常由于不相关,嘈杂和重复尺寸而导致性能差。用于机器智能目的的数据集中存在的大量特征对研究人员构成了很大的威胁。使用这些大维功能的算法在计算机时间方面受到占据储存在计算机存储器中所需的决策和空间的算法。在拟议的工作中,我们开发了一种混合算法,可选择数据集中存在的高度辨别特征。使用多板特征等级得分和无监督的鉴别特征排名方法在选择最辨别的特征时,在一些被记录的数据集如ORL上,我们进行了综合实验。与某些最先进的算法相比,我们的实验结果证明了我们的算法的优越性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号