首页> 外文会议>IEEE SoutheastCon >Characterization of different datasets for ICA algorithms
【24h】

Characterization of different datasets for ICA algorithms

机译:ICA算法的不同数据集的表征

获取原文

摘要

There are several Independent Component Analysis (ICA) algorithms based on different approaches that are used to estimate the independent components, up to some precision, from a linear mixture of independent components as long as the independent components do not follow Gaussian distribution. Some approaches, however, work better than the other if data distribution and characteristics follow a certain pattern. From the mixture of data comprising two or more independent components it is quite hard, if not impossible, to find out the distribution of the independent components accurately and therefore to characterize one or more ICA approaches to be better than others for certain type of data. This paper describes a framework for ICA algorithms as proposed by Ejaz [1]. In this study we have characterized four different ICA algorithms, with some pre-selected fixed parameters, based on different approaches for a number of different datasets that are linearly mixed with two to five independent components that follow a number of different distributions. ICA algorithms used for the research include FastICA, Extended Infomax, JADE, and Kernel ICA based on canonical correlation analysis. All of these algorithms are discussed briefly yet covering most of their important aspects such that this paper also serves as a tutorial for these ICA algorithms for novice readers. We have done an extensive statistical analysis for more than 300 different datasets to characterize them for one or more of the four algorithms based on to which algorithm estimates the independent components closest to the original components.
机译:只要独立分量不遵循高斯分布,就有几种基于不同方法的独立分量分析(ICA)算法可用于从独立分量的线性混合中估算独立分量(达到一定精度)。但是,如果数据分布和特征遵循某种模式,则某些方法会比其他方法更好。从包含两个或多个独立成分的数据混合中,很难,即使不是不可能,也很难准确地找出独立成分的分布,因此很难确定一种或多种ICA方法对于某些类型的数据要优于其他方法。本文描述了由Ejaz [1]提出的ICA算法框架。在这项研究中,我们针对四种不同数据集的不同方法,采用了一些预先选择的固定参数,对四种不同的ICA算法进行了特征化,这些数据集线性混合有遵循不同分布的2至5个独立成分。用于研究的ICA算法包括基于规范相关分析的FastICA,Extended Infomax,JADE和Kernel ICA。简要讨论了所有这些算法,但涵盖了它们的大部分重要方面,因此,本文也为新手读者提供了这些ICA算法的教程。我们已经对300多个不同的数据集进行了广泛的统计分析,以针对四种算法中的一种或多种对它们进行表征,基于该算法,算法估计出最接近原始成分的独立成分。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号