首页> 外文会议>European Conference on Principles of Data Mining and Knowledge Discovery >Finding Hidden Factors Using Independent Component Analysis
【24h】

Finding Hidden Factors Using Independent Component Analysis

机译:使用独立分量分析找到隐藏因素

获取原文

摘要

Independent Component Analysis (ICA) is a computational technique for revealing hidden factors that underlie sets of measurements or signals. ICA assumes a statistical model whereby the observed multivariate data, typically given as a large database of samples, are assumed to be linear or nonlinear mixtures of some unknown latent variables. The mixing coefficients are also unknown. The latent variables are non-gaussian and mutually independent, and they are called the independent components of the observed data. By ICA, these independent components, also called sources or factors, can be found. Thus ICA can be seen as an extension to Principal Component Analysis and Factor Analysis. ICA is a much richer technique, however, capable of finding the sources when these classical methods fail completely. In many cases, the measurements are given as a set of parallel signals or time series. Typical examples are mixtures of simultaneous sounds or human voices that have been picked up by several microphones, brain signal measurements from multiple EEG sensors, several radio signals arriving at a portable phone, or multiple parallel time series obtained from some industrial process. The term blind source separation is used to characterize this problem. The lecture will first cover the basic idea of demixing in the case of a linear mixing model and then take a look at the recent nonlinear demixing approaches. Although ICA was originally developed for digital signal processing applications, it has recently been found that it may be a powerful tool for analyzing text document data as well, if the documents are presented in a suitable numerical form. A case study on analyzing dynamically evolving text is covered in the talk.
机译:独立分量分析(ICA)是一种计算技术,用于揭示底层测量或信号的隐藏因素。 ICA假设一个统计模型,由此假设观察到通常给出的样本数据库的多变量数据是一些未知潜在变量的线性或非线性混合物。混合系数也未知。潜在变量是非高斯和相互独立的,它们称为观察到的数据的独立组件。通过ICA,可以找到这些独立组件,也可以找到源或因素。因此,ICA可以被视为主要成分分析和因子分析的延伸。 ICA是一种更丰富的技术,但是当这些古典方法完全失败时,能够找到这些来源。在许多情况下,测量被给出为一组并行信号或时间序列。典型的示例是已经被几个麦克风拾取的同时声音或人类声音的混合物,来自多个EEG传感器的脑信号测量,到达便携式电话的多个无线电信号,或者从某些工业过程中获得的多个并行时间序列。术语盲源分离用于表征此问题。讲座将首先在线性混合模型的情况下覆盖解脱模的基本思想,然后看看最近的非线性解剖方法。虽然ICA最初用于数字信号处理应用,但最近发现它可能是分析文本文档数据的强大工具,如果文档以合适的数字形式呈现。谈话中涵盖了分析动态发展文本的案例研究。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号