首页> 外文会议>Twenty-First International Workshop on Database and Expert Systems Applications >Noise and Outlier Filtering in Heterogeneous Medical Data Sources
【24h】

Noise and Outlier Filtering in Heterogeneous Medical Data Sources

机译:异构医学数据源中的噪声和异常值过滤

获取原文

摘要

There is a growing interest in studying the common features from multiple data sources. Fusing information from multiple heterogenous data sources promises to identify complex multivariate relationships among the heterogeneous sources. Such relationships can provide additional connectivity across the sources. A common way to analyze the relationships between a pair of data sources based on their correlation is canonical correlation analysis (CCA). CCA seeks for linear combinations of all variables from each dataset with maximal correlation between the two linear combinations. However, the existence of non-informative data points and features makes it challenging for CCA to identify significant relationships among the examined datasets. In this paper, we propose a novel method, NORA, Noise-Outliers Removal Algorithm, that can be used to filter out the non-informative data points and features before applying the CCA. NORA was applied to preprocess two epilepsy modalities, the MRI and neuropsychology, prior to applying CCA to find the association between them. The results show that the proposed method leads to interpretable results when noise plays a significant role in the acquisition of the data.
机译:对研究来自多个数据源的共同特征的兴趣与日俱增。融合来自多个异构数据源的信息有望在异构源之间识别复杂的多元关系。这样的关系可以提供跨源的附加连接。基于一对数据源之间的相关性来分析它们之间的关系的一种常用方法是规范相关性分析(CCA)。 CCA会从每个数据集中寻找所有变量的线性组合,并在两个线性组合之间建立最大的相关性。但是,非信息性数据点和特征的存在使CCA难以确定所检查数据集之间的重要关系。在本文中,我们提出了一种新的方法,NORA,噪声离群值消除算法,该方法可用于在应用CCA之前滤除非信息性数据点和特征。在应用CCA查找两者之间的关联之前,先将NORA应用于MRI和神经心理学这两种癫痫病的治疗方法。结果表明,当噪声在数据采集中起重要作用时,所提出的方法可得到可解释的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号