首页> 外文会议>Information Retrieval amp; Knowledge Management (CAMP), 2012 International Conference on >A flocking based data mining algorithm for detecting outliers in cancer gene expression microarray data
【24h】

A flocking based data mining algorithm for detecting outliers in cancer gene expression microarray data

机译:基于植绒的数据挖掘算法,用于检测癌症基因表达微阵列数据中的异常值

获取原文
获取原文并翻译 | 示例

摘要

The existence of outliers is a major factor of inaccuracy in cancer gene expression microarray-based experiments. Researchers confirm that in many cases outliers in one class in cancer microarray based-experiments are contaminated. As a result, outliers appear to have gene expression similar to samples of an existing class in the dataset. Hence, it is essential to analyze each class in the dataset independently from other classes. Existing outlier detection algorithms identify outliers with respect to the whole dataset. Our algorithm isolates detected classes and analyzes each class as a separate dataset. We propose a novel, simple and biologically inspired algorithm to detect outliers in cancer microarray data. This algorithm is inspired from the natural phenomena of bird flocking. We model microarray gene expression data as an artificial life where similar samples flock in a virtual space to form swarms and outliers' samples are being naturally repulsed by optimum subswarms. We demonstrate empirically that our algorithm detects biologically meaningful outlier samples. We analyze the performance of the algorithm using real colon cancer dataset widely used in the bioinformatics literature.
机译:离群值的存在是基于癌症基因表达微阵列实验的不准确性的主要因素。研究人员证实,在许多情况下,基于癌症微阵列的实验中一类的离群值都被污染了。结果,离群值似乎具有与数据集中现有类别的样本相似的基因表达。因此,必须独立于其他类分析数据集中的每个类。现有的离群值检测算法会针对整个数据集识别离群值。我们的算法隔离检测到的类别,并将每个类别分析为单独的数据集。我们提出了一种新颖,简单且受生物学启发的算法来检测癌症微阵列数据中的异常值。该算法的灵感来自鸟类聚集的自然现象。我们将微阵列基因表达数据建模为人工生命,其中类似的样本在虚拟空间中聚集以形成群,而离群值的样本被最佳亚群自然排斥。我们凭经验证明我们的算法可检测到生物学上有意义的异常样本。我们使用在生物信息学文献中广泛使用的真实结肠癌数据集来分析算法的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号