首页> 外文会议>Annual International Conference of the IEEE Engineering in Medicine and Biology Society >Cloud-scale genomic signals processing classification analysis for gene expression microarray data
【24h】

Cloud-scale genomic signals processing classification analysis for gene expression microarray data

机译:基因表达微阵列数据的云规模基因组信号处理分类分析

获取原文

摘要

As microarray data available to scientists continues to increase in size and complexity, it has become overwhelmingly important to find multiple ways to bring inference though analysis of DNA/mRNA sequence data that is useful to scientists. Though there have been many attempts to elucidate the issue of bringing forth biological inference by means of wavelet preprocessing and classification, there has not been a research effort that focuses on a cloud-scale classification analysis of microarray data using Wavelet thresholding in a Cloud environment to identify significantly expressed features. This paper proposes a novel methodology that uses Wavelet based Denoising to initialize a threshold for determination of significantly expressed genes for classification. Additionally, this research was implemented and encompassed within cloud-based distributed processing environment. The utilization of Cloud computing and Wavelet thresholding was used for the classification 14 tumor classes from the Global Cancer Map (GCM). The results proved to be more accurate than using a predefined p-value for differential expression classification. This novel methodology analyzed Wavelet based threshold features of gene expression in a Cloud environment, furthermore classifying the expression of samples by analyzing gene patterns, which inform us of biological processes. Moreover, enabling researchers to face the present and forthcoming challenges that may arise in the analysis of data in functional genomics of large microarray datasets.
机译:随着可供科学家使用的微阵列数据的数量和复杂性不断增加,通过对科学家有用的DNA / mRNA序列数据的分析来寻找多种推论方法已变得极为重要。尽管已经进行了很多尝试来阐明通过小波预处理和分类来提出生物学推断的问题,但是还没有研究工作集中在在云环境中使用小波阈值技术对微阵列数据进行云规模分类分析,以解决这一问题。识别明显表达的特征。本文提出了一种新颖的方法,该方法使用基于小波的降噪初始化用于确定显着表达的基因进行分类的阈值。此外,这项研究已实施并包含在基于云的分布式处理环境中。利用云计算和小波阈值技术对来自全球癌症图谱(GCM)的14种肿瘤分类进行了分类。结果证明比使用预定义的p值进行差异表达分类更准确。这种新颖的方法分析了基于小波的云环境中基因表达的阈值特征,并通过分析基因模式对样品的表达进行了分类,从而告知了我们生物学过程。此外,使研究人员能够面对大型微阵列数据集的功能基因组学中的数据分析中可能出现的当前和即将出现的挑战。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号