首页> 外文学位 >Modeling and local filtering of noise embedded in genome-scale microarray datasets.
【24h】

Modeling and local filtering of noise embedded in genome-scale microarray datasets.

机译:对基因组规模微阵列数据集中嵌入的噪声进行建模和局部过滤。

获取原文
获取原文并翻译 | 示例

摘要

The genomes of numerous organisms have been sequenced. This advancement creates new opportunities in biomedical research, specifically, for conducting large-scale studies of the behavior of tens of thousands of genes in different cell types or tissues. Microarrays are experimental tools that assay the relative abundance of transcripts from tens of thousands of genes at a time. Unfortunately, microarray data are very noisy. This thesis details procedures to collect and model the noise embedded in microarray datasets, and methods to filter it. The results lead us to an algorithm that yields highly accurate discovery of the relative abundance of genes in two biological samples.;We define a function whose zero set collects a sample of the noise embedded in a dataset. The noise sample is applied to build models and construct local filters that eliminate most of the noise remaining in the dataset. The model includes parameters that are optimized in reference to experimental data obtained from biological samples designed to yield expression ratios > 1 (true positives) or = 1 (true negatives). The algorithm offers significant improvements in both specificity and sensitivity; specificity is at least 1000-fold better and sensitivity is 2-fold higher than existing state-of-the-art methods. Highly specific discovery has numerous applications in biomedical research and medicine including: (1) molecular classification of tumors, (2) the discovery of signaling pathways, and (3) the discovery of molecular systems that create biological phenotypes.
机译:已经对许多生物的基因组进行了测序。这一进步为生物医学研究创造了新的机会,特别是对不同细胞类型或组织中成千上万个基因的行为进行大规模研究。微阵列是一种实验工具,可一次检测成千上万个基因的转录本的相对丰度。不幸的是,微阵列数据非常嘈杂。本文详细介绍了收集和建模微阵列数据集中嵌入的噪声的过程以及进行滤波的方法。结果使我们找到了一种算法,该算法可以高度准确地发现两个生物样本中基因的相对丰度。我们定义了一个函数,该函数的零集收集嵌入到数据集中的噪声样本。噪声样本将用于构建模型和构建局部滤波器,以消除数据集中剩余的大部分噪声。该模型包含参考从生物学样本获得的实验数据进行优化的参数,这些实验数据设计为产生表达比> 1(真阳性)或= 1(真阴性)。该算法在特异性和灵敏性上都有了显着改善。特异性至少比现有技术高10​​00倍,灵敏度高2倍。高度特异性的发现在生物医学研究和医学中具有许多应用,包括:(1)肿瘤的分子分类,(2)信号传导途径的发现,和(3)发现产生生物表型的分子系统的发现。

著录项

  • 作者

    Fathallah-Shaykh, Hassan M.;

  • 作者单位

    University of Illinois at Chicago.;

  • 授予单位 University of Illinois at Chicago.;
  • 学科 Mathematics.;Biology Bioinformatics.
  • 学位 Ph.D.
  • 年度 2007
  • 页码 62 p.
  • 总页数 62
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 遥感技术;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号