
A summarization approach for Affymetrix GeneChip data using a reference training set from a large biologically diverse database



Abstract

Background: Many of the most popular pre-processing methods for Affymetrix expression arrays, such as RMA, gcRMA, and PLIER, simultaneously analyze data across a set of predetermined arrays to improve the precision of the final expression measures. One problem with these algorithms is that the expression measurements for a particular sample depend heavily on the set of samples used for normalization, so results obtained by normalizing with a different set may not be comparable. A related problem is that an organization producing and/or storing large amounts of data sequentially must either re-run the pre-processing algorithm every time an array is added or store the arrays in batches that are pre-processed together. Furthermore, pre-processing a large number of arrays requires loading all the feature-level data into memory, which is difficult even with modern computers. We use a scheme that derives all the information necessary for pre-processing from a very large training set; this information can then be used to summarize samples outside the training set. All subsequent pre-processing tasks can be done on an individual array basis. We demonstrate the utility of this approach by defining a new version of the Robust Multi-chip Averaging (RMA) algorithm, which we refer to as refRMA.

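The abstract does not spell out how the training-set information is stored or applied, so the following is only a minimal sketch of one plausible reading: a reference quantile distribution and per-probe effects are estimated once from the training arrays, and each later array is then normalized and summarized on its own against those frozen values. All function names, the `probesets` mapping, and the toy intensities are hypothetical. The probe-effect step is a single median sweep used as a simple stand-in for RMA's median polish, and background correction is omitted.

```python
from statistics import median


def reference_quantiles(training_arrays):
    """Mean of the sorted log2 intensities across training arrays, one value per rank."""
    sorted_arrays = [sorted(a) for a in training_arrays]
    n = len(sorted_arrays)
    return [sum(column) / n for column in zip(*sorted_arrays)]


def normalize_to_reference(array, ref_quantiles):
    """Quantile-normalize a single array: the k-th smallest probe gets the k-th reference value."""
    order = sorted(range(len(array)), key=lambda i: array[i])
    normalized = [0.0] * len(array)
    for rank, idx in enumerate(order):
        normalized[idx] = ref_quantiles[rank]
    return normalized


def probe_effects(normalized_training, probesets):
    """Estimate one effect per probe from the normalized training arrays.

    probesets maps a probeset name to the list of probe indices it contains.
    This is a one-sweep simplification, not a full median polish.
    """
    effects = [0.0] * len(normalized_training[0])
    for idxs in probesets.values():
        # Per-array expression level of this probeset: median over its probes.
        levels = [median(a[i] for i in idxs) for a in normalized_training]
        for i in idxs:
            # Probe effect: typical deviation of probe i from its probeset level.
            effects[i] = median(a[i] - lvl for a, lvl in zip(normalized_training, levels))
    return effects


def summarize_array(array, ref_quantiles, effects, probesets):
    """Summarize one new array using only the frozen training-set parameters."""
    norm = normalize_to_reference(array, ref_quantiles)
    return {name: median(norm[i] - effects[i] for i in idxs)
            for name, idxs in probesets.items()}


# Toy data (made up): two training arrays, six probes, two probesets.
probesets = {"g1": [0, 1, 2], "g2": [3, 4, 5]}
training = [
    [4.0, 5.0, 6.0, 8.5, 9.0, 9.5],
    [6.0, 7.0, 8.0, 10.5, 11.0, 11.5],
]

# Done once, on the training set.
ref = reference_quantiles(training)
effects = probe_effects([normalize_to_reference(a, ref) for a in training], probesets)

# Done per array, without reloading the training data.
new_array = [4.2, 5.1, 6.3, 8.4, 9.0, 9.9]
print(summarize_array(new_array, ref, effects, probesets))  # {'g1': 6.0, 'g2': 10.0}
```

In this sketch only `ref` and `effects` need to be kept after training, which is what lets each new array be processed individually and keeps its expression values independent of whichever other arrays happen to be processed alongside it.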

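Bibliographic record

Journal: BMC Bioinformatics
Authors: Simon Katz; Rafael A Irizarry; Xue Lin; Mark Tripputi; Mark W Porter
Year (volume), issue: 2006(7), -1
Year: 2006
Page: 464
Total pages: 11
Original format: PDF
Chinese Library Classification: applied microbiology; biochemical genetics; biochemical pharmacology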

