Comparison of Methods for Meta-dimensional Data Analysis Using in Silico and Biological Data Sets

Abstract

Recent technological innovations have catalyzed the generation of a massive amount of data at various levels of biological regulation, including DNA, RNA and protein. Due to the complex nature of biology, the underlying model may only be discovered by integrating different types of high-throughput data to perform a "meta-dimensional" analysis. For this study, we used simulated gene expression and genotype data to compare three methods that show potential for integrating different types of data in order to generate models that predict a given phenotype: the Analysis Tool for Heritable and Environmental Network Associations (ATHENA), Random Jungle (RJ), and Lasso. Based on our results, we applied RJ and ATHENA sequentially to a biological data set that consisted of genome-wide genotypes and gene expression levels from lymphoblastoid cell lines (LCLs) to predict cytotoxicity. The best model consisted of two SNPs and two gene expression variables with an r-squared value of 0.32.

Bibliographic details

Source
"Evolutionary computation, machine learning and data mining in bioinformatics." | 2012 | pp. 134-143 | 10 pages
Conference location: Malaga (ES)
Authors
Emily R. Holzinger; Scott M. Dudek; Alex T. Frase; Brooke Fridley; Prabhakar Chalise; Marylyn D. Ritchie;
Author affiliations

Center for Human Genetics Research, Vanderbilt University, Nashville, TN, USA;

Center for Human Genetics Research, Vanderbilt University, Nashville, TN, USA;

Center for Systems Genomics, Pennsylvania State University, University Park, PA, USA;

Division of Biomedical Statistics and Informatics, Mayo Clinic College of Medicine, Rochester, MN, USA;

Division of Biomedical Statistics and Informatics, Mayo Clinic College of Medicine, Rochester, MN, USA;

Center for Systems Genomics, Pennsylvania State University, University Park, PA, USA;

Original format: PDF
Language: English (eng)
Chinese Library Classification: Programming, software engineering
Keywords
systems biology; neural networks; evolutionary computation; data integration; human genetics;
