Extraction of correlated gene clusters from multiple genomic data by generalized kernel canonical correlation analysis

机译：通过广义核各种典型相关分析从多基因组数据中提取相关基因簇

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Motivation: A major issue in computational biology is the reconstruction of pathways from several genomic datasets, such as expression data, protein interaction data and phylogenetic profiles. As a first step toward this goal, it is important to investigate the amount of correlation which exists between these data. Method: We present new methods to measure the correlation between several heterogeneous datasets, and to extract sets of genes which share similarities with respect to multiple biologicalattributes. The originality of our approach is the extension of the concept of correlation for non-vectorial data, which is made possible by the use of generalized kernel canonical correlation analysis (KCCA), and the method we propose to extract groupsof genes responsible for the detected correlations. Moreover, two variants of KCCA are proposed when more than two datasets are available. Result: These methods are successfully tested on their ability to recognize operons in the Escherichia coli genome, from the comparison of three datasets corresponding to functional relationships between genes in metabolic pathways, geometrical relationships along the chromosome, and co-expression relationships as observed by gene expression data.

机译：动机：计算生物学中的主要问题是从几种基因组数据集的途径重建，例如表达数据，蛋白质相互作用数据和系统发育谱。作为实现这一目标的第一步，重要的是要调查这些数据之间存在的相关量。方法：我们提出了测量几种异质数据集之间的相关性的新方法，并提取与多种生物疏远的相似性的基因组。我们的方法的原创性是扩展非矢量数据的相关概念，这是通过使用广义核心规范相关分析（KCCA）来实现的，以及我们提出提取负责检测相关的基因的基因的方法。此外，当有两个以上的数据集可用时，提出了两种KCCA的变型。结果：这些方法在其识别大肠杆菌基因组中识别操纵子的能力，从对应于代谢途径中基因之间的功能关系的三个数据集，涉及染色体的几何关系以及基因观察到的共表达关系表达数据。

著录项

来源
《International Conference on Intelligent Systems for Molecular biology》|2003年||共8页
会议地点
作者
Y. Yamanishi; J.-P. Vert; A. Nakaya; M. Kanehisa;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 Q811.4-532;
关键词

相似文献

外文文献
中文文献
专利

1. Comparison Of Canonical Correlation Analysis And The Generalized Canonical Correlation Analysis Using The Lognormal And Cauchy Distributed Data [J] . S. I. ONYEAGU, G. A. OSUJI, O.M. ONYIA Mathematical Theory and Modeling . 2014,第5期

机译：使用对数正态和柯西分布数据进行典范相关分析和广义典范相关分析的比较
2. Kernel canonical correlation analysis for data combination of multiple-source datasets [J] . Masaki Mitsuhiro, Takahiro Hoshino Japanese Journal of Statistics and Data Science . 2020,第2期

机译：多源数据集数据组合的内核规范相关分析
3. Multi-group analysis using generalized additive kernel canonical correlation analysis [J] . Eunseong Bae, Ji-Won Hur, Jinyoung Kim, Scientific reports. . 2020,第1期

机译：多群分析使用广义添加剂核典型相关分析
4. Extraction of correlated gene clusters from multiple genomic data by generalized kernel canonical correlation analysis [C] . Y. Yamanishi, J.-P. Vert, A. Nakaya, International Conference on Intelligent Systems for Molecular biology . 2003

机译：通过广义核各种典型相关分析从多基因组数据中提取相关基因簇
5. Multiple Kernel Learning for Gene Prioritization, Clustering, and Functional Enrichment Analysis. [D] . Millis, David H. 2014

机译：用于基因优先级，聚类和功能丰富分析的多核学习。
6. Multi-block Analysis of Genomic Data Using Generalized Canonical Correlation Analysis [O] . Inyoung Jun, Wooree Choi, Mira Park 2018

机译：使用广义典范相关分析对基因组数据进行多块分析
7. Extraction of correlated gene clusters from multiple genomic data by generalized kernel canonical correlation analysis [O] . Y. Yamanishi, J.-P. Vert, A. Nakaya, 2003

机译：通过广义核各种典型相关分析从多基因组数据中提取相关基因簇
8. Canonical Correlations and Generalized SVD (Singular Value Decomposition): Applications and New Algorithms [R] . Ewerbring, L. M., Luk, F. T. 1988

机译：典型相关和广义sVD（奇异值分解）：应用和新算法

Extraction of correlated gene clusters from multiple genomic data by generalized kernel canonical correlation analysis

摘要

著录项

相似文献

相关主题

期刊订阅