Analyzing large biological datasets with association networks

Byung H. Park; Edward C. Uberbacher; Tatiana V. Karpinets

Abstract

Due to advances in high-throughput biotechnologies, biological information is being collected in databases at an amazing rate, requiring novel computational approaches that process collected data into new knowledge in a timely manner. In this study, we propose a computational framework for discovering modular structure, relationships and regularities in complex data. The framework utilizes a semantic-preserving vocabulary to convert records of biological annotations of an object, such as an organism, gene, chemical or sequence, into networks (Anets) of the associated annotations. An association between a pair of annotations in an Anet is determined by the similarity of their co-occurrence pattern with all other annotations in the data. This feature captures associations between annotations that do not necessarily co-occur with each other and facilitates discovery of the most significant relationships in the collected data through clustering and visualization of the Anet. To demonstrate this approach, we applied the framework to the analysis of metadata from the Genomes OnLine Database and produced a biological map of sequenced prokaryotic organisms with three major clusters of metadata that represent pathogens, environmental isolates and plant symbionts.
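The association rule described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the abstract does not name a similarity measure, so cosine similarity is assumed here, and the function names, the threshold of 0.8 and the toy records are all hypothetical choices made for illustration only.

```python
from itertools import combinations
from math import sqrt


def cooccurrence_profiles(records):
    """Count how often each annotation co-occurs with every other annotation."""
    vocab = sorted({a for rec in records for a in rec})
    counts = {a: {b: 0 for b in vocab} for a in vocab}
    for rec in records:
        for a, b in combinations(sorted(set(rec)), 2):
            counts[a][b] += 1
            counts[b][a] += 1
    return vocab, counts


def profile_similarity(vocab, counts, a, b):
    """Cosine similarity (an assumed measure) of the co-occurrence profiles
    of a and b, compared over all other annotations in the data."""
    others = [c for c in vocab if c not in (a, b)]
    va = [counts[a][c] for c in others]
    vb = [counts[b][c] for c in others]
    dot = sum(x * y for x, y in zip(va, vb))
    norm_a = sqrt(sum(x * x for x in va))
    norm_b = sqrt(sum(y * y for y in vb))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def build_anet(records, threshold=0.8):
    """Return {(a, b): similarity} for pairs at or above the (hypothetical) threshold."""
    vocab, counts = cooccurrence_profiles(records)
    edges = {}
    for a, b in combinations(vocab, 2):
        sim = profile_similarity(vocab, counts, a, b)
        if sim >= threshold:
            edges[(a, b)] = sim
    return edges


# Toy records (invented, not taken from the Genomes OnLine Database).
# Each record is the set of annotations attached to one object.
records = [
    {"pathogen", "host:human", "aerobe"},
    {"pathogen", "host:human", "mesophile"},
    {"virulent", "host:human", "aerobe"},
    {"virulent", "host:human", "mesophile"},
    {"soil", "thermophile"},
]

vocab, counts = cooccurrence_profiles(records)
edges = build_anet(records)
```

In this toy data, "pathogen" and "virulent" never appear in the same record, yet they are linked in `edges` because both co-occur with the same other annotations in the same proportions. This is the property the abstract highlights: an association does not require direct co-occurrence.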

Bibliographic details

Source
Nucleic Acids Research, 2012, Issue 17, 1 page in total
Authors
Byung H. Park; Edward C. Uberbacher; Tatiana V. Karpinets
Original format: PDF
Chinese Library Classification: AB

Similar literature

1. SuperMIC: Analyzing Large Biological Datasets in Bioinformatics with Maximal Information Coefficient [J]. Chao Wang, Dong Dai, Xi Li. IEEE/ACM Transactions on Computational Biology and Bioinformatics. 2017, Issue 4
2. Analyzing Biological Process on Gene Expression Datasets Using Heuristic Search [J]. P. M. Booma, Dr. S. Prabhakaran. Journal of Theoretical and Applied Information Technology. 2013, Issue 3
3. PAST: The Pathway Association Studies Tool to Infer Biological Meaning from GWAS Datasets [J]. Progress in Artificial Intelligence. 2020, Issue 1
4. Analyzing Factors, Construction of Dataset, Estimating Importance of Factor, and Generation of Association Rules for Indian Road Accident [C]. Suwarna Gothane, M. V. Sarode. IEEE International Conference on Advanced Computing. 2016
5. Querying large biological network datasets [D]. Gulsoy, Gunhan. 2013
6. Analyzing large biological datasets with association networks [O]. Tatiana V. Karpinets, Byung H. Park, Edward C. Uberbacher. 2012
7. Identifying biological associations from high-throughput datasets [O]. Barghash Ahmad. 2015
