首页> 外文会议>International Workshop on Data Integration in the Life Sciences >The Biozon System for Complex Analysis of Heterogeneous Interrelated Biological Data and Discovery of Emergent Structures
【24h】

The Biozon System for Complex Analysis of Heterogeneous Interrelated Biological Data and Discovery of Emergent Structures

机译:复杂分析异质相互关联生物数据的生物兴系统和突出结构的发现

获取原文

摘要

Biological entities are strongly related and mutually dependent on each other. Therefore, there is a growing need to corroborate and integrate data from different resources and aspects of biological systems in order to analyze them effectively. To identify entities, existing databases use explicit references by accession number or a mutual ontology. Some databases relate and cross link elements from other databases based on these identifiers. However, this information is very partial and is not readily available in some. Moreover, these links are not established in coordination with the other linked databases. With the source databases changing rapidly, this leads to problems of consistency and updatability. Furthermore, it is hard to query this wealth of data in ways that can benefit and exploit the mutual dependency between entities. Biozon is a unified biological database that integrates heterogeneous data types and the relationships between them, such as nucleic acid sequences, proteins, structures, protein domains and protein families, protein-protein interactions and cellular pathways, into a single extensive schema. This schema allows one to see each data instance in its full biological context. More importantly it allows for complex searches that span multiple data types from a heterogeneous set of sources and for arbitrary computations on that data. Biozon can also rank results, the same way Google ranks web documents, and uses similarity relationships to extend query results to similar biological entities.
机译:生物实体彼此强烈相关和相互依赖。因此,越来越需要证实和整合来自生物系统的不同资源和方面的数据,以便有效地分析它们。要识别实体,现有数据库使用登录号或共同本体使用显式引用。某些数据库基于这些标识符与其他数据库相关联的链接元素。但是,此信息非常偏为偏袒,并且在某些情况下不易获得。此外,这些链接不与其他链接数据库协调。使用源数据库快速更改,这会导致一致性和更新性的问题。此外,很难以可以使用和利用实体之间的相互依赖性的方式查询这一大量数据。 Biozon是一个统一的生物数据库,其整合异质数据类型和它们之间的关系,例如核酸序列,蛋白质,结构,蛋白质结构域和蛋白质系列,蛋白质 - 蛋白质相互作用和细胞途径,进入一个广泛的模式。此模式允许人们在其完整的生物学上查看每个数据实例。更重要的是,它允许复杂的搜索从异构源集和对该数据的任意计算跨越多个数据类型。 Biozon还可以等级结果,同样的方式谷歌排名Web文档,并使用相似关系将查询结果扩展到类似的生物实体。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号