...
首页> 外文期刊>Bioinformatics >Predicting gene function through systematic analysis and quality assessment of high-throughput data
【24h】

Predicting gene function through systematic analysis and quality assessment of high-throughput data

机译:通过系统分析和高通量数据质量评估来预测基因功能

获取原文
获取原文并翻译 | 示例

摘要

Motivation: Determining gene function is an important challenge arising from the availability of whole genome sequences. Until recently, approaches based on sequence homology were the only high-throughput method for predicting gene function. Use of high-throughput generated experimental data sets for determining gene function has been limited for several reasons.Results: Here a new approach is presented for integration of high-throughput data sets, leading to prediction of function based on relationships supported by multiple types and sources of data. This is achieved with a database containing 125 different high-throughput data sets describing phenotypes, cellular localizations, protein interactions and mRNA expression levels from Saccharomyces cerevisiae, using a bit-vector representation and information content-based ranking. The approach takes characteristic and qualitative differences between the data sets into account, is highly flexible, efficient and scalable. Database queries result in predictions for 543 uncharacterized genes, based on multiple functional relationships each supported by at least three types of experimental data. Some of these are experimentally verified, further demonstrating their reliability. The results also generate insights into the relative merits of different data types and provide a coherent framework for functional genomic datamining.
机译:动机:确定基因功能是整个基因组序列可用性的重要挑战。直到最近,基于序列同源性的方法还是预测基因功能的唯一高通量方法。由于以下几个原因,限制了使用高通量生成的实验数据集来确定基因功能。结果:这里提出了一种新的方法,用于整合高通量数据集,从而可以基于多种类型和多种类型支持的关系来预测功能数据来源。这是通过一个数据库实现的,该数据库包含125个不同的高通量数据集,这些数据集使用位向量表示法和基于信息内容的排名来描述啤酒酵母的表型,细胞定位,蛋白质相互作用和mRNA表达水平。该方法考虑了数据集之间的特征和质量差异,具有高度的灵活性,效率和可扩展性。数据库查询可基于多种功能关系对543个未表征的基因进行预测,每种功能关系均受至少三种类型的实验数据支持。其中一些经过实验验证,进一步证明了其可靠性。结果还产生了对不同数据类型相对优点的见解,并为功能基因组数据挖掘提供了一个一致的框架。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号