...
首页> 外文期刊>Bioinformatics >Mining literature for protein-protein interactions.
【24h】

Mining literature for protein-protein interactions.

机译:蛋白质相互作用的采矿文献。

获取原文
获取原文并翻译 | 示例
           

摘要

MOTIVATION: A central problem in bioinformatics is how to capture information from the vast current scientific literature in a form suitable for analysis by computer. We address the special case of information on protein-protein interactions, and show that the frequencies of words in Medline abstracts can be used to determine whether or not a given paper discusses protein-protein interactions. For those papers determined to discuss this topic, the relevant information can be captured for the Database of Interacting PROTEINS: Furthermore, suitable gene annotations can also be captured. RESULTS: Our Bayesian approach scores Medline abstracts for probability of discussing the topic of interest according to the frequencies of discriminating words found in the abstract. More than 80 discriminating words (e.g. complex, interaction, two-hybrid) were determined from a training set of 260 Medline abstracts corresponding to previously validated entries in the Database of Interacting Proteins. Using these words and a log likelihood scoring function, approximately 2000 Medline abstracts were identified as describing interactions between yeast proteins. This approach now forms the basis for the rapid expansion of the Database of Interacting Proteins.
机译:动机:生物信息学中的一个中心问题是如何以适合计算机分析的形式从当前大量的科学文献中获取信息。我们处理有关蛋白质-蛋白质相互作用的信息的特殊情况,并显示Medline摘要中单词的频率可以用来确定给定的论文是否讨论蛋白质-蛋白质相互作用。对于那些确定要讨论该主题的论文,可以为相互作用蛋白质数据库捕获相关信息:此外,还可以捕获合适的基因注释。结果:我们的贝叶斯方法根据在摘要中发现单词的频率,对Medline摘要进行了讨论感兴趣主题的概率评分。从260套Medline摘要的训练集中确定了80个以上的区分词(例如,复杂,交互,两杂),该摘要与之前在相互作用蛋白数据库中验证的条目相对应。使用这些词和对数似然评分功能,鉴定出大约2000个Medline摘要来描述酵母蛋白之间的相互作用。现在,这种方法构成了快速扩展相互作用蛋白数据库的基础。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号