首页> 外文期刊>International journal of knowledge discovery in bioinformatics >BioTextRetriever:A Tool to Retrieve Relevant Papers
【24h】

BioTextRetriever:A Tool to Retrieve Relevant Papers

机译:BioTextRetriever:检索相关论文的工具

获取原文
获取原文并翻译 | 示例
       

摘要

Whenever new sequences of DNA or proteins have been decoded it is almost compulsory to look at similar sequences and papers describing those sequences in order to both collect relevant information concerning the function and activity of the new sequences and/or know what is known already about similar sequences. In current web sites and data bases of sequences there are, usually, a set of curated paper references linked to each sequence. Those links are a good starting point to look for relevant information related to a set of sequences. One way to implement such approach is to do a blast with the new decoded sequences, and collect similar sequences. Then one looks at the papers linked with the similar sequences. Most often the number of retrieved papers is small and one has to search large data bases for relevant papers. This paper proposes a process of generating a classifier based on the initially set of relevant papers. First, the authors collect similar sequences using an alignment algorithm like Blast. Then, the authors use the enlarges set of papers to construct a classifier. Finally a classifier is used to automatically enlarge the set of relevant papers by searching the MEDLINE using the automatically constructed classifier.
机译:每当对DNA或蛋白质的新序列进行了解码时,几乎必须查看相似的序列和描述这些序列的论文,以便收集有关新序列的功能和活性的相关信息和/或知道关于相似序列的已知信息序列。在当前的网站和序列数据库中,通常有一组与每个序列链接的精选论文参考。这些链接是寻找与一组序列相关的相关信息的良好起点。一种实现这种方法的方法是对新的解码序列进行快速处理,并收集相似的序列。然后,人们查看与相似序列相关的论文。通常,检索到的论文数量很少,并且必须在大型数据库中搜索相关论文。本文提出了一种基于最初的相关论文集生成分类器的过程。首先,作者使用比对算法(如Blast)收集相似的序列。然后,作者使用放大的论文集构建分类器。最后,通过使用自动构建的分类器搜索MEDLINE,使用分类器自动放大相关论文集。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号