Journal of Biomedical Semantics

GO2PUB: Querying PubMed with semantic expansion of gene ontology terms



Abstract

Background: With the development of high-throughput methods of gene analysis, there is a growing need for mining tools that retrieve relevant articles from PubMed. As PubMed grows, literature searches become more complex and time-consuming, so automated search tools with good precision and recall are necessary. We developed GO2PUB to automatically enrich PubMed queries with the names, symbols and synonyms of genes annotated by a GO term of interest or by one of its descendants.

Results: GO2PUB enriches PubMed queries based on selected GO terms and keywords. It processes the result and displays the PMID, title, authors, abstract and bibliographic references of the articles. The gene names, symbols and synonyms generated as extra keywords from the GO terms are also highlighted. GO2PUB is based on a semantic expansion of PubMed queries that uses the semantic inheritance between terms through the GO graph. Two experts manually assessed the relevance of the results of GO2PUB, GoPubMed and PubMed on three queries about lipid metabolism. The experts' agreement was high (kappa = 0.88). GO2PUB returned 69% of the relevant articles, GoPubMed 40% and PubMed 29%. GO2PUB and GoPubMed have 17% of their results in common, corresponding to 24% of the total number of relevant results. Of the articles returned by more than one tool, 70% were relevant. Of the relevant articles, 36% were returned only by GO2PUB, 17% only by GoPubMed and 14% only by PubMed. To determine whether these results can be generalized, we generated twenty queries based on random GO terms with a granularity similar to that of the first three queries, and compared the proportions of GO2PUB and GoPubMed results. These were 77% and 40%, respectively, for the first queries, and 70% and 38% for the random queries. The two experts also assessed the relevance of seven of the twenty queries (the three related to lipid metabolism and four related to other domains). Expert agreement was high (0.93 and 0.8). GO2PUB and GoPubMed performances were similar to those of the first queries.

Conclusions: We demonstrated that using the genes annotated by either the GO terms of interest or a descendant of these GO terms yields some relevant articles ignored by other tools. The comparison of GO2PUB, based on semantic expansion, with GoPubMed, based on text-mining techniques, showed that the two tools are complementary. The analysis of the randomly generated queries suggests that the results obtained for lipid metabolism can be generalized to other biological processes. GO2PUB is available at http://go2pub.genouest.org.
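
The semantic expansion described in the abstract can be illustrated with a minimal sketch. It is not the GO2PUB implementation: the GO identifiers, gene names, the in-memory dictionaries and the function names `descendants`, `gene_keywords` and `build_query` are all hypothetical, and joining gene keywords with OR before combining them with the user keywords via AND is an assumption about how such a query might be assembled.

```python
from collections import deque

# Hypothetical toy GO graph: each term maps to its direct children.
GO_CHILDREN = {
    "GO:ROOT": ["GO:MID_A", "GO:MID_B"],
    "GO:MID_A": ["GO:LEAF"],
    "GO:MID_B": ["GO:LEAF"],
    "GO:LEAF": [],
}

# Hypothetical annotations: each term maps to genes, each gene being a
# tuple of its name, symbol and synonyms.
ANNOTATIONS = {
    "GO:MID_A": [("geneA", "GA", "alpha")],
    "GO:LEAF": [("geneB", "GB")],
}


def descendants(term, children):
    """Return the term itself plus every term reachable below it."""
    seen = {term}
    queue = deque([term])
    while queue:
        current = queue.popleft()
        for child in children.get(current, []):
            if child not in seen:  # a term may have several parents
                seen.add(child)
                queue.append(child)
    return seen


def gene_keywords(terms, annotations):
    """Collect names, symbols and synonyms of genes annotated by any term."""
    words = set()
    for term in terms:
        for gene in annotations.get(term, []):
            words.update(gene)
    return sorted(words)


def build_query(go_term, user_keywords, children, annotations):
    """Assemble a Boolean query string (assumed form: genes OR-ed, then AND-ed with keywords)."""
    genes = gene_keywords(descendants(go_term, children), annotations)
    gene_clause = " OR ".join(f'"{g}"' for g in genes)
    keyword_clause = " AND ".join(f'"{k}"' for k in user_keywords)
    clauses = [f"({c})" for c in (gene_clause, keyword_clause) if c]
    return " AND ".join(clauses)


if __name__ == "__main__":
    print(build_query("GO:ROOT", ["liver"], GO_CHILDREN, ANNOTATIONS))
```

Starting from `GO:ROOT`, the sketch also picks up the genes annotated only by descendant terms, which is the step that lets an expanded query reach articles a query on the term name alone would miss.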
