BMC Bioinformatics

Pharmspresso: a text mining tool for extraction of pharmacogenomic concepts and relationships from full text

Abstract

Background: Pharmacogenomics studies the relationship between genetic variation and variation in drug response phenotypes. The field is rapidly gaining importance because it promises drugs targeted to particular subpopulations based on genetic background. The pharmacogenomics literature has expanded rapidly but is dispersed across many journals. It is therefore challenging to identify important associations between drugs and molecular entities, particularly genes and gene variants, and these critical connections are often lost. Text mining techniques can convert free text into a computable, searchable format in which pharmacogenomic concepts (such as genes, drugs, polymorphisms, and diseases) are identified and important links between these concepts are recorded. Availability of full text articles as input to text mining engines is key, as literature abstracts often do not contain sufficient information to identify these pharmacogenomic associations.

Results: Building on a tool called Textpresso, we have created the Pharmspresso tool to assist in identifying important pharmacogenomic facts in full text articles. Pharmspresso parses text to find references to human genes, polymorphisms, drugs, and diseases, and their relationships. It presents these as a series of marked-up text fragments in which key concepts are visually highlighted. To evaluate Pharmspresso, we used a gold standard of 45 human-curated articles. Pharmspresso identified 78%, 61%, and 74% of target gene, polymorphism, and drug concepts, respectively.

Conclusion: Pharmspresso is a text analysis tool that extracts pharmacogenomic concepts from the literature automatically and thus captures our current understanding of gene-drug interactions in a computable form. We have made Pharmspresso available at http://pharmspresso.stanford.edu.
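The evaluation figures above can be read as per-category recall: the share of gold-standard concepts that the tool recovered. The following is a minimal sketch of that idea, not Pharmspresso's actual implementation, which this abstract does not describe. It assumes a simple lexicon lookup plus a regular expression for dbSNP-style rs identifiers; the lexicon contents, the sample text, the gold annotations, and the function names `tag_concepts` and `recall_by_category` are all hypothetical and chosen only for illustration.

```python
import re

# Hypothetical toy lexicons; a real system would load curated vocabularies.
LEXICON = {
    "gene": {"VKORC1", "TPMT", "CYP2D6"},
    "drug": {"warfarin", "codeine", "azathioprine"},
}

# Assumed pattern for dbSNP-style identifiers such as rs9923231.
RSID_PATTERN = re.compile(r"\brs\d+\b")


def tag_concepts(text):
    """Return the set of (category, term) pairs found in the text."""
    found = set()
    for category, terms in LEXICON.items():
        for term in terms:
            # Whole-word, case-insensitive match against the lexicon entry.
            if re.search(r"\b" + re.escape(term) + r"\b", text, re.IGNORECASE):
                found.add((category, term))
    for rsid in RSID_PATTERN.findall(text):
        found.add(("polymorphism", rsid))
    return found


def recall_by_category(gold, predicted):
    """Fraction of gold-standard concepts recovered, per category."""
    result = {}
    for category in {c for c, _ in gold}:
        gold_terms = {t for c, t in gold if c == category}
        hit_terms = {t for c, t in predicted if c == category}
        result[category] = len(gold_terms & hit_terms) / len(gold_terms)
    return result


# Illustrative text and made-up gold annotations (not from the paper).
text = (
    "Patients carrying VKORC1 variant rs9923231 required lower warfarin doses. "
    "CYP2C9 genotype also influenced the dose."
)
gold = {
    ("gene", "VKORC1"),
    ("gene", "CYP2C9"),  # absent from the toy lexicon, so it is missed
    ("drug", "warfarin"),
    ("polymorphism", "rs9923231"),
}

predicted = tag_concepts(text)
recall = recall_by_category(gold, predicted)
for category in sorted(recall):
    print(f"{category}: {recall[category]:.0%}")
```

In this toy run the gene CYP2C9 is missed because it is not in the lexicon, so gene recall is lower than drug or polymorphism recall. This is the same kind of gap that the per-category percentages in the abstract summarize across the 45 curated articles.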