首页> 外文期刊>Bioinformatics >SciMiner: web-based literature mining tool for target identification and functional enrichment analysis
【24h】

SciMiner: web-based literature mining tool for target identification and functional enrichment analysis

机译:SciMiner:基于网络的文献挖掘工具,用于目标识别和功能丰富分析

获取原文
获取原文并翻译 | 示例
       

摘要

SciMiner is a web-based literature mining and functional analysis tool that identifies genes and proteins using a context specific analysis of MEDLINE abstracts and full texts. SciMiner accepts a free text query (PubMed Entrez search) or a list of PubMed identifiers as input. SciMiner uses both regular expression patterns and dictionaries of gene symbols and names compiled from multiple sources. Ambiguous acronyms are resolved by a scoring scheme based on the co-occurrence of acronyms and corresponding description terms, which incorporates optional user-defined filters. Functional enrichment analyses are used to identify highly relevant targets (genes and proteins), GO (Gene Ontology) terms, MeSH (Medical Subject Headings) terms, pathways and protein-protein interaction networks by comparing identified targets from one search result with those from other searches or to the full HGNC [HUGO (Human Genome Organization) Gene Nomenclature Committee] gene set. The performance of gene/protein name identification was evaluated using the BioCreAtIvE (Critical Assessment of Information Extraction systems in Biology) version 2 (Year 2006) Gene Normalization Task as a gold standard. SciMiner achieved 87.1% recall, 71.3% precision and 75.8% F-measure. SciMiner's literature mining performance coupled with functional enrichment analyses provides an efficient platform for retrieval and summary of rich biological information from corpora of users' interests. AVAILABILITY: http://jdrf.neurology.med.umich.edu/SciMiner/. A server version of the SciMiner is also available for download and enables users to utilize their institution's journal subscriptions. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
机译:SciMiner是一个基于Web的文献挖掘和功能分析工具,可以使用MEDLINE摘要和全文的上下文特定分析来识别基因和蛋白质。 SciMiner接受自由文本查询(PubMed Entrez搜索)或PubMed标识符列表作为输入。 SciMiner使用正则表达式模式以及从多个来源编译的基因符号和名称的字典。不明确的首字母缩略词是根据首字母缩略词和相应描述术语的共现而采用的计分方案来解决的,该方案结合了可选的用户定义过滤器。通过将一个搜索结果中的目标与其他搜索结果中的目标进行比较,功能丰富的分析可用于识别高度相关的目标(基因和蛋白质),GO(基因本体论)术语,MeSH(医学主题词)术语,途径和蛋白质-蛋白质相互作用网络搜索或搜索完整的HGNC [HUGO(人类基因组组织)基因命名委员会]基因集。使用BioCreAtIvE(生物学信息提取系统的关键评估)第2版(2006年)基因归一化任务作为金标准,评估了基因/蛋白质名称鉴定的性能。 SciMiner实现了87.1%的召回率,71.3%的精度和75.8%的F测量。 SciMiner的文献挖掘性能结合功能丰富的分析提供了一个有效的平台,可以从用户兴趣集中检索和汇总丰富的生物信息。可用性:http://jdrf.neurology.med.umich.edu/SciMiner/。还可以下载SciMiner的服务器版本,并使用户能够利用其机构的期刊订阅。补充信息:补充数据可从Bioinformatics在线获得。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号