Journal: Journal of Biomedical Informatics

ProNormz - An integrated approach for human proteins and protein kinases normalization



Abstract

Recognizing and normalizing protein name mentions in biomedical literature is challenging, and it is important for text mining applications such as protein-protein interaction extraction, pathway reconstruction and many more. In this paper, we present ProNormz, an integrated approach for tagging and normalizing human proteins (HPs). In Homo sapiens, a large number of biological processes are regulated through post-translational phosphorylation by a large human gene family called protein kinases. Recognition and normalization of human protein kinases (HPKs) is considered important for extracting the underlying information on their regulatory mechanisms from biomedical literature. Besides tagging and normalization, ProNormz distinguishes HPKs from other HPs. To our knowledge, ProNormz is the first available normalization system that distinguishes HPKs from other HPs in addition to performing the gene normalization task. ProNormz incorporates a specialized synonyms dictionary for human proteins and protein kinases, a set of 15 string matching rules and a disambiguation module to achieve the normalization. Experimental results on the benchmark BioCreative II training and test datasets show that our integrated approach achieves a fairly good performance and outperforms more sophisticated semantic similarity and disambiguation systems presented in the BioCreative II GN task. As a freely available web tool, ProNormz is useful to developers as an extensible gene normalization implementation, to researchers as a standard for comparing their innovative techniques, and to biologists for normalization and categorization of HP and HPK mentions in biomedical literature. URL: http://www.biominingbu.org/pronormz.
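To make the dictionary-plus-rules idea concrete, here is a minimal Python sketch of dictionary-based normalization that also labels each hit as HP or HPK. It is not the ProNormz implementation: the abstract does not list the 15 string matching rules or describe the disambiguation module, so the three rules in `canonical`, the toy `SYNONYMS` dictionary, and the placeholder identifiers `ID_A` and `ID_B` are all assumptions made for illustration.

```python
import re

# Hypothetical toy dictionary: identifier -> (synonyms, is_kinase flag).
# The identifiers are placeholders, not real database IDs.
SYNONYMS = {
    "ID_A": (["mitogen-activated protein kinase 1", "MAPK1", "ERK2"], True),
    "ID_B": (["tumor protein p53", "TP53", "p53"], False),
}


def canonical(name: str) -> str:
    """Apply three illustrative string matching rules (assumed, not from the paper)."""
    name = name.lower()                        # rule 1: ignore case
    name = re.sub(r"[-_/]", " ", name)         # rule 2: treat hyphens, underscores, slashes as spaces
    name = re.sub(r"[^a-z0-9 ]", "", name)     # drop any remaining punctuation
    # rule 3: split at letter/digit boundaries, so "mapk1" and "mapk 1" match
    name = re.sub(r"(?<=[a-z])(?=[0-9])|(?<=[0-9])(?=[a-z])", " ", name)
    return " ".join(name.split())              # collapse repeated whitespace


def build_index(synonyms):
    """Map each canonicalized synonym to the set of identifiers that use it."""
    index = {}
    for ident, (names, _is_kinase) in synonyms.items():
        for n in names:
            index.setdefault(canonical(n), set()).add(ident)
    return index


def normalize(mention, index, synonyms):
    """Return (identifier, 'HPK' or 'HP'), or None if unknown or ambiguous."""
    ids = index.get(canonical(mention), set())
    if len(ids) != 1:
        # A real system would hand ambiguous mentions to a disambiguation step.
        return None
    ident = next(iter(ids))
    return ident, "HPK" if synonyms[ident][1] else "HP"


INDEX = build_index(SYNONYMS)
print(normalize("MAPK-1", INDEX, SYNONYMS))
print(normalize("P53", INDEX, SYNONYMS))
```

The sketch returns `None` for mentions that map to zero or several identifiers; that is the point where a context-based disambiguation module, such as the one the abstract mentions, would take over.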
