...
首页> 外文期刊>Methods: A Companion to Methods in Enzymology >An approach to improve kernel-based Protein-Protein Interaction extraction by learning from large-scale network data
【24h】

An approach to improve kernel-based Protein-Protein Interaction extraction by learning from large-scale network data

机译:从大规模网络数据学习改善基于核蛋白质 - 蛋白质相互作用提取的方法

获取原文
获取原文并翻译 | 示例
           

摘要

Protein-Protein Interaction extraction (PPIe) from biomedical literatures is an important task in biomedi-cal text mining and has achieved desirable results on the annotated datasets. However, the traditional machine learning methods on PPIe suffer badly from vocabulary gap and data sparseness, which weakens classification performance. In this work, an approach capturing external information from the web-based data is introduced to address these problems and boost the existing methods. The approach involves three kinds of word representation techniques: distributed representation, vector clustering and Brown clusters. Experimental results show that our method outperforms the state-of-the-art methods on five publicly available corpora. Our code and data are available at: http://chaoslog.com/improving-kernel-based-protein-protein-interaction-extraction-by-unsupervised-word-representation-codes-and-data.html.
机译:来自生物医学文献的蛋白质 - 蛋白质相互作用提取(PPIE)是BioMeDi-Cal文本挖掘中的重要任务,并在注释的数据集中实现了所需的结果。 然而,PPIE上的传统机器学习方法与词汇差距和数据稀疏性遭受严重影响,这削弱了分类性能。 在这项工作中,引入了一种从基于Web的数据捕获外部信息的方法来解决这些问题并提高现有方法。 该方法涉及三种单词表示技术:分布式表示,矢量聚类和棕色集群。 实验结果表明,我们的方法优于五大公共集团的最先进的方法。 我们的代码和数据可用于:http://chaoslog.com/improving-kernel-based-protein-pritein-interaction-extration-by-unsupervised-word -representation-codes-and-data.html。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号