
Uncertainty sampling-based active learning for protein-protein interaction extraction from biomedical literature



Abstract

Protein-protein interaction (PPI) extraction from biomedical literature has become a research focus as the volume of that literature grows rapidly. Many methods have been proposed for PPI extraction, including natural language processing techniques and machine learning approaches. One problem with applying machine learning to PPI extraction is that large amounts of data are available, but the cost of labeling them correctly prohibits their use. To reduce human labeling effort while maintaining PPI extraction performance, this paper presents an uncertainty sampling-based active learning method (USAL), built on a lexical feature-based SVM model, that tags the most informative unlabeled samples. In addition, some specific samples are ignored to speed up the learning process while maintaining the desired accuracy. Experimental results on the AIMED and CB corpora show that the method can reduce the labeling effort by 40% and 20%, respectively, without degrading performance.
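The selection step at the heart of uncertainty sampling can be illustrated with a minimal sketch. The abstract does not give the paper's exact uncertainty measure or the rule for ignoring samples, so this example assumes a common margin-based criterion: the unlabeled samples whose signed SVM output is closest to zero are treated as the most informative. The function name `select_most_uncertain`, its parameters, and the example scores are hypothetical.

```python
from typing import List, Sequence


def select_most_uncertain(decision_scores: Sequence[float], k: int) -> List[int]:
    """Return indices of the k unlabeled samples closest to the decision boundary.

    decision_scores[i] is assumed to be the signed SVM output f(x_i) for
    unlabeled sample i; a small |f(x_i)| means the classifier is unsure.
    This margin-based criterion is an assumption, not the paper's stated rule.
    """
    if k < 0:
        raise ValueError("k must be non-negative")
    # Rank sample indices by distance from the hyperplane, nearest first.
    ranked = sorted(range(len(decision_scores)), key=lambda i: abs(decision_scores[i]))
    return ranked[:k]


# Hypothetical scores for five unlabeled candidate protein pairs.
scores = [1.8, -0.1, 0.6, -2.3, 0.05]
query = select_most_uncertain(scores, k=2)
print(query)  # indices of the two samples to send to the annotator
```

In a full active learning loop, the selected samples would be labeled by an annotator, added to the training set, and the SVM retrained before the next round of selection.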

Bibliographic details

  • Source
    Expert Systems with Applications, 2009, No. 7, pp. 10344-10350 (7 pages)
  • Author affiliations

    Department of Computer Science and Engineering, Dalian University of Technology, No. 2 LingGong Road, ShaHeKou District, Dalian 116023, China;

    Department of Computer Science and Engineering, Dalian University of Technology, No. 2 LingGong Road, ShaHeKou District, Dalian 116023, China;

    Department of Computer Science and Engineering, Dalian University of Technology, No. 2 LingGong Road, ShaHeKou District, Dalian 116023, China;

  • Original format: PDF
  • Language: English
  • Keywords

    active learning; uncertainty sampling; protein-protein interaction extraction;

