...
首页> 外文期刊>BMC Bioinformatics >Combining active learning and semi-supervised learning techniques to extract protein interaction sentences
【24h】

Combining active learning and semi-supervised learning techniques to extract protein interaction sentences

机译:结合主动学习和半监督学习技术提取蛋白质互动句

获取原文
           

摘要

BackgroundProtein-protein interaction (PPI) extraction has been a focal point of many biomedical research and database curation tools. Both Active Learning and Semi-supervised SVMs have recently been applied to extract PPI automatically. In this paper, we explore combining the AL with the SSL to improve the performance of the PPI task.MethodsWe propose a novel PPI extraction technique called PPISpotter by combining Deterministic Annealing-based SSL and an AL technique to extract protein-protein interaction. In addition, we extract a comprehensive set of features from MEDLINE records by Natural Language Processing (NLP) techniques, which further improve the SVM classifiers. In our feature selection technique, syntactic, semantic, and lexical properties of text are incorporated into feature selection that boosts the system performance significantly.ResultsBy conducting experiments with three different PPI corpuses, we show that PPISpotter is superior to the other techniques incorporated into semi-supervised SVMs such as Random Sampling, Clustering, and Transductive SVMs by precision, recall, and F-measure.ConclusionsOur system is a novel, state-of-the-art technique for efficiently extracting protein-protein interaction pairs.
机译:背景蛋白 - 蛋白质相互作用(PPI)提取是许多生物医学研究和数据库策择工具的焦点。最近已应用了主动学习和半监督SVM,以自动提取PPI。在本文中,我们将Al与SSL组合以改善PPI任务的性能。通过组合基于确定性退火的SSL和AL技术提取蛋白质 - 蛋白质相互作用,提出一种新的PPI提取技术。此外,我们通过自然语言处理(NLP)技术从Medline记录中提取了一系列全面的功能,从而进一步改进了SVM分类器。在我们的特征选择技术中,文本的句法,语义和词汇属性被纳入特征选择,显着提高了系统性能。我们认为PPISPOTETER与结合到半的其他技术优于其他技术通过精确,召回和F测量,监督SVM等随机采样,聚类和转换SVMS,是一种新颖的技术,用于有效地提取蛋白质 - 蛋白质相互作用对。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号