首页> 外文期刊>Journal of Biomedical Semantics >Multiple kernels learning-based biological entity relationship extraction method
【24h】

Multiple kernels learning-based biological entity relationship extraction method

机译:基于多核学习的生物实体关系提取方法

获取原文
           

摘要

BackgroundAutomatic extracting protein entity interaction information from biomedical literature can help to build protein relation network and design new drugs. There are more than 20 million literature abstracts included in MEDLINE, which is the most authoritative textual database in the field of biomedicine, and follow an exponential growth over time. This frantic expansion of the biomedical literature can often be difficult to absorb or manually analyze. Thus efficient and automated search engines are necessary to efficiently explore the biomedical literature using text mining techniques. ResultsThe P, R, and F value of tag graph method in Aimed corpus are 50.82, 69.76, and 58.61%, respectively. The P, R, and F value of tag graph kernel method in other four evaluation corpuses are 2–5% higher than that of all-paths graph kernel. And The P, R and F value of feature kernel and tag graph kernel fuse methods is 53.43, 71.62 and 61.30%, respectively. The P, R and F value of feature kernel and tag graph kernel fuse methods is 55.47, 70.29 and 60.37%, respectively. It indicated that the performance of the two kinds of kernel fusion methods is better than that of simple kernel. ConclusionIn comparison with the all-paths graph kernel method, the tag graph kernel method is superior in terms of overall performance. Experiments show that the performance of the multi-kernels method is better than that of the three separate single-kernel method and the dual-mutually fused kernel method used hereof in five corpus sets.
机译:背景技术从生物医学文献中自动提取蛋白质实体相互作用信息可以帮助建立蛋白质关系网络和设计新药。 MEDLINE是生物医学领域最权威的文本数据库,MEDLINE中包含超过2000万篇文献摘要,并且随着时间的推移呈指数级增长。生物医学文献的这种疯狂扩张通常难以吸收或手动分析。因此,高效和自动化的搜索引擎对于使用文本挖掘技术有效地探索生物医学文献是必不可少的。结果目标语料库中标记图法的P,R和F值分别为50.82%,69.76%和58.61%。在其他四个评估语料库中,标记图核方法的P,R和F值比全路径图核的P,R和F值高2–5%。特征核和标记图核融合方法的P,R和F值分别为53.43%,71.62%和61.30%。特征核和标记图核融合方法的P,R和F值分别为55.47、70.29和60.37%。结果表明,两种核融合方法的性能均优于简单核。结论与全路径图核方法相比,标记图核方法在整体性能上具有优势。实验表明,在五个语料集中,多核方法的性能优于本文所用的三个单独的单核方法和双核融合方法的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号