Database: The Journal of Biological Databases and Curation

iSimp in BioC standard format: enhancing the interoperability of a sentence simplification system

Abstract

This article reports the use of the BioC standard format in our sentence simplification system, iSimp, and demonstrates its general utility. iSimp is designed to simplify the complex sentences commonly found in biomedical text, and it has been shown to improve existing text mining applications that rely on the analysis of sentence structure. By adopting the BioC format, we aim to make iSimp readily interoperable with other applications in the biomedical domain. To examine the utility of iSimp in BioC, we implemented a rule-based relation extraction system that uses iSimp as a preprocessing module and BioC for data exchange. Evaluation on the training corpus of the BioNLP-ST 2011 GENIA Event Extraction (GE) task showed that iSimp sentence simplification improved recall by 3.2% without reducing precision. The iSimp simplification-annotated corpora, both our previously used corpus and the GE corpus in the current study, have been converted into the BioC format and made publicly available at the project's Web site.
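To make the data-exchange idea concrete, here is a minimal sketch of how a downstream tool might read annotations from a BioC-style XML file. It assumes the general BioC layout (a collection of documents, each holding passages with an offset, text, and annotations that carry infon key/value pairs and a location). The sample sentence, the annotation id `S1`, the infon value `coordination`, and the helper name `read_annotations` are illustrative assumptions, not the actual iSimp output or its schema.

```python
import xml.etree.ElementTree as ET

# Hypothetical BioC-style fragment. The sentence, ids, and infon values
# are made up for illustration and are not taken from the iSimp corpora.
SAMPLE = """<collection>
  <source>example</source>
  <date>20140101</date>
  <key>example.key</key>
  <document>
    <id>doc1</id>
    <passage>
      <offset>0</offset>
      <text>PKC phosphorylates A and B.</text>
      <annotation id="S1">
        <infon key="type">coordination</infon>
        <location offset="19" length="7"/>
        <text>A and B</text>
      </annotation>
    </passage>
  </document>
</collection>"""


def read_annotations(xml_text):
    """Return one dict per annotation, with the span recovered from passage text."""
    root = ET.fromstring(xml_text)
    results = []
    for doc in root.iter("document"):
        doc_id = doc.findtext("id")
        for passage in doc.iter("passage"):
            # Annotation offsets are assumed to share the passage offset's basis,
            # so subtract the passage offset to index into the passage text.
            p_offset = int(passage.findtext("offset"))
            p_text = passage.findtext("text") or ""
            for ann in passage.iter("annotation"):
                infons = {i.get("key"): i.text for i in ann.findall("infon")}
                loc = ann.find("location")
                start = int(loc.get("offset")) - p_offset
                end = start + int(loc.get("length"))
                results.append({
                    "doc": doc_id,
                    "id": ann.get("id"),
                    "type": infons.get("type"),
                    "span": p_text[start:end],
                })
    return results


if __name__ == "__main__":
    for ann in read_annotations(SAMPLE):
        print(ann)
```

Because each annotation carries its own location, a consumer such as a relation extractor can recover the annotated span from the passage text without knowing how the producing tool works internally, which is the kind of interoperability the abstract describes.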
