首页> 外文会议>International joint conference on natural language processing;Conference on empirical methods in natural language processing >Multi-Input Multi-Output Sequence Labeling for Joint Extraction of Fact and Condition Tuples from Scientific Text
【24h】

Multi-Input Multi-Output Sequence Labeling for Joint Extraction of Fact and Condition Tuples from Scientific Text

机译:从科学文本联合提取事实和条件元组的多输入多输出序列标签

获取原文

摘要

Condition is essential in scientific statement. Without the conditions (e.g., equipment, environment) that were precisely specified, facts (e.g., observations) in the statements may no longer be valid. Existing ScienceIE methods, which aim at extracting factual tuples from scientific text, do not consider the conditions. In this work, we propose a new sequence labeling framework (as well as a new tag schema) to jointly extract the fact and condition tuples from statement sentences. The framework has (1) a multi-output module to generate one or multiple tuples and (2) a multi-input module to feed in multiple types of signals as sequences. It improves F1 score relatively by 4.2% on BioNLP2013 and by 6.2% on a new bio-text dataset for tuple extraction.
机译:条件在科学陈述中至关重要。如果没有精确指定的条件(例如设备,环境),声明中的事实(例如观察)可能不再有效。旨在从科学文本中提取事实元组的现有ScienceIE方法没有考虑条件。在这项工作中,我们提出了一个新的序列标记框架(以及新的标记模式),以从语句语句中共同提取事实和条件元组。该框架具有(1)产生一个或多个元组的多输出模块,以及(2)产生多种信号作为序列的多输入模块。在BioNLP2013上,F1分数相对提高了4.2%,在用于元组提取的新生物文本数据集上,其F1分数相对提高了6.2%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号