...
首页> 外文期刊>Soft computing: A fusion of foundations, methodologies and applications >Multiple relations extraction among multiple entities in unstructured text
【24h】

Multiple relations extraction among multiple entities in unstructured text

机译:非结构化文本中多个实体中的多种关系提取

获取原文
获取原文并翻译 | 示例

摘要

Relations extraction is a widely researched topic in nature language processing. However, most of the work in the literature concentrate on the methods that are dealing with single relation between two named entities. In the task of multiple relations extraction, traditional statistic-based methods have difficulties in selecting features and improving the performance of extraction model. In this paper, we presented formal definitions of multiple entities and multiple relations and put forward three labeling methods which were used to label entity categories, relation categories and relation conditions. We also proposed a novel relation extraction model which is based on dynamic long short-term memory network. To train our model, entity feature, entity position feature and part of speech feature are used together. These features are used to describe complex relations and improve the performance of relation extraction model. In the experiments, we classified the corpus into three sets which are composed of 0-20 words, 20-35 words and 35+ words sentences. On conll04.corp, the final precision, recall rate and F-measure reached 72.9, 70.8 and 67.9% respectively.
机译:关系提取是自然语言处理中的广泛研究的主题。然而,文献中的大部分工作都集中在处理两个命名实体之间的单一关系的方法上。在多种关系提取的任务中,传统的基于统计方法在选择特征和提高提取模型的性能方面具有困难。在本文中,我们提出了多个实体和多个关系的正式定义,并提出了三种标签方法,用于标记实体类别,关系类别和关系条件。我们还提出了一种基于动态长短期存储网络的新型关系提取模型。要培训我们的模型,实体功能,实体位置特征和一部分语音功能。这些特征用于描述复杂的关系,提高关系提取模型的性能。在实验中,我们将语料库分为三组,由0-20字组成,20-35个单词和35多个单词句子。在Conll04.Corp上,最终精确度,召回率和F措施分别达到72.9,70.8和67.9%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号