首页> 外文期刊>Brazilian Computer Society. Journal >RelHunter: a machine learning method for?relation extraction from text
【24h】

RelHunter: a machine learning method for?relation extraction from text

机译: RelHunter :一种用于从文本中提取关系的机器学习方法

获取原文
           

摘要

We propose RelHunter , a machine learning-based method for the extraction of structured information from text. RelHunter ’s key idea is to model the target structures as a relation over entities. Hence, the modeling effort is reduced to the identification of entities and the generation of a candidate relation, which are simpler problems than the original one. RelHunter fits a very broad spectrum of complex computational linguistic problems. We apply it to five tasks: phrase chunking, clause identification, hedge detection, quotation extraction, and dependency parsing. We compare RelHunter to token classification approaches through several computational experiments on seven multilingual corpora. RelHunter outperforms the token classification approaches by 2.14% on average. Moreover, we compare the derived systems against state-of-the-art systems for each corpus. Our systems achieve state-of-the-art performances for three corpora: Portuguese phrase chunking, Portuguese clause identification, and English quotation extraction. Additionally, the derived systems show good quality performance for the other four corpora.
机译:我们提出RelHunter,这是一种基于机器学习的方法,用于从文本中提取结构化信息。 RelHunter的主要思想是将目标结构建模为实体之间的关系。因此,建模工作减少了实体的标识和候选关系的生成,这是比原始问题简单的问题。 RelHunter适合广泛的复杂计算语言问题。我们将其应用于五个任务:短语组块,子句识别,对冲检测,引号提取和依赖项解析。通过对七个多语言语料库的几次计算实验,我们将RelHunter与令牌分类方法进行了比较。 RelHunter的性能平均比令牌分类方法高出2.14%。此外,我们将派生的系统与每个语料库的最新系统进行了比较。我们的系统针对三种语料库实现了最先进的性能:葡萄牙语短语组块,葡萄牙语子句识别和英语引号提取。此外,派生的系统对其他四个语料库显示出良好的质量性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号