首页> 外文期刊>Knowledge and Data Engineering, IEEE Transactions on >Minimally Supervised Novel Relation Extraction Using a Latent Relational Mapping
【24h】

Minimally Supervised Novel Relation Extraction Using a Latent Relational Mapping

机译:使用潜在关系映射的最小监督新颖关系提取

获取原文
获取原文并翻译 | 示例
       

摘要

The World Wide Web includes semantic relations of numerous types that exist among different entities. Extracting the relations that exist between two entities is an important step in various Web-related tasks such as information retrieval (IR), information extraction, and social network extraction. A supervised relation extraction system that is trained to extract a particular relation type (source relation) might not accurately extract a new type of a relation (target relation) for which it has not been trained. However, it is costly to create training data manually for every new relation type that one might want to extract. We propose a method to adapt an existing relation extraction system to extract new relation types with minimum supervision. Our proposed method comprises two stages: learning a lower dimensional projection between different relations, and learning a relational classifier for the target relation type with instance sampling. First, to represent a semantic relation that exists between two entities, we extract lexical and syntactic patterns from contexts in which those two entities co-occur. Then, we construct a bipartite graph between relation-specific (RS) and relation-independent (RI) patterns. Spectral clustering is performed on the bipartite graph to compute a lower dimensional projection. Second, we train a classifier for the target relation type using a small number of labeled instances. To account for the lack of target relation training instances, we present a one-sided under sampling method. We evaluate the proposed method using a data set that contains 2,000 instances for 20 different relation types. Our experimental results show that the proposed method achieves a statistically significant macroaverage F-score of 62.77. Moreover, the proposed method outperforms numerous baselines and a previously proposed weakly supervised relation extraction method.
机译:万维网包含存在于不同实体之间的多种类型的语义关系。提取两个实体之间存在的关系是与Web相关的各种任务(例如信息检索(IR),信息提取和社交网络提取)中的重要步骤。受过训练以提取特定关系类型(源关系)的监督关系提取系统可能无法准确地提取尚未对其进行训练的新型关系(目标关系)。但是,为可能要提取的每种新关系类型手动创建培训数据的成本很高。我们提出一种方法,使现有的关系提取系统适用于在最少的监督下提取新的关系类型。我们提出的方法包括两个阶段:学习不同关系之间的低维投影,以及通过实例采样学习目标关系类型的关系分类器。首先,为了表示两个实体之间存在的语义关系,我们从这两个实体同时出现的上下文中提取词汇和句法模式。然后,我们在关系特定(RS)模式和关系独立(RI)模式之间构造了二部图。对二部图执行谱聚类以计算低维投影。其次,我们使用少量带标签的实例训练目标关系类型的分类器。为了解决缺乏目标关系训练实例的问题,我们提出一种单面抽样方法。我们使用包含20种不同关系类型的2,000个实例的数据集评估提出的方法。我们的实验结果表明,所提出的方法达到了统计上显着的62.77的宏平均F分数。此外,所提出的方法优于许多基线和先前提出的弱监督关系提取方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号