首页> 外文会议>Joint international conference on semantic technology >Leveraging Chinese Encyclopedia for Weakly Supervised Relation Extraction
【24h】

Leveraging Chinese Encyclopedia for Weakly Supervised Relation Extraction

机译:利用中文百科全书进行弱监督关系抽取

获取原文

摘要

In the research of named-entity relation extraction based on supervision, selecting relation features for traditional methods are usually finished by people, and it's hard to implement these methods for large-scale corpus. On the other hand, fixing relation types is the premise, so the practicabilities of these methods are not so ideal. This paper presents a weakly supervised method for Chinese named-entity relation extraction without man-made annotations, and the relation types in this method are not chosen artificially. The method collects entity relation types from the structured knowledge in encyclopedia pages, and then automatically annotates the relation instances existing in the texts based on these relation types. Simultaneously, the syntactic and semantic features of entity relations will be considered in this method, then the machine learning data will be completed, finally we use Support Vector Machine (SVM) model to train relation classifiers from training data, and these classifiers could try to extract entity relations from testing data. We carry out the experiment with the data from Chinese Baidu Encyclopedia pages, and the results show the effectiveness of this method, the overall Fl value reaches to 83.12 %. In order to probe the universality of this method, we also use the acquired relation classifiers to extract entity relations from news texts, and the results manifest that this method owns certain universality.
机译:在基于监督的命名实体关系提取的研究中,传统方法的关系特征的选择通常是由人们来完成的,而对于大型语料库则很难实施这些方法。另一方面,固定关系类型是前提,因此这些方法的实用性不是很理想。本文提出了一种在没有人为注释的情况下对中文命名实体关系进行提取的弱监督方法,该方法中的关系类型不是人为选择的。该方法从百科全书页中的结构化知识中收集实体关系类型,然后基于这些关系类型自动注释文本中存在的关系实例。同时,该方法将考虑实体关系的句法和语义特征,然后完成机器学习数据,最后我们使用支持向量机(SVM)模型从训练数据中训练关系分类器,这些分类器可以尝试从测试数据中提取实体关系。我们使用百度百科页面上的数据进行了实验,结果表明了该方法的有效性,总Fl值达到83.12%。为了探究该方法的普遍性,我们还使用获得的关系分类器从新闻文本中提取实体关系,结果表明该方法具有一定的普遍性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号