首页> 外文会议>Natural language processing and information systems >Analysis and Improvement of Minimally Supervised Machine Learning for Relation Extraction
【24h】

Analysis and Improvement of Minimally Supervised Machine Learning for Relation Extraction

机译:用于关系提取的最小监督机器学习的分析和改进

获取原文
获取原文并翻译 | 示例

摘要

The main contribution of this paper is a systematic analysis of a minimally supervised machine learning method for relation extraction grammars. The method is based on a bootstrapping approach in which the bootstrapping is triggered by semantic seeds. The starting point of our analysis is the pattern-learning graph which is a subgraph of the bipartite graph representing all connections between linguistic patterns and relation instances exhibited by the data. It is shown that the performance of such general learning framework for actual tasks is dependent on certain properties of the data and on the selection of seeds. Several experiments have been conducted to gain explanatory insights into the interaction of these two factors. Prom the investigation of more effective seeds and benevolent data we understand how to improve the learning in less fortunate configurations. A relation extraction method only based on positive examples cannot avoid all false positives, especially when the data properties yield a high recall. Therefore, negative seeds are employed to learn negative patterns, which boost precision.
机译:本文的主要贡献是对关系抽取语法的最小监督机器学习方法进行了系统分析。该方法基于一种引导方法,其中引导是由语义种子触发的。我们分析的起点是模式学习图,它是二部图的子图,代表了语言模式与数据所展示的关系实例之间的所有联系。结果表明,这种针对实际任务的通用学习框架的性能取决于数据的某些属性以及种子的选择。已经进行了一些实验来获得对这两个因素相互作用的解释性见解。舞会调查更有效的种子和慈善数据,我们了解如何在不太幸运的配置中改善学习。仅基于肯定示例的关系提取方法无法避免所有误报,尤其是在数据属性产生较高召回率时。因此,可以使用负种子来学习负模式,从而提高精度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号