首页> 外文会议>Annual Conference on Neural Information Processing Systems >Learning syntactic patterns for automatic hypernym discovery
【24h】

Learning syntactic patterns for automatic hypernym discovery

机译:学习自动超性发现的句法模式

获取原文

摘要

Semantic taxonomies such as WordNet provide a rich source of knowledge for natural language processing applications, but are expensive to build, maintain, and extend. Motivated by the problem of automatically constructing and extending such taxonomies, in this paper we present a new algorithm for automatically learning hypernym (is-a) relations from text. Our method generalizes earlier work that had relied on using small numbers of hand-crafted regular expression patterns to identify hypernym pairs. Using "dependency path" features extracted from parse trees, we introduce a general-purpose formalization and generalization of these patterns. Given a training set of text containing known hypernym pairs, our algorithm automatically extracts useful dependency paths and applies them to new corpora to identify novel pairs. On our evaluation task (determining whether two nouns in a news article participate in a hypernym relationship), our automatically extracted database of hypernyms attains both higher precision and higher recall than WordNet.
机译:Wordnet等语义分类管理为自然语言处理应用提供丰富的知识来源,但构建,维护和扩展昂贵。在本文中,通过自动构建和扩展此类分类问题的问题,我们提出了一种新的算法,用于自动学习文本的关系(IS-A)关系。我们的方法概括了依赖于使用少量手工制作的正则表达式模式来识别Hypernym对的早期工作。使用从解析树中提取的“依赖路径”功能,我们介绍了这些模式的通用形式化和泛化。给定培训包含已知的Hypernym对的文本,我们的算法自动提取有用的依赖路径,并将它们应用于新的语料库以识别新颖对。在我们的评估任务(确定新闻文章中的两个名词是否参加了一个高性关系),我们自动提取的Hypernym数据库比WordNet获得更高的精度和更高的召回。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号