首页> 外文会议>9th International conference on language resources and evaluation >A Framework for Compiling High Quality Knowledge Resources From Raw Corpora
【24h】

A Framework for Compiling High Quality Knowledge Resources From Raw Corpora

机译:从Raw Corpora编译高质量知识资源的框架

获取原文

摘要

The identification of various types of relations is a necessary step to allow computers to understand natural language text. In particular, the clarification of relations between predicates and their arguments is essential because predicate-argument structures convey most of the information in natural languages. To precisely capture these relations, wide-coverage knowledge resources are indispensable. Such knowledge resources can be derived from automatic parses of raw corpora, but unfortunately parsing still has not achieved a high enough performance for precise knowledge acquisition. We present a framework for compiling high quality knowledge resources from raw corpora. Our proposed framework selects high quality dependency relations from automatic parses and makes use of them for not only the calculation of fundamental distributional similarity but also the acquisition of knowledge such as case frames.
机译:识别各种类型的关系是允许计算机理解自然语言文本的必要步骤。特别是,阐明谓词及其自变量之间的关系非常重要,因为谓词自变量结构以自然语言传达了大多数信息。为了准确把握这些关系,必不可少的知识资源是必不可少的。这样的知识资源可以从原始语料库的自动解析中获取,但是不幸的是,解析仍然无法获得足够高的性能来进行精确的知识获取。我们提供了一个从原始语料库中汇编高质量知识资源的框架。我们提出的框架从自动解析中选择高质量的依赖关系,并将其不仅用于基本分布相似性的计算,而且还用于诸如案例框架之类的知识的获取。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号