首页> 外文期刊>IEEE Transactions on Knowledge and Data Engineering >Acquisition of linguistic patterns for knowledge-based information extraction
【24h】

Acquisition of linguistic patterns for knowledge-based information extraction

机译:获取基于知识的信息提取的语言模式

获取原文
获取原文并翻译 | 示例
           

摘要

The paper presents an automatic acquisition of linguistic patterns that can be used for knowledge based information extraction from texts. In knowledge based information extraction, linguistic patterns play a central role in the recognition and classification of input texts. Although the knowledge based approach has been proved effective for information extraction on limited domains, there are difficulties in construction of a large number of domain specific linguistic patterns. Manual creation of patterns is time consuming and error prone, even for a small application domain. To solve the scalability and the portability problem, an automatic acquisition of patterns must be provided. We present the PALKA (Parallel Automatic Linguistic Knowledge Acquisition) system that acquires linguistic patterns from a set of domain specific training texts and their desired outputs. A specialized representation of patterns called FP structures has been defined. Patterns are constructed in the form of FP structures from training texts, and the acquired patterns are tuned further through the generalization of semantic constraints. Inductive learning mechanism is applied in the generalization step. The PALKA system has been used to generate patterns for our information extraction system developed for the fourth Message Understanding Conference (MUC-4).
机译:本文提出了一种语言模式的自动获取,该模式可用于从文本中提取基于知识的信息。在基于知识的信息提取中,语言模式在输入文本的识别和分类中起着核心作用。尽管已经证明了基于知识的方法对于有限域上的信息提取是有效的,但是在构造大量特定于域的语言模式方面仍然存在困难。手动创建模式非常耗时且容易出错,即使对于小型应用程序域也是如此。为了解决可伸缩性和可移植性问题,必须提供模式的自动获取。我们介绍了PALKA(并行自动语言知识获取)系统,该系统从一组领域特定的培训文本及其期望的输出中获取语言模式。已经定义了称为FP结构的模式的专用表示形式。从训练文本中以FP结构的形式构造模式,并通过语义约束的泛化来进一步调整获取的模式。在归纳步骤中应用归纳学习机制。 PALKA系统已用于为第四次消息理解会议(MUC-4)开发的信息提取系统生成模式。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号