首页> 外文会议>International Workshop on Implementing Automata >An Extendible Regular Expression Compiler for Finite-State Approaches in Natural Language Processing
【24h】

An Extendible Regular Expression Compiler for Finite-State Approaches in Natural Language Processing

机译:一种用于自然语言处理的有限状态方法的可扩展常规表达式编译器

获取原文

摘要

Finite-state techniques are widely used in various areas of Natural Language Processing (NLP). As Kaplan and Kay [12] have argued, regular expressions are the appropriate level of abstraction for thinking about finite-state languages and finite-state relations. More complex finite-state operations (such as contexted replacement) are defined on the basis of basic operations (such as Kleene closure, complementation, composition). In order to be able to experiment with such complex finite-state operations the FSA Utilities (version 5) provides an extendible regular expression compiler. The paper discusses the regular expression operations provided by the compiler, and the possibilities to create new regular expression operators. The benefits of such an extendible regular expression compiler are illustrated with a number of examples taken from recent publications in the area of finite-state approaches to NLP.
机译:有限状态技术广泛用于自然语言处理(NLP)的各种领域。正如Kaplan和Kay [12]所说,正则表达式是思考有限州语言和有限状态关系的适当抽象水平。更复杂的有限状态操作(例如上下文更换)是基于基本操作(例如Kleene Close,Competuction,Constains)的基础定义。为了能够尝试如此复杂的有限状态操作,FSA实用程序(版本5)提供了一个可扩展的正则表达式编译器。本文讨论了编译器提供的正则表达式操作,以及创建新的正则表达式运算符的可能性。这种可扩展的正则表达式编译器的好处被说明,其中一些示例来自最近在NLP的有限状态接近区域中的出版物。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号