首页> 外文会议>International conference on computational linguistics >An Empirical Architecture for Verb Subcategorization Frame: a Lexicon for a Real-world Scale Japanese-English Interlingual MT
【24h】

An Empirical Architecture for Verb Subcategorization Frame: a Lexicon for a Real-world Scale Japanese-English Interlingual MT

机译:动词子类别框架的经验架构:一个用于真实世界规模的lexicon日语 - 英语interlingument mt

获取原文

摘要

The verb subcategorization frame information plays a major role of disambiguations in many NLP applications. Japanese, however, imposes difficulties of subcategorizing in part because it allows arbitrary ellipses of case elements. We propose a new type of verb subcategorization frame code set that combines the verb's surface case set and the deep case set, as a solution to the difficulties of empirical researches on Japanese. The lexicon developed by this design has comprehensive information on the correspondences between the surface case frame and the deep case frame, and yet restrains the potential combinatorial explosion of the number of verb subcategorization frames by carefully identifying superficially different frames with an idea of alternative case markers and semantic roles, and by introducing the notion of surface case frame permutations. The number of different surface/ deep case mapping types is 250, after we completed the new subcategorization frame code development for 30,000 verbs and adjectives.
机译:动词子类别帧信息在许多NLP应用中扮演歧义的主要作用。然而,日语部分难以部分地困难,因为它允许案例元素的任意椭圆形。我们提出了一种新型的动词子类别帧代码集,将动词的表面盒组和深案例集合,作为对日语实证研究困难的解决方案。由该设计开发的Lexicon具有关于表面壳体帧和深壳体帧之间的对应关系的全面信息,并且通过仔细地识别具有替代案例标记的想法,通过仔细地识别出色不同的帧来抑制动词子类别帧数的潜在组合爆炸和语义角色,并通过引入表面壳体帧置换的概念。在完成30,000个动词和形容词的新子类别帧代码开发之后,不同表面/深案例映射类型的数量为250。

著录项

相似文献

  • 外文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号