首页> 外文会议>22nd International Conference on Computational Linguistics >Re-estimation of Lexical Parameters for Treebank PCFGs
【24h】

Re-estimation of Lexical Parameters for Treebank PCFGs

机译:树库PCFG的词法参数的重新估计

获取原文
获取原文并翻译 | 示例

摘要

We present procedures which pool lexical information estimated from unlabeled data via the Inside-Outside algorithm, with lexical information from a treebank PCFG. The procedures produce substantial improvements (up to 31.6% error reduction) on the task of determining subcategoriza-tion frames of novel verbs, relative to a smoothed Penn Treebank-trained PCFG. Even with relatively small quantities of unlabeled training data, the re-estimated models show promising improvements in labeled bracketing f-scores on Wall Street Journal parsing, and substantial benefit in acquiring the subcategorization preferences of low-frequency verbs.
机译:我们介绍了通过内部-外部算法将未标记数据估计的词汇信息与树库PCFG的词汇信息合并在一起的过程。相对于平滑的Penn树库训练的PCFG,该程序在确定新颖动词的子类别框架的任务上产生了显着的改进(最多减少了31.6%的错误)。即使使用相对较少量的未标记训练数据,重新估计的模型仍显示出在《华尔街日报》解析中标记括号f分数的有希望的改进,并且在获取低频动词的子类别首选项方面具有显着优势。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号