首页> 外文会议>Pacific Asia Conference on Language, Information and Computation >Augmented Parsing of Unknown Word by Graph-based Semi-supervised Learning
【24h】

Augmented Parsing of Unknown Word by Graph-based Semi-supervised Learning

机译:通过基于图形的半监督学习来增强未知单词的解析

获取原文

摘要

This paper presents a novel method using graph-based semi-supervised learning (SSL) to improve the syntax parsing of unknown words. Different from conventional approaches that uses hand-crafted rules, rich morphological features, or a character-based model to handle unknown words, this method is based on a graph-based label propagation technique. It gives greater improvement on grammars trained on a smaller amount of labeled data and a large amount of unlabeled one. A transductiv graph-based SSL method is employed to propagate POS and derive the emission distributions from labeled data to unlabeled one. The derived distributions are incorporated into the parsing process. The proposed method effectively augments the original supervised parsing model by contributing 2.28% and 1.72% absolute improvement on the accuracy of POS tagging and syntax parsing for Penn Chinese Treebank respectively.
机译:本文介绍了一种新的方法,使用基于图形的半监督学习(SSL)来改进未知单词的语法解析。 与使用手工制作规则,丰富的形态特征或基于角色的模型处理未知单词的传统方法不同,这种方法基于基于图形的标签传播技术。 它对培训的语法更好地改善了较少数量的标记数据和大量未标记的语法。 基于转换图形的SSL方法用于传播POS并从标记数据从标记数据推导到未标记的SSL方法。 派生的分布被纳入解析过程中。 所提出的方法通过贡献2.28%和1.72%的绝对改善分别为POS ChineseBank的POS标记和语法解析的准确性提高了2.28%和1.72%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号