首页> 外文期刊>ACM transactions on Asian language information processing >Using Short Dependency Relations from Auto-Parsed Data for Chinese Dependency Parsing
【24h】

Using Short Dependency Relations from Auto-Parsed Data for Chinese Dependency Parsing

机译:使用自动解析数据中的短依赖性关系进行中文依赖性分析

获取原文
获取原文并翻译 | 示例
       

摘要

Dependency parsing has become increasingly popular for a surge of interest lately for applications such as machine translation and question answering. Currently, several supervised learning methods can be used for training high-performance dependency parsers if sufficient labeled data are available.rnHowever, currently used statistical dependency parsers provide poor results for words separated by long distances. In order to solve this problem, this article presents an effective dependency parsing approach of incorporating short dependency information from unlabeled data. The unlabeled data is automatically parsed by using a deterministic dependency parser, which exhibits a relatively high performance for short dependencies between words. We then train another parser that uses the information on short dependency relations extracted from the output of the first parser. The proposed approach achieves an unlabeled attachment score of 86.52%, an absolute 1.24% improvement over the baseline system on the Chinese Treebank data set. The results indicate that the proposed approach improves the parsing performance for longer distance words.
机译:近年来,对于诸如机器翻译和问题解答之类的应用程序,兴趣分析的兴起使依赖解析变得越来越流行。当前,如果有足够的标记数据可用,则可以使用多种监督学习方法来训练高性能依赖解析器。但是,当前使用的统计依赖解析器对于长距离分隔的单词提供的结果较差。为了解决此问题,本文提出了一种有效的依赖项解析方法,该方法将从未标记的数据中合并短的依赖项信息。未标记的数据通过使用确定性相关性解析器自动解析,该解析器对于单词之间的短相关性表现出较高的性能。然后,我们训练另一个解析器,该解析器使用有关从第一个解析器的输出中提取的短依赖关系的信息。所提出的方法获得了86.52%的无标签依恋分数,比中国树库数据集的基准系统绝对提高了1.24%。结果表明,所提出的方法提高了长距离单词的解析性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号