Resolving Chinese Zero Pronoun with Word Embedding

Abstract

Elliptical sentences are common in Chinese, especially in settings such as dialogue, and the omitted elements make the intended meaning hard to recover. Chinese zero pronoun resolution, which recovers the noun phrase that an elided position refers to, helps machines understand natural language. Traditional methods rely on features extracted manually from syntactic parse trees. However, the long running time and the inaccuracy of automatic parsing algorithms limit their usefulness in practical applications. In this work, we propose a new method based on a long short-term memory (LSTM) network that computes dense vector representations for mention pairs without using features from syntactic parse trees. These representations, which capture semantics significant for zero pronoun resolution, are built on distributed representations of the words in the surrounding contexts and in the candidate antecedents. Our method reduces the manual work of extracting features from parse trees and improves the F1-score of a Chinese zero pronoun resolution system. Experimental results on the OntoNotes 5.0 Chinese dataset show that our method achieves better performance than the state-of-the-art method.
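The abstract gives no architectural details, so the following is only a rough sketch of what a parse-free mention-pair representation could look like. It assumes that the context to the left of the zero pronoun is read by one LSTM, the context to the right is read backward by a second LSTM, and the candidate antecedent is summarized by averaging its word vectors. All names (`TinyLSTM`, `pair_representation`, `rank_candidates`), the dimensions, the toy sentence, and the random untrained weights are hypothetical and are not taken from the paper.

```python
import numpy as np

rng = np.random.default_rng(0)
EMB, HID = 8, 6  # hypothetical embedding and hidden sizes


def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))


class TinyLSTM:
    """Single-layer LSTM forward pass with random, untrained weights."""

    def __init__(self, emb, hid):
        # One stacked weight matrix for the input, forget, output and candidate gates.
        self.W = rng.normal(0.0, 0.1, size=(4 * hid, emb + hid))
        self.b = np.zeros(4 * hid)
        self.hid = hid

    def encode(self, vectors):
        """Return the final hidden state after reading a sequence of word vectors."""
        h = np.zeros(self.hid)
        c = np.zeros(self.hid)
        for x in vectors:
            z = self.W @ np.concatenate([x, h]) + self.b
            i, f, o, g = np.split(z, 4)
            c = sigmoid(f) * c + sigmoid(i) * np.tanh(g)
            h = sigmoid(o) * np.tanh(c)
        return h


def embed(words, table):
    """Look up a word vector for each token."""
    return [table[w] for w in words]


def pair_representation(left_ctx, right_ctx, candidate, table, fwd, bwd):
    """Dense vector for one (zero pronoun, candidate antecedent) pair."""
    left = fwd.encode(embed(left_ctx, table))
    # Assumption: the right context is read in reverse, ending at the gap.
    right = bwd.encode(embed(right_ctx[::-1], table))
    cand = np.mean(embed(candidate, table), axis=0)
    return np.concatenate([left, right, cand])


def rank_candidates(left_ctx, right_ctx, candidates, table, fwd, bwd, w):
    """Score every candidate with a linear layer and return the best index."""
    scores = [
        float(w @ pair_representation(left_ctx, right_ctx, c, table, fwd, bwd))
        for c in candidates
    ]
    return int(np.argmax(scores)), scores


# Toy sentence: 我 买 了 书 , [zero pronoun] 很 好看
tokens = ["我", "买", "了", "书", ",", "很", "好看"]
table = {t: rng.normal(size=EMB) for t in tokens}  # stand-in for pretrained embeddings
fwd, bwd = TinyLSTM(EMB, HID), TinyLSTM(EMB, HID)
w = rng.normal(size=2 * HID + EMB)  # untrained scoring weights

left_ctx, right_ctx = tokens[:5], tokens[5:]
candidates = [["我"], ["书"]]
best, scores = rank_candidates(left_ctx, right_ctx, candidates, table, fwd, bwd, w)
print("chosen antecedent:", candidates[best], "scores:", scores)
```

Because the weights are random, the chosen antecedent carries no meaning here. In a trained system the LSTM and scoring parameters would be learned from annotated zero pronouns, and the point of the sketch is only that every input is a word vector, with no parse-tree feature involved.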
