首页> 外文会议>Natural language understanding and intelligent applications >Resolving Chinese Zero Pronoun with Word Embedding
【24h】

Resolving Chinese Zero Pronoun with Word Embedding

机译:用词嵌入法解决汉语零代词

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

Elliptical sentences are frequently seen in Chinese, especially in some particular situations, such as dialogues, which is challengeable to understand specific semantic. Chinese zero pronoun resolution, which recovers a noun phrase in the elliptical position, is an effective method to help machines understand natural languages. Traditional methods use the features, which are extracted from syntactic parsing trees manually. However, the long running time and the inaccuracy of automatic parsing algorithms have a bad influence on practical applications. In this work, we propose a new method based on long-short-term memory network that calculates dense vector representations for mention pairs without using features from syntactic parsing trees. These representations, which capture significant semantics for zero pronoun resolution, are built on distributed representation of words in surrounding contexts and candidate antecedents. Our method contributes to reducing the manual work of extracting features from parsing tress, which improves the F1-score of Chinese zero pronoun resolution system. Experimental results on OnotoNotes 5.0 Chinese dataset show our method achieves better performance compared with the state-of-the-art method.
机译:省略句经常在中文中出现,尤其是在某些特殊情况下,例如对话,这对于理解特定的语义是有挑战性的。汉语零代词解析可以在椭圆位置恢复名词短语,是帮助机器理解自然语言的有效方法。传统方法使用特征,这些特征是从语法分析树中手动提取的。但是,运行时间长和自动解析算法的不准确性对实际应用有不利影响。在这项工作中,我们提出了一种基于长短期记忆网络的新方法,该方法无需使用语法分析树中的特征即可计算提及对的密集矢量表示。这些表示捕获零代词解析的重要语义的表示形式是建立在周围上下文和候选先行词中的分布式表示形式之上的。我们的方法有助于减少解析树皮中提取特征的人工工作,从而提高了汉语零代词解析系统的F1分数。在OnotoNotes 5.0中文数据集上的实验结果表明,与最新方法相比,我们的方法具有更好的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号