首页> 外文会议>International Symposium on Chinese Spoken Language Processing >Sentence Decomplexification using holistic aspect-based clause detection for long sentence understanding
【24h】

Sentence Decomplexification using holistic aspect-based clause detection for long sentence understanding

机译:句子解复用使用整体基于宽度的子句检测来缩短句子理解

获取原文

摘要

Long sentences have posed significant challenges for many natural language processing (NLP) tasks such as machine translation and language understanding, because it is still very difficult for the state-of-the-art parsers to analyze them. In this paper, we identify the Sentence Decomplexification (SD) problem and propose models for SD to help understand long sentences. Given a complex sentence, SD seeks to return two sentences, one main clause and the other subordinate clause. These two clauses together include all the information of the original sentence. Since identifying subordinate clauses is a more difficult task than traditional chunking, we also propose a holistic aspect-based detection (HAD) method for clause detection to reduce the overhead required for SD sentence similarity computation. We provide the formalisms of SD and show that HAD can be used for efficiency purposes to this task. The SD system was used to improve the performance of a long sentence understanding system. Experimental results show that the task of SD achieves 78.7% accuracy using Chinese Gigaword Corpus as sentence comparison corpus. For the performance of long sentence understanding, the proposed method reports an improvement of accuracy from 70.7% to 75.5% as compared to that without using SD.
机译:长期句子对许多自然语言处理(NLP)任务(如机器翻译和语言理解)构成了重大挑战,因为最先进的解析器仍然非常困难分析它们。在本文中,我们识别句子解用化(SD)问题并提出SD的模型,以帮助了解长句。鉴于一个复杂的句子,SD旨在返回两个句子,一个主子句和其他从属条款。这两个条款一起包括原始句子的所有信息。由于识别从属条款是一种比传统的块更困难的任务,因此我们还提出了一种基于宽方面的检测(have)条款检测方法,以减少SD句子相似性计算所需的开销。我们提供SD的形式主义和表演,可以用于此任务的效率目的。 SD系统用于提高漫长句子理解系统的性能。实验结果表明,SD的任务达到了78.7%的准确性,使用中国吉拉夫罗德语料库作为句子比较语料库。对于长句子的表现,与不使用SD的情况相比,该方法报告了从70.7%的准确性提高到75.5%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号