首页> 外文会议>2010 7th International Symposium on Chinese Spoken Language Processing >Sentence Decomplexification using holistic aspect-based clause detection for long sentence understanding
【24h】

Sentence Decomplexification using holistic aspect-based clause detection for long sentence understanding

机译:使用基于整体方面的子句检测的句子去复杂化,以了解长句

获取原文

摘要

Long sentences have posed significant challenges for many natural language processing (NLP) tasks such as machine translation and language understanding, because it is still very difficult for the state-of-the-art parsers to analyze them. In this paper, we identify the Sentence Decomplexification (SD) problem and propose models for SD to help understand long sentences. Given a complex sentence, SD seeks to return two sentences, one main clause and the other subordinate clause. These two clauses together include all the information of the original sentence. Since identifying subordinate clauses is a more difficult task than traditional chunking, we also propose a holistic aspect-based detection (HAD) method for clause detection to reduce the overhead required for SD sentence similarity computation. We provide the formalisms of SD and show that HAD can be used for efficiency purposes to this task. The SD system was used to improve the performance of a long sentence understanding system. Experimental results show that the task of SD achieves 78.7% accuracy using Chinese Gigaword Corpus as sentence comparison corpus. For the performance of long sentence understanding, the proposed method reports an improvement of accuracy from 70.7% to 75.5% as compared to that without using SD.
机译:长句子对许多自然语言处理(NLP)任务(例如机器翻译和语言理解)提出了重大挑战,因为最新的解析器仍然很难对其进行分析。在本文中,我们确定了句子去复杂化(SD)问题,并提出了SD模型以帮助理解长句子。给定一个复杂的句子,SD试图返回两个句子,一个主句和另一个从句。这两个子句一起包含原始句子的所有信息。由于识别从属子句比传统的分块更加困难,因此,我们还提出了一种用于子句检测的基于方面的整体检测(HAD)方法,以减少SD句子相似度计算所需的开销。我们提供了SD的形式,并表明HAD可以用于此任务的效率目的。 SD系统用于改善长句子理解系统的性能。实验结果表明,以汉字Gigaword语料库为句子比较语料库,SD任务的准确率达到了78.7%。对于长句理解的性能,与不使用SD的方法相比,所提出的方法报告的准确性从70.7%提高到75.5%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号