首页> 外文会议>Annual meeting of the Association for Computational Linguistics;ACL 2011 >Extracting Paraphrases from Definition Sentences on the Web
【24h】

Extracting Paraphrases from Definition Sentences on the Web

机译:从Web上的定义句中提取复述

获取原文

摘要

We propose an automatic method of extracting paraphrases from definition sentences, which are also automatically acquired from the Web. We observe that a huge number of concepts are defined in Web documents, and that the sentences that define the same concept tend to convey mostly the same information using different expressions and thus contain many paraphrases. We show that a large number of paraphrases can be automatically extracted with high precision by regarding the sentences that define the same concept as parallel corpora. Experimental results indicated that with our method it was possible to extract about 300,000 paraphrases from 6 x 108 Web documents with a precision rate of about 94%.
机译:我们提出了一种从定义语句中提取复述的自动方法,这些定义语句也是从Web上自动获取的。我们注意到,Web文档中定义了许多概念,并且定义相同概念的句子倾向于使用不同的表达式传达大多数相同的信息,因此包含许多释义。我们表明,通过考虑定义与并行语料库相同的概念的句子,可以以高精度自动提取大量复述。实验结果表明,使用我们的方法可以从6 x 108个Web文档中提取大约300,000个复述,准确率约为94%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号