【24h】

Towards Multidocument Summarization by Reformulation: Progress and Prospects

机译:通过重新编制实现多文档摘要:进展和前景

获取原文
获取原文并翻译 | 示例

摘要

By synthesizing information common to retrieveed documents, multi-document summarization can help users of information retrieval systems to find relevant documents with a minimal amount of reading. We are developing a multidocument summarization system to automatically generate a concise summary by identifying and synthesizing similarities across a set of related documents. Our approach is unique in its integration of machine learning and statistical techniques to identify similar paragraphs, intersection of similar phrases within paragraphs, and language generation to reformulate the wording of the summary. Our evaluation of system components shows that learning over multiple extracted linguistic features is more effective than information retrieval approaches at identifying similar text units for summarization and that it is possible to generate a fluent summary that conveys similarities among documents even when full semantic interpretations of the input text are not available.
机译:通过合成检索到的文档共有的信息,多文档摘要可以帮助信息检索系统的用户以最少的阅读量找到相关文档。我们正在开发一种多文档摘要系统,该系统通过识别和综合一组相关文档的相似性来自动生成简明摘要。我们的方法在整合机器学习和统计技术以识别相似的段落,相似的短语在段落中的相交以及语言生成以重新形成摘要的措辞方面具有独特性。我们对系统组件的评估表明,在识别相似的文本单元进行汇总时,对多个提取的语言特征进行学习比信息检索方法更有效,并且即使输入内容经过完整的语义解释,也可能生成流畅的摘要来传达文档之间的相似之处文本不可用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号