...
首页> 外文期刊>Procedia Computer Science >Attention based Abstractive Summarization of Malayalam Document
【24h】

Attention based Abstractive Summarization of Malayalam Document

机译:马拉雅拉姆文献的注意力摘要

获取原文

摘要

There are different textual content summarization processes available in natural Language Processing. Amongst them abstractive textual content summarization is one of the challenging problems in natural language processing and that too, with very little research done in regional languages. Unlike other summarization techniques, which reuses the words and phrases from the source text, abstractive text summarization builds a short and concise precis of a huge text document built from the underlying message of the text not necessarily using the same words and phrases from the source. The objective of the proposed work is to create a brief and understandable abstractive summary of a Malayalam document. Malayalam is one of the 22 scheduled languages of India spoken by over 34 million people and is designated as a Classical Language in India. Being a Classical language, Malayalam has a very unique syntactic and semantic rules which makes this work more important. The proposed work attempts to create an attention mechanism to generate the summary of the source document. In this work, the goal was to compare the efficiency of Attention model with sequence to sequence baseline model of Malayalam text and thereby implementing a better abstractive text summarizer for a malayalam document.
机译:有不同的文本内容摘要过程,自然语言处理可用。其中,抽象文本内容摘要是自然语言处理中的挑战性问题之一,而且在区域语言中的研究非常少。与其他摘要技术不同,该技术从源文本中重用单词和短语,抽象文本摘要构建了从文本的基础消息构建的庞大文本文档的简短主题,不一定使用来自源的相同单词和短语。拟议工作的目的是创建一个Malayalam文件的简短和理解的抽象概述。 Malayalam是超过3400万人所说的印度的22种计划之一,被指定为印度的古典语言。 Malayalam具有一种古典语言,具有一个非常独特的句法和语义规则,使这项工作更加重要。所提出的工作试图创建注意机制以生成源文档的摘要。在这项工作中,目标是将注意力模型的效率与序列进行比较到Malayalam文本的序列基线模型,从而实现了Malayalam文件的更好的抽象文本摘要。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号