Conference: International Conference on Intelligent Human-Machine Systems and Cybernetics

The Mixture of Textrank and Lexrank Techniques of Single Document Automatic Summarization Research in Tibetan


Abstract

We live in an era dominated by the knowledge economy and information. Automatic summarization is an important research topic in natural language processing; its purpose is to explore how valuable information can be obtained from natural language texts. Tibetan information processing technology lags behind, and no results on automatic summarization for Tibetan have been publicly reported. Drawing on existing Chinese and English automatic summarization techniques from China and abroad, this paper proposes a method for Tibetan automatic summarization. The method combines the strengths of TextRank-based keyword processing with LexRank-based processing of the relationships between sentences. It takes full account of frequency, part of speech, word position, and word length, as well as the content and position of each sentence. In particular, the similarity between candidate sentences is considered when the summary is generated. The experiments analyze three summarization methods, based on TextRank, on LexRank, and on LexRank+TextRank respectively, and use ROUGE values to evaluate summarization quality. The experimental results show that the mixture of TextRank and LexRank techniques performs better for single-document automatic summarization in Tibetan, with accuracy reaching 80%.
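
The combination described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract gives no formulas, so the function names (`textrank_words`, `lexrank_sentences`, `summarize`), the equal weighting `alpha`, the co-occurrence window, and the similarity thresholds are all assumptions made for illustration. Plain cosine similarity stands in for the idf-weighted variant used in LexRank, and the part-of-speech, word-length, and position features are omitted. Sentences are assumed to be already segmented and tokenized.

```python
import math
from collections import Counter

def cosine(a, b):
    """Cosine similarity between two bag-of-words Counters."""
    dot = sum(a[w] * b[w] for w in a if w in b)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def pagerank(weights, damping=0.85, iters=50):
    """Power iteration on a weighted adjacency matrix (list of lists).
    Nodes without edges simply keep the baseline score (a simplification)."""
    n = len(weights)
    if n == 0:
        return []
    out = [sum(row) for row in weights]
    scores = [1.0 / n] * n
    for _ in range(iters):
        scores = [
            (1 - damping) / n + damping * sum(
                weights[j][i] / out[j] * scores[j]
                for j in range(n) if out[j] > 0)
            for i in range(n)
        ]
    return scores

def textrank_words(sentences, window=2):
    """TextRank-style keyword scores from a word co-occurrence graph."""
    vocab = sorted({w for s in sentences for w in s})
    idx = {w: i for i, w in enumerate(vocab)}
    weights = [[0.0] * len(vocab) for _ in vocab]
    for s in sentences:
        for i, w in enumerate(s):
            for v in s[i + 1:i + 1 + window]:
                if v != w:
                    weights[idx[w]][idx[v]] += 1.0
                    weights[idx[v]][idx[w]] += 1.0
    return dict(zip(vocab, pagerank(weights)))

def lexrank_sentences(sentences, threshold=0.1):
    """LexRank-style sentence scores from a sentence similarity graph."""
    bags = [Counter(s) for s in sentences]
    n = len(bags)
    weights = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            if i != j:
                sim = cosine(bags[i], bags[j])
                if sim >= threshold:
                    weights[i][j] = sim
    return pagerank(weights)

def normalize(xs):
    """Scale scores so the largest is 1.0 (all zeros stay zero)."""
    top = max(xs) if xs else 0.0
    return [x / top if top > 0 else 0.0 for x in xs]

def summarize(sentences, k=2, alpha=0.5, max_sim=0.7):
    """Return indices of up to k sentences, in document order.
    alpha and max_sim are assumed values, not taken from the paper."""
    word_score = textrank_words(sentences)
    kw = normalize([sum(word_score[w] for w in s) / len(s) if s else 0.0
                    for s in sentences])
    lex = normalize(lexrank_sentences(sentences))
    combined = [alpha * a + (1 - alpha) * b for a, b in zip(kw, lex)]
    chosen = []
    for i in sorted(range(len(sentences)), key=lambda i: -combined[i]):
        bag = Counter(sentences[i])
        # Skip candidates too similar to a sentence already selected.
        if all(cosine(bag, Counter(sentences[j])) < max_sim for j in chosen):
            chosen.append(i)
        if len(chosen) == k:
            break
    return sorted(chosen)
```

Calling `summarize(tokenized_sentences, k=3)` returns the positions of the selected sentences. The redundancy check in the final loop corresponds to the abstract's point about considering the similarity of candidate sentences; how the paper actually weights the two scores is not stated there.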
