首页> 外文会议>2017 IEEE 2nd Information Technology, Networking, Electronic and Automation Control Conference >Research on keyword extraction of Tibetan web news based on improved TEXT-RANK algorithm
【24h】

Research on keyword extraction of Tibetan web news based on improved TEXT-RANK algorithm

机译:基于改进的TEXT-RANK算法的藏文网络新闻关键词提取研究

获取原文
获取原文并翻译 | 示例

摘要

This paper proposes an improved TEXT-RANK algorithm and applies it to keyword extraction in Tibetan web news. In this paper, we first apply the graph model algorithm to Tibetan keyword extraction, and improve the TEXT-RANK algorithm. In text preprocessing, according to the characteristics of new words and proper nouns in network news texts, the named entity recognition method is integrated by CRF and MaxEnt. In this paper, the voting mechanism of TEXT-RANK algorithm is improved by studying the grammatical features of Tibetan and the writing rules of Tibetan network news. Experiments show that the improved TEXT-RANK algorithm can effectively improve the accuracy of keyword extraction.
机译:提出了一种改进的TEXT-RANK算法,并将其应用于藏文网络新闻中的关键词提取。在本文中,我们首先将图模型算法应用于藏语关键词提取中,并对TEXT-RANK算法进行了改进。在文本预处理中,根据网络新闻文本中新词和专有名词的特征,通过CRF和MaxEnt集成命名实体识别方法。通过研究藏文的语法特征和藏文网络新闻的写作规则,改进了TEXT-RANK算法的投票机制。实验表明,改进的TEXT-RANK算法可以有效提高关键词提取的准确性。

著录项

  • 来源
  • 会议地点 Chengdu(CN)
  • 作者单位

    Gansu Key Laboratory of Intelligent Processing of Ethnic Languages, Northwest University for Nationalities, Lanzhou, Gansu 730000, China;

    Gansu Key Laboratory of Intelligent Processing of Ethnic Languages, Northwest University for Nationalities, Lanzhou, Gansu 730000, China;

    Gansu Key Laboratory of Intelligent Processing of Ethnic Languages, Northwest University for Nationalities, Lanzhou, Gansu 730000, China;

    Gansu Key Laboratory of Intelligent Processing of Ethnic Languages, Northwest University for Nationalities, Lanzhou, Gansu 730000, China;

    Gansu Key Laboratory of Intelligent Processing of Ethnic Languages, Northwest University for Nationalities, Lanzhou, Gansu 730000, China;

  • 会议组织
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    graph theory; information retrieval; natural language processing; text analysis;

    机译:图论;信息检索;自然语言处理;文本分析;;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号