首页> 外文会议>International Conference on Signal Processing and Integrated Networks >Keyword and keyphrase extraction from single Hindi document using statistical approach
【24h】

Keyword and keyphrase extraction from single Hindi document using statistical approach

机译:使用统计方法从单个印地语文档中提取关键字和关键词短语

获取原文

摘要

In this paper we propose an unsupervised, domain independent as well as corpus independent approach for automatic keyword extraction. In second part of the paper we have suggested an extension of the approach to extract keyphrases from the document. The approach is general and can be applied to any language. However, we have tested the approach on Hindi language. Our approach combines the information contained in frequency and spatial distribution of a word in order to extract keywords from a document. Our work is especially significant in the light that it has been implemented and tested on Hindi which is a resource poor and underrepresented language.
机译:在本文中,我们提出了一种无监督,领域独立以及语料库独立的自动关键字提取方法。在本文的第二部分中,我们建议了一种扩展方法,用于从文档中提取关键短语。该方法是通用的,可以应用于任何语言。但是,我们已经在印地语语言上测试了该方法。我们的方法结合了单词的频率和空间分布中包含的信息,以便从文档中提取关键字。鉴于已在印地语上实施和测试了印地语,这是一种资源匮乏且代表性不足的语言,因此我们的工作尤其重要。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号