首页> 外文会议>International Conference on Intelligent Computing, Communication and Devices >Keyword Extraction from Hindi Documents Using Statistical Approach
【24h】

Keyword Extraction from Hindi Documents Using Statistical Approach

机译:使用统计方法从印地语文件中提取关键词

获取原文

摘要

Keywords of a document give us an idea about its important points without going through the whole text. In this paper, we propose an unsupervised, domain-independent, and corpus-independent approach for automatic keyword extraction. The approach is general and can be applied to any language. However, we have tested the approach on Hindi language. Our approach combines the information contained in frequency and spatial distribution of a word in order to extract keywords from a document. Our work is specially significant in the light that it has been implemented and tested on Hindi which is a resource poor and underrepresented language.
机译:文件的关键字在不经过整个文本的情况下向我们了解其重要观点。在本文中,我们提出了一种无监督,域独立的和独立于语料库的自动关键字提取方法。该方法是一般的,可以应用于任何语言。但是,我们测试了印地语语言的方法。我们的方法将包含的信息与单词的空间分布中包含的信息相结合,以便从文档中提取关键字。我们的工作是特别重要的,即它在印地语上实施和测试了,这是一种资源贫困和代表性不足的语言。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号