首页> 外文会议>International Conference on Information Systems Design and Intelligence Applications >Keyword Extraction from Hindi Documents Using Document Statistics and Fuzzy Modelling
【24h】

Keyword Extraction from Hindi Documents Using Document Statistics and Fuzzy Modelling

机译:使用文档统计和模糊建模的印度文档从印地文文件提取

获取原文

摘要

In this paper, we put forward a novel unsupervised, domain independent and corpus independent approach for automatic keyword extraction. Our approach combines the document statistics of frequency and spatial distribution of a word in order to extract the keywords. We have extracted keywords from Hindi documents using document statistics and utilized the power of fuzzy logic to combine those document statistics effectively for better results. Further, we use this information to frame fuzzy rules for keyword extraction. Main advantages of our approach are that it uses the fuzzy membership for the variables instead of dealing with crisp thresholds and corpus independent setting of fuzzy membership boundaries. Our work is especially significant in the light that it has been implemented and tested on Hindi which is a resource poor and underrepresented language.
机译:在本文中,我们提出了一种新颖的无监督,域独立和语料库的自动关键词提取。我们的方法结合了一个单词的频率和空间分布的文档统计信息,以便提取关键字。我们使用文档统计信息从Hindi文档中提取关键字,并利用模糊逻辑的力量,以便有效地将这些文档统计信息组合起来。此外,我们使用此信息来帧为关键字提取的模糊规则。我们的方法的主要优点是它使用了变量模糊成员资格而不是处理模糊会员边界的清晰阈值和语料库。我们的作品在光线下,它的实施和测试是在印地文实施和测试的,这是一种资源差和代表性不足的语言。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号