Keyword Extraction from Hindi Documents Using Document Statistics and Fuzzy Modelling

机译：使用文档统计和模糊建模的印度文档从印地文文件提取

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper, we put forward a novel unsupervised, domain independent and corpus independent approach for automatic keyword extraction. Our approach combines the document statistics of frequency and spatial distribution of a word in order to extract the keywords. We have extracted keywords from Hindi documents using document statistics and utilized the power of fuzzy logic to combine those document statistics effectively for better results. Further, we use this information to frame fuzzy rules for keyword extraction. Main advantages of our approach are that it uses the fuzzy membership for the variables instead of dealing with crisp thresholds and corpus independent setting of fuzzy membership boundaries. Our work is especially significant in the light that it has been implemented and tested on Hindi which is a resource poor and underrepresented language.

机译：在本文中，我们提出了一种新颖的无监督，域独立和语料库的自动关键词提取。我们的方法结合了一个单词的频率和空间分布的文档统计信息，以便提取关键字。我们使用文档统计信息从Hindi文档中提取关键字，并利用模糊逻辑的力量，以便有效地将这些文档统计信息组合起来。此外，我们使用此信息来帧为关键字提取的模糊规则。我们的方法的主要优点是它使用了变量模糊成员资格而不是处理模糊会员边界的清晰阈值和语料库。我们的作品在光线下，它的实施和测试是在印地文实施和测试的，这是一种资源差和代表性不足的语言。

著录项

来源
《International Conference on Information Systems Design and Intelligence Applications》|2018年|xxv 1088 p :|共10页
会议地点
作者
Sifatullah Siddiqi; Aditi Sharan;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP302.1-532;
关键词
Keyword Extraction; Hindi Documents; Fuzzy Modelling;

机译：关键词提取;印地文文件;模糊建模;

相似文献

外文文献
中文文献
专利

1. SwiftRank: An Unsupervised Statistical Approach of Keyword and Salient Sentence Extraction for Individual Documents [J] . Htet Myet Lynn, Eunji Lee, Chang Choi, Procedia Computer Science . 2017,第1期

机译：SwiftRank：单个文档的关键字和显着句子提取的无监督统计方法
2. Construction of Keyword Extraction using Statistical Approaches and Document Clustering by Agglomerative method [J] . R. Nagarajan, Dr. P. Aruna International Journal of Engineering Research and Applications . 2016,第1期

机译：统计方法和关键词聚类的凝聚方法构建关键词提取
3. KEYWORD EXTRACTION FROM A SINGLE DOCUMENT USING WORD CO-OCCURRENCE STATISTICAL INFORMATION [J] . Y. MATSUO, M. ISHIZUKA International Journal of Artificial Intelligence Tools: Architectures, Languages, Algorithms . 2004,第1期

机译：使用单词同现统计信息从单个文档中提取关键词
4. Keyword Extraction from Hindi Documents Using Document Statistics and Fuzzy Modelling [C] . Sifatullah Siddiqi, Aditi Sharan International Conference on Information Systems Design and Intelligence Applications . 2018

机译：使用文档统计和模糊建模的印度文档从印地文文件提取
5. Keywords in the mist: Automated keyword extraction for very large documents and back of the book indexing. [D] . Csomai, Andras. 2008

机译：薄雾中的关键字：自动提取非常大的文档并在书后建立索引的关键字。
6. A System for Automated Extraction of Metadata from Scanned Documents using Layout Recognition and String Pattern Search Models [O] . Dharitri Misra, Siyuan Chen, George R. Thoma -1

机译：使用布局识别和字符串模式搜索模型从扫描文档中自动提取元数据的系统
7. Keyword extraction from a single document using word co-occurrence statistical information [O] . Yutaka Matsuo, Mitsuru Ishizuka 2013

机译：使用单词共现统计信息从单个文档中提取关键字

Keyword Extraction from Hindi Documents Using Document Statistics and Fuzzy Modelling

摘要

著录项

相似文献

相关主题

期刊订阅