An Improved Approach to Bengali Keyphrase Extraction

机译：孟加拉关键酶提取的改进方法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper presents a new approach for automatically extracting key phrases from a Bengali document. Our proposed approach presented in this paper has two important steps: (1) a shallow parsing based candidate key phrase identification that uses lexical information and case markers for candidate key phrase identification and (2) choosing the best items from the set of the candidates using a ranking method that combines the statistical features and the linguistic features for ranking the candidates. The feature set includes term frequency, position of the phrase's first occurrence, named entity information and lexical information. The proposed system has been tested on a collection of Bengali news documents. The experimental results show that it performs better than the existing approaches to which it is compared.

机译：本文介绍了一种自动从孟加拉文档中提取关键短语的新方法。我们本文提出的建议方法有两个重要步骤：（1）基于浅析的候选密钥短语识别，用于候选密钥短语识别的词汇信息和案例标记，并且使用（2）使用候选人集中的最佳项目一种组合统计特征和语言特征来排名候选的排名方法。该特征集包括术语频率，短语第一出现的位置，命名实体信息和词法信息。拟议的系统已经在孟加拉新闻文件的集合上进行了测试。实验结果表明，它比比较的现有方法更好。

著录项

来源
《International Conference of Emerging Applications of Information Technology》|2014年||共6页
会议地点
作者
Sarkar Kamal;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类信息处理（信息加工）;
关键词
Bengali; Case markers; Keyphrase Extraction; Named entities; Shallow parsing;

机译：孟加拉;案例标记;关键疗法提取;命名实体;浅析;

相似文献

外文文献
中文文献
专利

1. A Keyphrase-Based Approach to Text Summarization for English and Bengali Documents [J] . Kamal Sarkar International journal of technology diffusion . 2014,第2期

机译：基于关键字的英语和孟加拉语文档文本摘要方法
2. An Efficient Approach to Improve Arabic Documents Clustering Based on a new Keyphrases Extraction Algorithm [J] . Hanane FROUD, Issam SAHMOUDI, Abdelmonaime LACHKAR Computer Science & Information Technology . 2013,第8期

机译：一种基于新的关键词提取算法的阿拉伯文档聚类改进方法
3. Single-Document Keyphrase Extraction for Multi-Document Keyphrase Extraction [J] . Gábor Berend, Richárd Farkas Computacion y Sistemas . 2013,第2期

机译：单文档关键字提取用于多文档关键字提取
4. An Improved Approach to Bengali Keyphrase Extraction [C] . Sarkar Kamal 2014 Fourth International Conference of Emerging Applications of Information Technology . 2014

机译：孟加拉语关键词提取的一种改进方法
5. Keyphrase Extraction and Its Applications to Digital Libraries [D] . Patel, Krutarth Indubhai. 2021

机译：关键词提取及其对数字图书馆的应用
6. Deep neural model with self-training for scientific keyphrase extraction [O] . Xun Zhu, Chen Lyu, Donghong Ji, 2020

机译：具有自我训练的深度神经模型用于科学关键训练
7. AN EFFICIENT APPROACH TO IMPROVE ARABIC DOCUMENTS CLUSTERING BASED ON A NEW KEYPHRASES EXTRACTION ALGORITHM [O] . Hanane Froud, Issam Sahmoudi, Abdelmonaime Lachkar 2014

机译：基于新的关键短语提取算法的阿拉伯文件聚类的有效方法

An Improved Approach to Bengali Keyphrase Extraction

摘要

著录项

相似文献

相关主题

期刊订阅