A New Scheme for Scoring Phrases in Unsupervised Keyphrase Extraction

机译：一种新的核心关键词提取中的短语计划

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Many unsupervised methods for keyphrase extraction typically compute a score for each word in a document based on various measures such as tf-idf or the PageRank score computed from the word graph built from the text document. The final score of a candidate phrase is then calculated by summing up the scores of its constituent words. A potential problem with the sum up scoring scheme is that the length of a phrase highly impacts its score. To reduce this impact and extract keyphrases of varied lengths, we propose a new scheme for scoring phrases which calculates the final score using the average of the scores of individual words weighted by the frequency of the phrase in the document. We show experimentally that the unsupervised approaches that use this new scheme outperform their counterparts that use the sum up scheme to score phrases.

机译：对于关键斑点提取的许多无监督方法通常基于从文本文档中构建的单词图中计算的各种措施（例如TF-IDF或PageRank分数）计算文档中的每个单词的分数。然后通过总结其组成词的分数来计算候选词组的最终得分。总结得分方案的潜在问题是短语的长度高度影响其分数。为减少这种影响和提取各种长度的关键效果，我们提出了一种新的计划，用于使用由文档中短语的频率加权的单个单词的分数的平均值计算最终分数的新方案。我们通过实验显示了使用此新方案的无监督方法优先于其对应于使用总结方案来衡量短语的对应物。

著录项

来源
《European Conference on Information Retrieval Research》|2017年|799p|共7页
会议地点
作者
Corina Florescu; Cornelia Caragea;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类信息处理（信息加工）;
关键词
入库时间 2022-08-20 20:11:40

相似文献

外文文献
中文文献
专利

1. Accurate keyphrase extraction by discriminating overlapping phrases [J] . Mounia Haddoud, Saied Abdeddaiem Journal of Information Science . 2014,第4期

机译：通过区分重叠短语来准确提取关键短语
2. Highlighting keyphrases using senti-scoring and fuzzy entropy for unsupervised sentiment analysis [J] . Vashishtha Srishti, Susan Seba Expert systems with applications . 2021,第May期

机译：使用Senti-Scoring和模糊熵突出关键杂交，以进行无监督的情绪分析
3. KP-Rank: a semantic-based unsupervised approach for keyphrase extraction from text data [J] . Aman Muhammad, Abdulkadir Said Jadid, Aziz Izzatdin Abdul, Multimedia Tools and Applications . 2021,第8期

机译：kp-and：基于语义的无调节方法，用于从文本数据中提取关键词
4. A New Scheme for Scoring Phrases in Unsupervised Keyphrase Extraction [C] . Corina Florescu, Cornelia Caragea European conference on IR research . 2017

机译：无监督关键字短语提取中短语评分的新方案
5. Keyphrase Extraction and Its Applications to Digital Libraries [D] . Patel, Krutarth Indubhai. 2021

机译：关键词提取及其对数字图书馆的应用
6. Deep neural model with self-training for scientific keyphrase extraction [O] . Xun Zhu, Chen Lyu, Donghong Ji, 2020

机译：具有自我训练的深度神经模型用于科学关键训练
7. Unsupervised Key-phrase Extraction using Noun Phrases [O] . Shailendra Singh, Shubhrita Tiwari, Anubha Varshney, 2017

机译：使用名词短语提取无人监督的关键词

A New Scheme for Scoring Phrases in Unsupervised Keyphrase Extraction

摘要

著录项

相似文献

相关主题

期刊订阅