A Semi-Supervised Key Phrase Extraction Approach: Learning from Title Phrases through a Document Semantic Network

机译：半监督的关键短语提取方法：通过文档语义网络从标题短语中学习

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

It is a fundamental and important task to extract key phrases from documents. Generally, phrases in a document are not independent in delivering the content of the document. In order to capture and make better use of their relationships in key phrase extraction, we suggest exploring the Wikipedia knowledge to model a document as a semantic network, where both n-ary and binary relationships among phrases are formulated. Based on a commonly accepted assumption that the title of a document is always elaborated to reflect the content of a document and consequently key phrases tend to have close semantics to the title, we propose a novel semi-supervised key phrase extraction approach in this paper by computing the phrase importance in the semantic network, through which the influence of title phrases is propagated to the other phrases iteratively. Experimental results demonstrate the remarkable performance of this approach.

机译：从文档中提取关键短语是一项基本而重要的任务。通常，文档中的短语在传递文档内容方面并不独立。为了在关键短语提取中捕获并更好地利用它们之间的关系，我们建议您探索Wikipedia知识，以将文档建模为语义网络，在该网络中，表达短语之间的n元和二进制关系。基于一个普遍接受的假设，即文档标题总是经过精心设计以反映文档的内容，因此，关键短语倾向于与标题具有紧密的语义，因此，我们在本文中提出了一种新颖的半监督关键字短语提取方法计算语义网络中短语的重要性，通过该重要性网络将标题短语的影响迭代地传播到其他短语。实验结果证明了这种方法的卓越性能。

著录项

来源
《Annual meeting of the Association for Computational Linguistics;Meeting of the Association for Computational Linguistics》|2010年|p.296-300|共5页
会议地点
作者
Decong Li; Sujian Li; Wenjie Li; Wei Wang; Weiguang Qu;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类程序语言、算法语言;
关键词

相似文献

外文文献
中文文献
专利

1. Semantic key phrase-based model for document management [J] . Prafulla Bafna, Dhanya Pramod, Shailaja Shrwaikar, Benchmarking . 2019,第6期

机译：基于语义关键短语的文档管理模型
2. Semantic key phrase-based model for document management [J] . Prafulla Bafna, Dhanya Pramod, Shailaja Shrwaikar, Benchmarking . 2019,第6期

机译：基于语义关键短语的文档管理模型
3. Charismatic Document Clustering Through Novel K-Means Non-negative Matrix Factorization (KNMF) Algorithm Using Key Phrase Extraction [J] . E. Laxmi Lydia, P. Krishna Kumar, K. Shankar, International journal of parallel programming . 2020,第3期

机译：通过新颖的K-Mean非负矩阵分解（KNMF）算法使用关键短语提取的魅力文档聚类
4. A Semi-Supervised Key Phrase Extraction Approach: Learning from Title Phrases through a Document Semantic Network [C] . Decong Li, Sujian Li, Wenjie Li, Annual meeting of the Association for Computational Linguistics . 2010

机译：半监督关键短语提取方法：通过文档语义网络从标题短语学习
5. Noun phrases in documents: Preprocessing, automatic extraction, and statistical analysis in different categories of text. [D] . Kim, Youngin. 2002

机译：文档中的名词短语：对不同类别的文本进行预处理，自动提取和统计分析。
6. Empirical data for the semantic interpretation of prepositional phrases in medical documents. [O] . M. Romacker, U. Hahn 2001

机译：医学文档中介词短语语义解释的经验数据。
7. Automatic Titling of Electronic Documents with Noun Phrase Extraction [O] . Violaine Prince, Mathieu Roche 2015

机译：用名词短语提取自动标题电子文档
8. Searching the ASRS Database Using QUORUM Keyword Search, Phrase Search, Phrase Generation, and Phrase Discovery [R] . McGreevy, M. W. 2001

机译：使用QUORUm关键字搜索，短语搜索，短语生成和短语发现搜索asRs数据库

A Semi-Supervised Key Phrase Extraction Approach: Learning from Title Phrases through a Document Semantic Network

摘要

著录项

相似文献

相关主题

期刊订阅