Keyword Extraction from a Single Document using Word Co-occurrence Statistical Information

机译：使用单词共现统计信息从单个文档中提取关键字

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

We present a new keyword extraction algorithm that applies to a single document without using a corpus. Frequent terms are extracted first, then a set of cooccurrence between each term and the frequent terms, i.e., occurrences in the same sentences, is generated. Co-occurrence distribution shows importance of a term in the document as follows. If probability distribution of co-occurrence between term a and the frequent terms is biased to a particular subset of frequent terms, then term a is likely to be a keyword. The degree of biases of distribution is measured by the x~2-measure. Our algorithm shows comparable performance to tfidf without using a corpus.

机译：我们提出了一种新的关键字提取算法，该算法适用于单个文档而无需使用语料库。首先提取频繁项，然后生成每个项与频繁项之间的一组同现，即同一句子中的出现。共现分布显示文档中术语的重要性，如下所示。如果术语a和频繁术语之间的共现概率分布偏向频繁术语的特定子集，则术语a可能是关键字。分布的偏差程度通过x〜2-度量来度量。我们的算法在不使用语料库的情况下显示了与tfidf相当的性能。

著录项

来源
《International Florida Artiticial Intelligence Research Society Conference and International Flairs Conference: Recent Advances in Artificial Intelligece; 2003》|2003年|P.392-396|共5页
会议地点
作者
Yutaka Matsuo; Mitsuru Ishizuka;
展开▼
作者单位

National Institute of Advanced Industrial Science and Technology Aomi 2-41-6, Koto-ku, Tokyo 135-0064, Japan;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类自动化系统理论;工程模拟;
关键词

相似文献

外文文献
中文文献
专利

1. KEYWORD EXTRACTION FROM A SINGLE DOCUMENT USING WORD CO-OCCURRENCE STATISTICAL INFORMATION [J] . Y. MATSUO, M. ISHIZUKA International Journal of Artificial Intelligence Tools: Architectures, Languages, Algorithms . 2004,第1期

机译：使用单词同现统计信息从单个文档中提取关键词
2. Single document keyword extraction via quantifying higher-order structural features of word co-occurrence graph [J] . Chen Yan, Wang Jie, Li Ping, Computer speech and language . 2019,第SEPa期

机译：通过量化单词共现图的高阶结构特征提取单文档关键词
3. Single document keyword extraction via quantifying higher-order structural features of word co-occurrence graph [J] . Chen Yan, Wang Jie, Li Ping, Computer speech and language . 2019,第Sepa期

机译：单个文档关键字提取通过量化Word Co-antionrence图的高阶结构特征
4. Keyword extraction from a single document using word Co-occurrence statistical information [C] . Yutaka Matsuo, Mitsuru Ishizuka International Florida Artificial Intelligence Research Society Conference . 2003

机译：使用Word Co-Feationsence统计信息从单个文档提取关键字提取
5. Keywords in the mist: Automated keyword extraction for very large documents and back of the book indexing. [D] . Csomai, Andras. 2008

机译：薄雾中的关键字：自动提取非常大的文档并在书后建立索引的关键字。
6. Click-words: learning to predict document keywords from a user perspective [O] . Rezarta Islamaj Doğan, Zhiyong Lu -1

机译：点击字词：从用户角度学习预测文档关键字
7. Keyword extraction from a single document using word co-occurrence statistical information [O] . Yutaka Matsuo, Mitsuru Ishizuka 2013

机译：使用单词共现统计信息从单个文档中提取关键字

Keyword Extraction from a Single Document using Word Co-occurrence Statistical Information

摘要

著录项

相似文献

相关主题

期刊订阅