Deriving Concept Hierarchies From Text

Abstract

This paper presents a means of automatically deriving a hierarchical organization of concepts from a set of documents without the use of training data or standard clustering techniques. Instead, salient words and phrases extracted from the documents are organized hierarchically using a type of co-occurrence known as subsumption. The resulting structure is displayed as a series of hierarchical menus. When the structure is generated from a set of retrieved documents, a user browsing the menus is provided with a detailed overview of their content in a manner distinct from existing overview and summarization techniques. The methods used to build the structure are simple, but appear to be effective: a small-scale user study reveals that the generated hierarchy possesses properties expected of such a structure, in that general terms are placed at the top levels, leading to related and more specific terms below. The formation and presentation of the hierarchy are described along with the user study and some other informal evaluations.

The organization of a set of documents into a concept hierarchy derived automatically from the set itself is undoubtedly one goal of information retrieval. Were this goal to be achieved, the documents would be organized into a form somewhat like existing manually constructed subject hierarchies, such as the Library of Congress categories or the Dewey Decimal system. The only difference would be that the categories are customized to the set of documents itself. For example, from a collection of media-related articles, the category 'Entertainment' might appear near the top level; below it, (amongst others) one might find the category 'Movies', a type of entertainment; and below that, there could be the category 'Actors and Actresses', an aspect of movies. As can be seen, the arrangement of the categories provides an overview of the topic structure of those articles.
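
The abstract names subsumption but does not spell out the test. The following is a minimal sketch, assuming the commonly cited formulation in which a term x subsumes a term y when P(x|y) >= 0.8 and P(y|x) < 1, that is, when nearly every document containing y also contains x, but not the reverse. The function name `subsumption_pairs`, the default threshold, and the toy documents are illustrative assumptions and are not taken from this record.

```python
from itertools import permutations


def subsumption_pairs(doc_terms, threshold=0.8):
    """Return (parent, child) pairs where parent is taken to subsume child.

    doc_terms: a list of term collections, one per document.
    threshold: assumed cut-off for P(parent | child); 0.8 is an assumption.
    """
    # Map each term to the set of document ids that contain it.
    postings = {}
    for doc_id, terms in enumerate(doc_terms):
        for term in set(terms):
            postings.setdefault(term, set()).add(doc_id)

    pairs = []
    # Test every ordered pair of distinct terms.
    for x, y in permutations(sorted(postings), 2):
        both = len(postings[x] & postings[y])
        p_x_given_y = both / len(postings[y])  # share of y's documents that contain x
        p_y_given_x = both / len(postings[x])  # share of x's documents that contain y
        if p_x_given_y >= threshold and p_y_given_x < 1.0:
            pairs.append((x, y))
    return pairs


# Toy collection echoing the 'Entertainment' > 'Movies' > 'Actors' example.
docs = [
    {"entertainment", "movies", "actors"},
    {"entertainment", "movies"},
    {"entertainment", "music"},
    {"entertainment"},
]

pairs = subsumption_pairs(docs)
for parent, child in pairs:
    print(f"{parent} subsumes {child}")
```

On this toy input the sketch also reports 'entertainment' as a parent of 'actors', even though 'movies' is the more specific parent. A menu builder would likely drop such indirect pairs and keep only the closest parent for each term; that pruning step is omitted here.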

Bibliographic details

Authors
Sanderson, M.; Croft, B.
Year: 2005
Pages: 1-9
Total pages: 9
Original format: PDF
Language: English
Chinese Library Classification: Industrial technology
Keywords
Information retrieval; Hierarchies; Clustering; Collection; User needs; Documents

Similar literature

1. A parametric network approach for concepts hierarchy generation in text corpus [J]. Universitatea "Ovidius" Constanta. Analele. Seria Matematica. 2016, no. 2016
2. Learning subsumption hierarchies of ontology concepts from texts [J]. Elias Zavitsanos, Georgios Paliouras, George A. Vouros. Web Intelligence and Agent Systems. 2010, no. 2
3. Learning Concept Hierarchies from Text Corpora using Formal Concept Analysis [J]. Cimiano P., Hotho A., Staab S. The Journal of Artificial Intelligence Research. 2005, no. 12
4. Deriving concept hierarchies from text [C]. Mark Sanderson, Bruce Croft. Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. 1999
5. Text summarization using concept hierarchy [D]. Huang, Xiaomei. 2009
6. Pharmspresso: a text mining tool for extraction of pharmacogenomic concepts and relationships from full text [O]. Yael Garten, Russ B Altman. 2009
7. Deriving Concept Hierarchies From Text [O]. Mark Sanderson, Bruce Croft. 1999
