The TaxGen Framework: Automating the Generation of a Taxonomy for a Large Document Collection

机译：TAXGEN框架：自动化为大型文件收集生成分类法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Text Mining is an active area of research and development, which combines and expands techniques found in related areas like information retrieval, computational linguistics, and data mining to perform an analysis of large corpora of digital documents. This paper describes the TaxGen Text Mining project carried out at the IBM Software Development Lab. at Boeblingen, Germany. The goal of TaxGen was the automatic generation of a taxonomy for a collection of previously unstructured documents, namely a set of 73.000 news wire documents spanning one year.

机译：文本挖掘是一个活跃的研发领域，它结合和扩展了信息检索，计算语言学和数据挖掘等相关领域的技术，以分析了数字文件的大型语料库。本文介绍了IBM软件开发实验室执行的TAXGEN TEXT挖掘项目。在德国Boeblingen。 TAXGEN的目标是自动生成一系列以前非结构化文件的分类物，即一年的一组73.000新闻文件文件。

著录项

来源
《Hawaii International Conference on System Sciences, Annual》|1999年||共9页
会议地点
作者
Adrian Muler; Jochen Dore; Peter Gerstl; Roland Seiffert;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 N94-53;
关键词

相似文献

外文文献
中文文献
专利

1. A digital library framework for heterogeneous music collections: from document acquisition to cross-modal interaction [J] . David Damm, Christian Fremerey, Verena Thomas, International journal on digital libraries . 2012,第2a3期

机译：用于异构音乐收藏的数字图书馆框架：从文档获取到跨模式交互
2. Access Control Framework for XML Document Collections [J] . Goran Sladi??, Branko Milosavljevi??, Zora Konjovi??, Computer Science and Information Systems . 2011,第3期

机译：XML文档集合的访问控制框架
3. A framework for predicting competition between native and exotic hymenopteran parasitoids of lepidopteran larvae using taxonomic collections and species level traits [J] . McGrath Zane, MacDonald Frances, Walker Graham, BioControl: Journal of the International Organization for Biological Control . 2021,第1期

机译：使用分类学系列和物种级别特征预测鳞翅目幼虫的天然和异国情调的Hymenopteran寄生虫癌的框架
4. The TaxGen framework: automating the generation of a taxonomy for alarge document collection [C] . Muller A., Dorre J., Gerstl P., . 1999

机译：TaxGen框架：自动生成分类标准大文件收集
5. A computational framework for automating generation of finite element mesh sizing function via skeletons. [D] . Quadros, William Roshan. 2005

机译：通过骨架自动生成有限元网格尺寸调整函数的计算框架。
6. Automated generation of massive image knowledge collections using Microsoft Live Labs Pivot to promote neuroimaging and translational research [O] . Teeradache Viangteeravat, Matthew N Anyanwu, Venkateswara Ra Nagisetty, 2011

机译：使用Microsoft Live Labs Pivot自动生成海量图像知识以促进神经成像和翻译研究
7. A FIRST STEP TOWARDS A FUZZY FRAMEWORK FOR ANALYZING COLLECTIONS OF JSON DOCUMENTS [O] . Giuseppe Psaila, Stefania Marrara 2019

机译：迈向模糊框架的第一步，用于分析JSON文件集合
8. Automated knowledge acquisition for second generation knowledge base systems: A conceptual analysis and taxonomy. [R] . Williams, K. E., Kotnour, T. 1991

机译：第二代知识库系统的自动知识获取：概念分析和分类。

The TaxGen Framework: Automating the Generation of a Taxonomy for a Large Document Collection

摘要

著录项

相似文献

相关主题

期刊订阅