The TaxGen framework: automating the generation of a taxonomy for a large document collection

Abstract

Text mining is an active area of research and development, which combines and expands techniques found in related areas like information retrieval, computational linguistics and data mining to perform an analysis of large corpora of digital documents. This paper describes the TaxGen text mining project carried out at the IBM Software Development Lab. at Boeblingen, Germany. The goal of TaxGen was the automatic generation of a taxonomy for a collection of previously unstructured documents, namely a set of 73,000 news wire documents spanning one year

Bibliographic record

Source
1999, pp. 1-9 (9 pages)
Authors
Muller A.; Dorre J.; Gerstl P.; Seiffert R.
Original format: PDF
Chinese Library Classification: Systems science

Similar literature

1. A digital library framework for heterogeneous music collections: from document acquisition to cross-modal interaction [J]. David Damm, Christian Fremerey, Verena Thomas, International journal on digital libraries. 2012, No. 2-3
2. Access Control Framework for XML Document Collections [J]. Goran Sladić, Branko Milosavljević, Zora Konjović, Computer Science and Information Systems. 2011, No. 3
3. A framework for predicting competition between native and exotic hymenopteran parasitoids of lepidopteran larvae using taxonomic collections and species level traits [J]. McGrath Zane, MacDonald Frances, Walker Graham, BioControl: Journal of the International Organization for Biological Control. 2021, No. 1
4. The TaxGen Framework: Automating the Generation of a Taxonomy for a Large Document Collection [C]. Adrian Muller, Jochen Dorre, Peter Gerstl, Hawaii International Conference on System Sciences, Annual. 1999
5. Diatoms of the Esteros del Iberá: A Taxonomic and Ecological Comparison of Historical and Contemporary Collections [D]. Swenson, Jordan. 2020
6. Automated generation of massive image knowledge collections using Microsoft Live Labs Pivot to promote neuroimaging and translational research [O]. Teeradache Viangteeravat, Matthew N Anyanwu, Venkateswara Ra Nagisetty. 2011
7. A First Step Towards a Fuzzy Framework for Analyzing Collections of JSON Documents [O]. Giuseppe Psaila, Stefania Marrara. 2019
8. Automated knowledge acquisition for second generation knowledge base systems: A conceptual analysis and taxonomy [R]. Williams, K. E., Kotnour, T. 1991
