首页> 外文会议>Natural Language Processing and Information Systems >A Flexible Workbench for Document Analysis and Text Mining
【24h】

A Flexible Workbench for Document Analysis and Text Mining

机译:用于文档分析和文本挖掘的灵活工作台

获取原文
获取外文期刊封面目录资料

摘要

Document analysis and text mining techniques are used to pre-process documents in information retrieval systems, to extract concepts in ontology construction processes, and to discover and classify knowledge along several dimensions. In most cases it is not obvious how the techniques should be configured and combined, and it is a time-consuming process to set up and test various combinations of techniques. In this paper, we present a workbench that makes it easy to plug in new document analysis and text mining techniques and experiment with different constellations of techniques. We explain the architecture of the workbench and show how the workbench has been used to extract ontological concepts and relationships for a document collection published by the Norwegian Center for Medical Informatics.
机译:文档分析和文本挖掘技术用于预处理信息检索系统中的文档,提取本体构建过程中的概念以及沿多个维度发现和分类知识。在大多数情况下,如何配置和组合这些技术并不明显,并且设置和测试各种技术组合是一个耗时的过程。在本文中,我们提供了一个工作台,可轻松插入新的文档分析和文本挖掘技术,并使用不同的技术组合进行实验。我们将解释工作台的体系结构,并展示如何使用工作台为挪威医学信息中心发布的文档集提取本体概念和关系。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号