首页> 外文会议>10th ACM symposium on document engineering 2010 >On Helmholtz's Principle for Documents Processing
【24h】

On Helmholtz's Principle for Documents Processing

机译:亥姆霍兹文件处理原则

获取原文
获取原文并翻译 | 示例

摘要

Keyword extraction is a fundamental problem in text data mining and document processing. A large number of document processing applications directly depend on the quality and speed of keyword extraction algorithms. In this article, a novel approach to rapid change detection in data streams and documents is developed. It is based on ideas from image processing and especially on the Helmholtz Principle from the Gestalt Theory of human perception. Applied to the problem of keywords extraction, it delivers fast and effective tools to identify meaningful keywords using parameter-free methods. We also define a level of meaningfulness of the keywords which can be used to modify the set of keywords depending on application needs.
机译:关键字提取是文本数据挖掘和文档处理中的一个基本问题。大量文档处理应用程序直接取决于关键字提取算法的质量和速度。在本文中,开发了一种新颖的方法来快速检测数据流和文档中的变化。它基于图像处理的思想,特别是基于人类感知的格式塔理论的亥姆霍兹原理。它应用于关键字提取问题,它提供了快速有效的工具,可使用无参数方法来识别有意义的关键字。我们还定义了关键字的有意义程度,可根据应用程序的需求来修改关键字集。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号