
Hardware Support for Language Aware Information Mining


Abstract

Information retrieval from text, or 'text mining', is the process of extracting interesting and non-trivial knowledge from unstructured text. With the ever-increasing amount of information stored on the web or archived within computing systems, high-performance data processing architectures are required to process this data in real time. The aim of the work presented in this paper is the development of a hardware text mining IP-Core for use in FPGA-based systems. We describe the pre-processing engine we have developed for the PRESENCE II PCI card, which accelerates the identification of significant words within a document and logs their frequency and position. The performance of this system is then compared with an equivalent software implementation built on the Lucene software package.
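As a rough illustration of the pre-processing task described above (identifying significant words and logging their frequency and position), the following is a minimal software sketch. It is not the paper's hardware design or its Lucene baseline: the function name `index_significant_words`, the tiny `STOP_WORDS` set, and the choice to treat every non-stop word as "significant" are all assumptions made for the example.

```python
import re
from collections import defaultdict

# Hypothetical stop-word list; the abstract does not say how the engine
# decides which words are "significant".
STOP_WORDS = {"a", "an", "and", "the", "of", "in", "is", "to", "or", "on"}


def index_significant_words(text, stop_words=STOP_WORDS):
    """Map each non-stop word to its occurrence count and token positions."""
    index = defaultdict(lambda: {"count": 0, "positions": []})
    # Positions are token offsets within the document, counting stop words too.
    for position, match in enumerate(re.finditer(r"[A-Za-z0-9]+", text)):
        word = match.group().lower()
        if word in stop_words:
            continue  # skip words assumed to carry no significance
        entry = index[word]
        entry["count"] += 1
        entry["positions"].append(position)
    return dict(index)


if __name__ == "__main__":
    for word, entry in index_significant_words("The cat and the hat. The cat sat.").items():
        print(word, entry["count"], entry["positions"])
```

A hardware engine would presumably stream characters through dedicated logic rather than materialise tokens in memory, but the per-word record kept here (count plus positions) mirrors the output the abstract describes.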
