【24h】

Integrating a dynamic lexicon with a dynamic full-text retrieval system

机译:将动态词典与动态全文检索系统集成

获取原文

摘要

There has been a great deal of interest within the Information Retrieval community in evaluating the use of linguistic knowledge to improve the indexing and searching of textual databases. Such systems must often employ a lexicon to store information about the words and phrases comprising the application's domain. Unlike a static lexicon, a dynamic lexicon raises practical concerns about the coordination between the state of the lexicon and IR indexing schemes based on lexical knowledge. Additionally, it introduces a host of database management issues, many of which are similar to those found in the text databases as well. In this paper, we explore a range of system design and performance issues that arise when integrating a dynamic lexicon with a dynamic full-text information retrieval system. We observe that the principle of functional isolation argues against the use of language-dependent information in article indexes and favors the use of query-time strategies for applying lexical knowledge. We propose and evaluate a system architecture which embodies this principle. We also show how a storage and retrieval infrastructure based on Burkowski's [BURKOWSKI92] "containment model" abstraction can be employed to implement both the text retrieval and lexicon facilities required in an integrated system.

机译:

在信息检索社区中,人们对评估语言知识的使用以改善文本数据库的索引和搜索产生了浓厚的兴趣。这样的系统通常必须使用词典来存储有关构成应用程序域的单词和短语的信息。与 static 词典不同, dynamic 词典引起了有关词典状态与基于词汇知识的IR索引方案之间的协调的实际问题。此外,它引入了许多数据库管理问题,其中许多问题也与文本数据库中的问题类似。在本文中,我们探讨了将动态词典与动态全文信息检索系统集成在一起时出现的一系列系统设计和性能问题。我们观察到,功能隔离的原则反对在文章索引中使用依赖语言的信息,并且赞成使用查询时间策略来应用词汇知识。我们提出并评估了体现这一原理的系统架构。我们还将展示如何使用基于Burkowski的[BURKOWSKI92]“包含模型”抽象的存储和检索基础结构来实现集成系统中所需的文本检索和词典功能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号