Multi-user file system search.

Abstract

Information retrieval research usually deals with globally visible, static document collections. Practical applications, in contrast, like file system search and enterprise search, have to cope with highly dynamic text collections and have to take into account user-specific access permissions when generating the results to a search query.

The techniques proposed in this thesis are evaluated theoretically, based on a Zipfian model of term distribution, and through a large number of experiments involving text collections of non-trivial size, varying between a few gigabytes and a few hundred gigabytes.

The goal of this thesis is to close the gap between information retrieval research and the requirements exacted by these real-life applications. The algorithms and data structures presented in this thesis can be used to implement a file system search engine that is able to react to changes in the file system by updating its index data in real time. File changes (insertions, deletions, or modifications) are reflected by the search results within a few seconds, even under a very high system workload. The search engine exhibits a low main memory consumption. By integrating security restrictions into the query processing logic, as opposed to applying them in a postprocessing step, it produces search results that are guaranteed to be consistent with the access permissions defined by the file system.
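
The contrast between applying permissions inside query processing and applying them afterwards can be pictured with the toy index below. It is a minimal sketch under stated assumptions, not the thesis's implementation: the class `TinyIndex`, its methods, the per-file reader sets, and the TF-IDF-style scoring are all hypothetical names and choices made for illustration. The point it tries to show is that the set of visible files is fixed before any scoring happens, so both the result list and the ranking statistics depend only on files the querying user may read. A postprocessing filter, by comparison, would typically rank with collection-wide statistics first and hide forbidden files only at the end.

```python
import math
from collections import defaultdict


class TinyIndex:
    """Toy in-memory inverted index (illustrative assumption, not the thesis's design)."""

    def __init__(self):
        self.postings = defaultdict(dict)  # term -> {doc_id: term frequency}
        self.readers = {}                  # doc_id -> set of users allowed to read it

    def add(self, doc_id, text, readers):
        # Treat a file modification as a deletion followed by an insertion.
        self.remove(doc_id)
        self.readers[doc_id] = set(readers)
        for term in text.lower().split():
            self.postings[term][doc_id] = self.postings[term].get(doc_id, 0) + 1

    def remove(self, doc_id):
        if self.readers.pop(doc_id, None) is None:
            return  # unknown file, nothing to do
        for term in list(self.postings):
            self.postings[term].pop(doc_id, None)
            if not self.postings[term]:
                del self.postings[term]

    def search(self, user, query):
        # Permissions are applied first: only files readable by `user` exist
        # as far as the rest of the query is concerned.
        visible = {d for d, allowed in self.readers.items() if user in allowed}
        scores = defaultdict(float)
        for term in query.lower().split():
            hits = {d: tf for d, tf in self.postings.get(term, {}).items()
                    if d in visible}
            if not hits:
                continue
            # Document frequency is counted over visible files only.
            idf = math.log(1 + len(visible) / len(hits))
            for d, tf in hits.items():
                scores[d] += tf * idf
        # Highest score first; ties broken by file name for determinism.
        return sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))


index = TinyIndex()
index.add("a.txt", "budget report", {"alice"})
index.add("b.txt", "secret budget budget", {"bob"})

before = index.search("alice", "budget")
index.remove("b.txt")                       # change a file alice cannot read
after = index.search("alice", "budget")
```

In this sketch `before` and `after` are identical, because nothing about `b.txt` ever enters the computation for `alice`. The index is also updated in place on every `add` and `remove`, which loosely mirrors the abstract's requirement that file changes show up in search results quickly; a real system would need far more careful data structures to do this at the collection sizes mentioned above.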

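Bibliographic record

  • Author

    Buttcher, Stefan

  • Affiliation

    University of Waterloo (Canada)

  • Degree-granting institution: University of Waterloo (Canada)
  • Subject: Computer Science
  • Degree: Ph.D.
  • Year: 2007
  • Pages: 232 p.
  • Total pages: 232
  • Original format: PDF
  • Language of text: English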