...
【24h】

Information Retrieval on the Web

机译:网上信息检索

获取原文
获取原文并翻译 | 示例
           

摘要

How do we find information on the Web? Although information on the Web is distributed and decentralized, the Web can be viewed as a single, virtual document collection. In that regard, the fundamental questions and approaches of traditional information retrieval (IR) research (e.g., term weighting, query expansion) are likely to be relevant in Web document retrieval. Findings from traditional IR research, however, may not always be applicable in a Web setting. The Web document collection-massive in size and diverse in content, format, purpose, and quality-challenges the validity of previous research findings that are based on relatively small and homogeneous test collections. Moreover, some traditional IR approaches, although applicable in theory, may be impossible or impractical to implement in a Web setting. For instance, the size, distribution, and dynamic nature of Web information make it extremely difficult to construct a complete and up-to-date data representation of the kind required for a model IR system.
机译:我们如何在网上找到信息?尽管Web上的信息是分布式和分散的,但Web可以看作是单个虚拟文档集合。在这方面,传统信息检索(IR)研究的基本问题和方法(例如术语权重,查询扩展)可能与Web文档检索有关。但是,来自传统IR研究的结果可能并不总是适用于Web设置。 Web文档集合规模庞大,内容,格式,目的和质量各不相同,这挑战了以前的研究成果的有效性,这些研究成果是基于相对较小且同类的测试集合而来的。此外,一些传统的IR方法虽然在理论上适用,但在Web设置中可能无法实现或不切实际。例如,Web信息的大小,分布和动态性质使构建模型IR系统所需类型的完整且最新的数据表示形式极为困难。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号