首页> 外文会议> >Identification of requirements for focused crawlers in technology intelligence
【24h】

Identification of requirements for focused crawlers in technology intelligence

机译:确定技术情报中重点爬虫的要求

获取原文
获取原文并翻译 | 示例

摘要

The fast and high availability of knowledge is at first seen as a benefit for knowledge workers in the information age. On closer examination the outcome of this is a big challenge: The amount of data that is available these days has to be reasonably structured and conditioned. Only the US Library of Congress collected 235 terabyte of data on its own by April 2011. Technology intelligence as a fundamental component of technology management is expected to monitor these data, so technology managers are able to respond to new developments and trends just in time. Possible tools to meet this challenge in an efficient way are the focused crawlers. These are programs, which explore data collections independently to identify material related to the current working context. To implement such a tool, there exist a multitude of different approaches within the field of information retrieval, but they have to be used and combined on an individual basis to fit the requirements of a particular task. Hence, before a focused crawler can make the processes of technology intelligence more efficient, the dedicated requirements have to be identified. In this paper we develop a requirements model to close this gap.
机译:起初,知识的快速和高度可用性对信息时代的知识工作者来说是一种好处。从更仔细的研究来看,这是一个很大的挑战:必须合理地构造和调整这些天可用的数据量。到2011年4月,只有美国国会图书馆自己收集了235 TB的数据。技术情报作为技术管理的基本组成部分,有望监视这些数据,因此技术经理能够及时响应新的发展和趋势。专注的爬虫可以有效地应对这一挑战。这些程序可独立探索数据收集,以识别与当前工作环境相关的材料。为了实现这种工具,在信息检索领域内存在多种不同的方法,但是必须单独使用和组合它们以适合特定任务的要求。因此,在专注的搜寻器可以使技术智能的过程更高效之前,必须先确定专用要求。在本文中,我们开发了一个需求模型来弥补这一差距。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号