Conference: International conference for the International Association for Management of Technology

CONFIGURATION MODEL FOR FOCUSED CRAWLERS IN TECHNOLOGY INTELLIGENCE


Abstract

Competitive pressure has risen steadily as a result of ongoing globalization and dynamically growing markets, and technology intelligence has therefore become an important element of strategic business intelligence. Its objective is the systematic identification of the future opportunities and threats that new technologies and further technological developments pose for companies. Operating technology intelligence efficiently requires access to up-to-date, relevant, and sufficiently complete information. Digitalization has made information more available than ever, but it has also created the problem of information overload: the available mass of data has to be searched, sorted, and assessed to identify the information that is actually needed. In addition, the entire information-processing effort has to run continuously or be repeated for each new object of investigation; otherwise the results lose their validity. It therefore appears reasonable to automate this process through extensive use of smart software solutions. One promising approach is "focused crawling", which does not merely traverse given data sources on the web but also rates each data record in order to decide autonomously which information is relevant to the further process and which data records should be analyzed next. Several approaches to implementing such crawlers exist in the field of information retrieval, for example different rating and discovery algorithms. This paper presents the status quo of ongoing research to develop a configuration model for focused crawlers that fulfills the varying requirements of technology intelligence tasks. First, the assessment criteria for information in a technology intelligence process and the configuration options of focused crawlers are described. As a result, a first approach to matching the requirements of technology intelligence tasks with the consequences of different focused crawler configurations is presented. In closing, the paper explains how this approach will be improved and validated in future case studies.
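
To make the idea of focused crawling concrete, the following is a minimal sketch of a best-first crawler, not the configuration model or any algorithm from the paper. It assumes a hypothetical in-memory link graph (`PAGES`) in place of the real web, a simple term-overlap rating function (`relevance`), and a discovery rule in which outgoing links inherit the score of the page they were found on; all names, parameters, and data are illustrative assumptions.

```python
import heapq

# Hypothetical toy "web": page id -> (text, outgoing links). Illustrative data only.
PAGES = {
    "seed": ("technology news portal", ["a", "b"]),
    "a": ("solid state battery technology patent", ["c"]),
    "b": ("celebrity gossip", ["d"]),
    "c": ("battery technology roadmap", []),
    "d": ("sports results", []),
}


def relevance(text, topic_terms):
    """Rating step: fraction of topic terms that occur in the text (an assumed stand-in)."""
    words = set(text.lower().split())
    return sum(term in words for term in topic_terms) / len(topic_terms)


def focused_crawl(seed, topic_terms, threshold=0.6, max_pages=10):
    """Best-first crawl: always visit the most promising known page next."""
    frontier = [(-1.0, seed)]  # heapq is a min-heap, so priorities are negated
    seen = {seed}
    relevant = []
    visited = 0
    while frontier and visited < max_pages:
        _, page = heapq.heappop(frontier)
        text, links = PAGES[page]
        visited += 1
        score = relevance(text, topic_terms)
        if score >= threshold:
            relevant.append(page)  # keep the record for the further process
        for link in links:
            if link not in seen:
                seen.add(link)
                # Discovery step (assumed rule): a link inherits its parent's score.
                heapq.heappush(frontier, (-score, link))
    return relevant


if __name__ == "__main__":
    print(focused_crawl("seed", ["battery", "technology"], max_pages=3))
```

In this sketch the rating function, the relevance threshold, the link-priority rule, and the page budget are the adjustable parts; which settings suit a given technology intelligence task is the question the configuration model addresses.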
