Journal: ACM Transactions on Internet Technology

Automated Gathering of Web Information: An In-Depth Examination of Agents Interacting with Search Engines

Abstract

The Web has become a worldwide repository of information that individuals, companies, and organizations use to address various information problems. Many of these Web users rely on automated agents to gather this information for them. Some assume that this approach represents a more sophisticated method of searching. However, there is little research investigating how Web agents search for online information. In this research, we first provide a classification for information agents using stages of information gathering, gathering approaches, and agent architecture. We then examine an implementation of one of the resulting classifications in detail, investigating how agents search for information on Web search engines, including the session, query, term, duration, and frequency of interactions. For this temporal study, we analyzed three data sets of queries and page views from agents interacting with the Excite and AltaVista search engines from 1997 to 2002, examining approximately 900,[...] Web agents are searching for a relatively limited variety of information, wherein only 18% of the terms used are unique, and (4) the duration of agent-Web search engine interaction typically spans several hours. We discuss the implications for Web information agents and search engines.