The increasing complexity, heterogeneity and dynamism of the web and its applications have made retrieved web information less current, less relevant and harder to manage. Search engines face the problem of keeping the contents of their repositories consistent with the pages present in the global database while using resources optimally. This paper proposes a Traffic Adaptive Optimum updating Scheme (TAOS) to eliminate needless requests by web crawlers when updating the search engine repository. The scheme also incorporates partial upload of an updated document to the search engine repository. A self-managing autonomic computing architecture is proposed to regulate the load on network bandwidth and web servers. The proposed updating scheme is compared, in terms of the freshness of the search engine repository, with the page refresh policies used by web crawlers. The load on network bandwidth and web servers is also analysed for effective resource utilization and compared with the load incurred during crawler-based updating.