EFFICIENT BANDWIDTH UTILIZATION FOR DOWNLOADING WEB PAGES

Anirban Kundu

首页> 外文期刊>International Journal of Computers & Applications >EFFICIENT BANDWIDTH UTILIZATION FOR DOWNLOADING WEB PAGES

【24h】

EFFICIENT BANDWIDTH UTILIZATION FOR DOWNLOADING WEB PAGES

机译：高效地利用带宽下载网页

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Web crawler is a computer program that browses World Wide Web in methodical and automated manner. Latest crawling techniques in use are parallel crawling and hierarchical crawling. In later case, total Web site is extracted by dividing it into a few levels. The homepage from which crawling process starts is considered to be the first level. All the hyperlinks present on that Web page all together is considered to be the next level and so on. In this crawling process all the Web pages at a single level gets downloaded simultaneously by the creation of multiple crawlers dynamically depending on the number of hyperlinks on that level. But in real-life scenario the bandwidth available is limited and acts as a deterrent in this case. In this paper, a scheduling algorithm has been proposed on the basis of the sizes of the Web pages to make full utilization of the bandwidth available. To achieve this, a modified type of queue (Y-type) is introduced where URLs of the Web pages are kept in an orderly manner and they are released in such a way that the total size of the Web pages issued is closest to the bandwidth available.

机译：Web搜寻器是一种计算机程序，可以有条不紊和自动化地浏览万维网。最新使用的爬网技术是并行爬网和分层爬网。在以后的情况中，通过将整个网站划分为几个级别来提取整个网站。从其开始抓取过程的主页被认为是第一级。该网页上出现的所有超链接一起被认为是下一个级别，依此类推。在此爬网过程中，通过创建多个爬网程序来动态下载单个级别上的所有网页，具体取决于该级别上超链接的数量。但是在现实生活中，可用带宽是有限的，并且在这种情况下起到了威慑作用。在本文中，基于网页的大小提出了一种调度算法，以充分利用可用带宽。为实现此目的，引入了一种修改的队列（Y型），其中网页的URL有序地保存，并以使发布的网页的总大小最接近带宽的方式释放它们。可用。

著录项

来源
《International Journal of Computers & Applications》 |2014年第1期|1-6|共6页
作者
Anirban Kundu;
展开▼
作者单位

Innovation Research Lab (IRL), Howrah, West Bengal, India 711103 Kuang-Chi Institute of Advanced Technology, Shenzhen, P. R. China 518057;

展开▼
收录信息美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类
关键词
Crawling; hierarchical crawling; bandwidth utilization; Y-type queue;

机译：爬行;分级爬网;带宽利用率;Y型队列;
入库时间 2022-08-18 00:39:01

相似文献

外文文献
中文文献
专利

1. A Parallel Downloading Method to Utilize Variable Bandwidth [J] . Junichi FUNASAKA, Nozomi NAKAWAKI, Kenji ISHIDA, IEICE Transactions on Communications . 2003,第10期

机译：利用可变带宽的并行下载方法
2. Efficient upstream bandwidth utilization with minimum bandwidth waste for time and wavelength division passive optical network [J] . Butt Rizwan Aslam, Faheem M., Ashraf M. Waqar Optical and quantum electronics . 2020,第1期

机译：有效的上行带宽利用率，时分和波分无源光网络的带宽浪费最少
3. A hybrid WDM ring-tree topology delivering efficient utilization of bandwidth over resilient infrastructure [J] . Singh Sukhbir, Singh Surinder Photonic network communications . 2018,第3期

机译：混合WDM环形树拓扑可通过弹性基础架构有效地利用带宽
4. Distributed bandwidth reservation strategies to support efficient bandwidth utilization and QoS on a per-link basis in IEEE 802.16 Mesh Networks [C] . Mogre Parag S., Hollick Matthias, Steinmetz Ralf, Local Computer Networks, 2009. LCN 2009 . 2009

机译：分布式带宽预留策略可支持IEEE 802.16 Mesh网络中每个链路的有效带宽利用率和QoS
5. Algorithms for efficient utilization of wireless bandwidth and to provide quality-of-service in wireless networks. [D] . Kakani, Naveen Kumar. 2000

机译：有效利用无线带宽并在无线网络中提供服务质量的算法。
6. An Efficient Time-Varying Filter for Detrending and Bandwidth Limiting the Heart Rate Variability Tachogram without Resampling: MATLAB Open-Source Code and Internet Web-Based Implementation [O] . A. Eleuteri, A. C. Fisher, D. Groves, 2012

机译：一个有效的时变过滤器用于不进行重采样的趋势和带宽限制心率变异性速度图：MATLAB开源代码和基于Internet网络的实现
7. An Efficient Bandwidth Optimization and Minimizing Energy Consumption Utilizing Efficient Reliability and Interval Discrepant Routing (ERIDR) Algorithm [O] . Sivashanmugam N, Jothi Venkateshwaran C 2018

机译：利用高效可靠性和间隔差异路由（ERIDR）算法利用高效的带宽优化和最小化能耗

EFFICIENT BANDWIDTH UTILIZATION FOR DOWNLOADING WEB PAGES

摘要

著录项

相似文献

相关主题

期刊订阅