首页>
外国专利>
WEB CRAWLING INITIAL POINT SELECTION SYSTEM, METHOD, AND PROGRAM
WEB CRAWLING INITIAL POINT SELECTION SYSTEM, METHOD, AND PROGRAM
展开▼
机译:网页抓取初始点选择系统,方法和程序
展开▼
页面导航
摘要
著录项
相似文献
摘要
A graph construction means calculates the weight of web data according to the extent in which the web data matches the information associated with a designated category, and constructs a weighted directed graph which is a graph including the weight of the web data and the directed link between the web data. An initial point selection means selects the web data with the highest score on the basis of a rule in which, with reference to the weighted directed graph, the higher the weight between a web data and another linked web data, the higher the score of the latter is calculated. A crawling depth determination means determines the depth from the initial point in which the web data is crawled on the basis of a rule in which, with reference to the weighted directed graph, the score is calculated lower as the number of web data at a depth from the initial point increases.
展开▼