This paper presents a new metric, Page Rank × Inverse Links-to-word count Ratio (PR × ILW), used in classifying web pages as content or navigation. The metric combines web page size and number of hyperlinks on a page, and the page rank metric based on website topology, to compute the new metric. We present a theoretical basis for the new metric, and the results of a web page classification study, which show that the new metric, when combined with the links-to-word count ratio of web pages, accurately classifies the pages into the two categories.
展开▼