In this paper we will present a way to measure the similarity of two web sites. The web sites used are composed only of web pages created with HTML. We will present and compare two algorithms for calculating the similarity degree between two web sites. The first algorithm compares all the web pages of the web sites while the second one selects, after a certain criterion, only a part of the web pages using a relation between them. To implement the algorithms and compare the results we used Java language.
展开▼