Web navigation sequences from different users often differ in length. To address this, edit distance is introduced as the measure of dissimilarity between navigation sequences. Web navigation patterns are then analyzed with the Two-Threshold Sequential Algorithm, which does not require the number of clusters to be specified in advance and reduces the dependence on the order in which navigation sequences are presented for clustering. Simulation experiments on data derived from real data demonstrate the effectiveness and flexibility of the method.
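The two ingredients named in the abstract can be sketched as follows. This is a minimal illustration, not the authors' implementation: the abstract does not say how the distance from a sequence to a cluster is defined, so the sketch assumes the minimum edit distance to any cluster member. The function names, the threshold parameters `theta1` and `theta2`, and the sample sequences are all hypothetical.

```python
def edit_distance(a, b):
    """Levenshtein distance between two sequences of page identifiers."""
    prev = list(range(len(b) + 1))
    for i, x in enumerate(a, start=1):
        curr = [i]
        for j, y in enumerate(b, start=1):
            cost = 0 if x == y else 1
            curr.append(min(prev[j] + 1,          # delete x
                            curr[j - 1] + 1,      # insert y
                            prev[j - 1] + cost))  # substitute or match
        prev = curr
    return prev[-1]


def two_threshold_clustering(sequences, theta1, theta2):
    """Two-threshold sequential clustering over variable-length sequences.

    Assumption: the distance from a sequence to a cluster is the minimum
    edit distance to any member of that cluster.
    Returns a list of clusters, each a list of indices into `sequences`.
    """
    if theta1 >= theta2:
        raise ValueError("theta1 must be smaller than theta2")
    clusters = []
    unassigned = list(range(len(sequences)))
    force_new = True  # the first sequence always seeds a cluster
    while unassigned:
        deferred = []
        progressed = False
        for idx in unassigned:
            if force_new:
                clusters.append([idx])
                force_new = False
                progressed = True
                continue
            # Nearest cluster under the assumed min-distance rule.
            d, k = min(
                (min(edit_distance(sequences[idx], sequences[m]) for m in c), k)
                for k, c in enumerate(clusters)
            )
            if d < theta1:
                clusters[k].append(idx)   # clearly close: join the cluster
                progressed = True
            elif d > theta2:
                clusters.append([idx])    # clearly far: start a new cluster
                progressed = True
            else:
                deferred.append(idx)      # ambiguous: decide in a later pass
        unassigned = deferred
        # If a whole pass assigned nothing, force the next sequence to seed
        # a new cluster so the loop always terminates.
        force_new = not progressed
    return clusters


# Hypothetical navigation sequences; each letter stands for one page.
sessions = [list("ABCD"), list("ABCE"), list("XYZ"), list("XYZW"), list("ABXY")]
result = two_threshold_clustering(sessions, theta1=2, theta2=3)
print(result)
```

Sequences whose nearest-cluster distance falls between the two thresholds are deferred rather than assigned immediately. This is what weakens the dependence on presentation order, and the number of clusters emerges from the thresholds instead of being fixed beforehand.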