首页>
外国专利>
Method for mining path traversal patterns in a web environment by converting an original log sequence into a set of traversal sub-sequences
Method for mining path traversal patterns in a web environment by converting an original log sequence into a set of traversal sub-sequences
展开▼
机译:通过将原始日志序列转换为一组遍历子序列来挖掘Web环境中路径遍历模式的方法
展开▼
页面导航
摘要
著录项
相似文献
摘要
An efficient computer implemented method of mining path traversal patterns in a communications network. The method of the present invention comprises two steps. A method, called MF (standing for maximal forward references), is first used to convert an original sequence of log data into a set of traversal subsequences. Each traversal subsequence represents a maximal forward reference from the starting point of a user access. This step of converting the original log sequence into a set of maximal forward references will filter out the effect of backward references which are mainly made for ease of traveling, and enable us to concentrate on mining meaningful user access sequences. Accordingly, when backward references occur, a forward reference path terminates. This resulting forward reference path is termed a maximal forward reference. After a maximal forward reference is obtained, we back track to the starting point of the forward reference and begin a new forward reference path. In addition, the occurrence of a null source node also indicates the termination of an ongoing forward reference path and the beginning of a new one. Second, methods are developed to determine the frequent traversal patterns, termed large reference sequences, from the maximal forward references obtained above, where a large reference sequence is a reference sequence that appeared a sufficient number of times in the database to exceed a predetermined threshold.
展开▼