Deep Web Repeated Pattern Discovering Based on the Largest Block Strategy

机译：基于最大块策略的深度Web重复模式发现

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Repeated pattern is a common phenomenon in query result pages of deep web sites. The deep web back-end data can be accessed by mining repeated patterns. So far, most of the algorithms of discovering repeated pattern use traditional web information extraction methods. But the recall percentage and accuracy are not high. How to obtain the repeated pattern accurately and completely is still a difficulty. We propose a method based on the largest block strategy to discover such pattern. The core of the method is using the largest block strategy to discover the repeated pattern layer. We can quickly navigate to the region of the entity data, and then analyze the sub tree in this area, finally, get the simplified repeated pattern of the deep web site. According to the results of the experiment, this method can get the repeated pattern data more accurately and more completely than the traditional methods. It can also address the multi-pattern problem which has not been solved yet in other methods.

机译：重复模式是深层网站的查询结果页面中的常见现象。可以通过挖掘重复的模式来访问深层Web后端数据。到目前为止，发现重复模式的大多数算法都使用传统的Web信息提取方法。但是召回率和准确性不高。如何准确，完整地获得重复图案仍然是一个难题。我们提出了一种基于最大块策略的方法来发现这种模式。该方法的核心是使用最大的块策略来发现重复的图案层。我们可以快速导航到实体数据的区域，然后分析该区域中的子树，最后，获得深度网站的简化重复模式。根据实验结果，该方法可以比传统方法更准确，更完整地获得重复的图案数据。它还可以解决在其他方法中尚未解决的多模式问题。

著录项

来源
《2012 IEEE 12th International Conference on Computer and Information Technology.》|2012年|p.1082- 1086|共5页
会议地点 Chengdu(CN);Chengdu(CN)
作者
Ye Feiyue; Tang Haibo; Luo Xiangfeng;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类计算技术、计算机技术;计算技术、计算机技术;
关键词
入库时间 2022-08-26 14:29:09

相似文献

外文文献
中文文献
专利

1. A fuzzy neural network based framework to discover user access patterns from web log data [J] . Ansari Zahid A., Sattar Syed Abdul, Babu A. Vinaya Advances in data analysis and classification . 2017,第3期

机译：基于模糊的神经网络的框架，用于发现来自Web日志数据的用户访问模式
2. A study of students' heuristics and strategy patterns in web-based reciprocal peer assessment for science learning [J] . Tsivitanidou Olia E., Constantinou Constantinos P. The internet and higher education . 2016,第apra期

机译：基于网络的互惠同peer评估科学学习中学生的启发式和策略模式的研究
3. CMOS compatible strategy based on selective atomic layer deposition of a hard mask for transferring block copolymer lithography patterns [J] . Gay G., Baron T., Agraffeil C., Nanotechnology . 2010,第43期

机译：基于用于转移嵌段共聚物光刻图案的硬掩模的选择性原子层沉积的CMOS兼容策略
4. An algorithm to find the largest adjacent repeated pattern based on Suffix Tree [C] . Yuan-Chun Xu Consumer Electronics, Communications and Networks (CECNet), 2012 2nd International Conference on . 2012

机译：基于后缀树的最大相邻重复图案查找算法
5. Discovering and mining user Web-page traversal patterns. [D] . Mortazavi-Asl, Behzad. 2001

机译：发现和挖掘用户网页遍历模式。
6. miRvestigator: web application to identify miRNAs responsible for co-regulated gene expression patterns discovered through transcriptome profiling [O] . Christopher L. Plaisier, J. Christopher Bare, Nitin S. Baliga 2011

机译：miRvestigator：用于识别通过转录组分析发现的共同调控基因表达模式的miRNA的Web应用程序
7. Relationships among student attitudes, motivation, learning styles, learning strategies, patterns of learning and achievement: a formative evaluation of distance education via Web-based courses [O] . Shih, Ching-Chun 1998

机译：学生态度，动机，学习方式，学习策略，学习方式和成就之间的关系：通过基于网络的课程对远程教育的形成性评估

Deep Web Repeated Pattern Discovering Based on the Largest Block Strategy

摘要

著录项

相似文献

相关主题

期刊订阅