Web Page Template and Data Separation for Better Maintainability

Abstract

Separating a web page into template code and the data records that populate the template is an important problem with a wide range of applications in web page compression and information extraction. We study this problem with the aim of separating a web page into easily maintainable template code and data records. We show that this problem is NP-hard. We then propose a heuristic algorithm to solve the problem. The main idea of our algorithm is to parse a web page into a tree and then to process it recursively in a bottom-up manner with three steps: splitting, folding, and alignment. We perform experiments on real datasets to evaluate the performance of our proposed algorithms in maximizing the maintainability of the template code produced. The experimental results show that our proposed algorithms outperform the baseline algorithms by 25% in the maintainability measure.
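The abstract gives no implementation detail, so the following is only a minimal illustrative sketch of the general idea of parsing a page into a tree and folding repeated structure into a template plus data records. It is not the authors' algorithm: it covers only a simplified folding step, does not attempt splitting, alignment, or any maintainability measure, and assumes well-formed HTML without void elements and with text only in leaf elements. All names (Node, TreeBuilder, signature, separate) and the loop and slot syntax in the output are hypothetical.

```python
from dataclasses import dataclass, field
from html.parser import HTMLParser


@dataclass
class Node:
    tag: str
    children: list = field(default_factory=list)
    text: str = ""


class TreeBuilder(HTMLParser):
    """Build a Node tree; assumes well-formed HTML with no void elements."""

    def __init__(self):
        super().__init__()
        self.root = Node("root")
        self.stack = [self.root]

    def handle_starttag(self, tag, attrs):
        node = Node(tag)
        self.stack[-1].children.append(node)
        self.stack.append(node)

    def handle_endtag(self, tag):
        if len(self.stack) > 1:
            self.stack.pop()

    def handle_data(self, data):
        self.stack[-1].text += data.strip()


def parse(html):
    builder = TreeBuilder()
    builder.feed(html)
    builder.close()
    return builder.root


def signature(node):
    """Structural shape of a subtree, ignoring its text."""
    return (node.tag, tuple(signature(c) for c in node.children))


def leaf_texts(node):
    """Leaf text values in document order: one data record."""
    if not node.children:
        return [node.text]
    values = []
    for child in node.children:
        values.extend(leaf_texts(child))
    return values


def item_template(node, counter):
    """Template for one repeated item: each leaf text becomes a numbered slot."""
    if not node.children:
        inner = "{{f" + str(counter[0]) + "}}"
        counter[0] += 1
    else:
        inner = "".join(item_template(c, counter) for c in node.children)
    return "<" + node.tag + ">" + inner + "</" + node.tag + ">"


def separate(node, records):
    """Return template code for node; folded data is appended to records."""
    parts = [node.text]
    kids = node.children
    i = 0
    while i < len(kids):
        # Find the run of consecutive siblings with the same shape as kids[i].
        j = i + 1
        while j < len(kids) and signature(kids[j]) == signature(kids[i]):
            j += 1
        if j - i >= 2:
            # Fold the run into one loop and move its text into a record set.
            records.append([leaf_texts(k) for k in kids[i:j]])
            name = "r" + str(len(records) - 1)
            parts.append("{% for " + name + " %}"
                         + item_template(kids[i], [0]) + "{% end %}")
        else:
            parts.append(separate(kids[i], records))
        i = j
    inner = "".join(parts)
    if node.tag == "root":
        return inner
    return "<" + node.tag + ">" + inner + "</" + node.tag + ">"


records = []
page = "<ul><li><b>A</b><i>1</i></li><li><b>B</b><i>2</i></li></ul>"
print(separate(parse(page), records))
# <ul>{% for r0 %}<li><b>{{f0}}</b><i>{{f1}}</i></li>{% end %}</ul>
print(records)
# [[['A', '1'], ['B', '2']]]
```

In this sketch the two list items collapse into a single loop body, and their text moves into one record set. A real separation method would also have to handle siblings that are similar but not identical, which is presumably where steps such as splitting and alignment come in.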

Bibliographic record

Source: International Conference on Web Information Systems Engineering | 2018 | 516p | 11 pages
Authors: Chenxu Zhao; Rui Zhang; Jianzhong Qi
Original format: PDF
Chinese Library Classification: TP393-53
Keywords: Web page template extraction; Maintainability index; Dual teaching and learning based optimization

Similar literature

1. The Personal Sequence Database: a suite of tools to create and maintain web-accessible sequence databases [J]. Scott A Givan, Christopher M Sullivan, James C Carrington. BMC Bioinformatics, 2007, No. 1.
2. Mapping Databases To Ontologies To Design And Maintain Data In A Semantic Web Environment [J]. Olivier Curé. Journal of Systemics, Cybernetics and Informatics, 2006, No. 4.
3. Open tubular capillary columns with basic templates made by the generalized preparation protocol in capillary electrochromatography chiral separation and template structural effects on chiral separation capability [J]. Zaidi S.A., Lee S.M., Cheong W.J. Journal of chromatography, A: Including electrophoresis and other separation methods, 2011, No. 9.
4. Web Page Template and Data Separation for Better Maintainability [C]. Chenxu Zhao, Rui Zhang, Jianzhong Qi. International conference on web information systems engineering; WISE international workshop on data quality and trust in big data; International workshop on edge-based computing for next-generation wireless networks; International workshop on information security and privacy for mobile cloud computing, web, and internet of things; International workshop on cloud computing economic, 2018.
5. Separations using biological carriers immobilized in porous polymeric and sol-gel template synthesized nanotubular membranes [D]. Lakshmi, Brinda B. 1998.
6. The Personal Sequence Database: a suite of tools to create and maintain web-accessible sequence databases [O]. Scott A Givan, Christopher M Sullivan, James C Carrington. 2007.
7. A double-model approach to achieve effective model-view separation in template based web applications [O]. F. J. García, Raúl Izquierdo Castanedo, Aquilino A. Juan Fuente. 2015.
8. Mapping the footsteps of the green anole: A template for publishing ecological data on the World Wide Web [R]. Carnes, E. T., Truett, D. F., Truett, L. F. 1996.
