Using Visual Clues Concept for Extracting Main Data from Deep Web Pages

机译：使用Visual Clues概念从深层网页提取主数据

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Extracting data from deep Web pages is a challenging problem due to the underlying intricate structures of such pages. A large number of techniques have been proposed to address this problem, but all of them have inherent limitations because they are Web-page-programming-language-dependent. The contents on Web pages are always displayed regularly for users to browse. There is different ways for deep Web data extraction to overcome the limitations of previous works by utilizing some interesting common visual features on the deep Web pages. In this paper vision-based approach is web page programming-language-independent approach is proposed. This approach utilizes the visual features of the web pages to extract data from deep web pages including data record extraction and data item extraction. Again we also propose a new evaluation measure revision to capture human effort needed to produce exact extraction of data. Our implementation on large set of web databases describes the proposed vision-based approach is highly effective for data extraction from deep web pages.

机译：由于深层网页的底层复杂结构，因此从深层网页中提取数据是一个具有挑战性的问题。已经提出了许多技术来解决这个问题，但是由于它们是与网页编程语言相关的，所以它们都具有固有的局限性。网页上的内容始终定期显示，以供用户浏览。通过利用深层Web页面上的一些有趣的通用视觉功能，深层Web数据提取有多种方法可以克服先前工作的局限性。本文提出了一种基于视觉的方法，即网页编程与语言无关的方法。这种方法利用网页的视觉特征从深层网页中提取数据，包括数据记录提取和数据项提取。同样，我们还提出了一种新的评估方法修订版，以捕获为准确提取数据所需的人工。我们在大型Web数据库上的实现描述了所提出的基于视觉的方法对于从深层网页提取数据非常有效。

著录项

来源
《2014 International Conference on Electronic Systems, Signal Processing, and Computing Technologies》|2014年|190-193|共4页
会议地点 Nagpur(IN)
作者
Pusdekar Satish J.; Chhaware Shaikh Phiroj;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类
关键词
visual features for web pages; web data extraction; web data mining;

机译：网页的视觉功能；网络数据提取；网络数据挖掘；;

相似文献

外文文献
中文文献
专利

1. GeoArchaeology Web 2.0: Geospatial Information Services Facilitate New Concepts of Web-Based Data Visualization Strategies in Archaeology—Two Case Studies from Surveys in Sudan (Wadi) and Turkey (Doliche) [J] . Torsten Prinz, Stephanie Walter, André Wieghardt, Archaeological Discovery . 2014,第4期

机译：GeoArchaeology Web 2.0：地理空间信息服务促进考古学中基于Web的数据可视化策略的新概念-来自苏丹（Wadi）和土耳其（Doliche）的两个案例研究
2. WEB-BASED SPATIAL DATA QUALITY VISUALISATION: CONCEPTS AND IMPLEMENTATION [J] . Ting Yang, Jinling Wang Geomatics Research Australasia . 2004,第80期

机译：基于Web的空间数据质量可视化：概念和实现
3. Formal concept analysis approach for data extraction from a limited deep web database [J] . Zhuo Zhang, Juan Du, Liming Wang Journal of Intelligent Information Systems . 2013,第2期

机译：从有限的深度Web数据库提取数据的形式化概念分析方法
4. Using Visual Clues Concept for Extracting Main Data from Deep Web Pages [C] . Satish J. Pusdekar, Shaikh. Phiroj Chhaware International Conference on Electronic Systems, Signal Processing and Computing . 2014

机译：使用视觉线索概念从深网页提取主要数据
5. From document clues to descriptive metadata: Document characteristics used by graduate students in judging the usefulness of Web documents. [D] . Lan, Wen-Chin. 2002

机译：从文档线索到描述性元数据：研究生在判断Web文档有用性时使用的文档特征。
6. Pergola-web: a web server for the visualization and analysis of longitudinal behavioral data using repurposed genomics tools and standards [O] . Jose Espinosa-Carrasco, Toni Hermoso Pulido, Ionas Erb, 2019

机译：Pergola-web：使用重新设计的基因组学工具和标准对纵向行为数据进行可视化和分析的Web服务器
7. Visually Extracting Data Records from the Deep Web [O] . Neil Anderson, Jun Hong 2014

机译：从Deep Web直观地提取数据记录

Using Visual Clues Concept for Extracting Main Data from Deep Web Pages

摘要

著录项

相似文献

相关主题

期刊订阅