Multi Level Web Data Extraction Based Topical Visual Structure Clustering for Efficient Web Search

Sureshkumar T; Shanthi N

Abstract

Web clustering has been approached with various strategies; however, existing methods perform poorly because of the weak data extraction and clustering approaches they use. Most methods use only the textual features of a web document and ignore its other features. To improve the performance of web data extraction and clustering in support of web search, the authors present an efficient multi-level web data extraction technique. The web document is preprocessed to obtain textual, structural, and visual features. The extracted features are used to perform Topical-Visual-Structure (TVS) clustering. Each class of the clustering is organized into three sub-classes. The method first computes a topical similarity measure to identify the class of the document. Visual similarity and structural similarity measures are then used to identify the next-level subclass within each cluster. The method aims to improve web search performance, and it identifies from the input query the type of result the user expects. The proposed method increases the performance of web data extraction and web search.
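
The two-level assignment described in the abstract can be illustrated with a minimal sketch. The abstract does not state which similarity measures the authors use, so Jaccard similarity over feature sets is an assumption here, as are the equal weighting of visual and structural similarity, the `Features` and `assign` names, and the example classes and subclasses.

```python
from dataclasses import dataclass


@dataclass
class Features:
    """Feature sets for one page (hypothetical representation)."""
    terms: set    # textual/topical features
    tags: set     # structural features, e.g. HTML tag names
    visual: set   # visual features, e.g. layout descriptors


def jaccard(a, b):
    """Jaccard similarity of two sets; 0.0 when both are empty."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def assign(doc, classes):
    """Pick a class by topical similarity, then a subclass by the
    average of visual and structural similarity (assumed weighting)."""
    best_class = max(
        classes, key=lambda c: jaccard(doc.terms, classes[c]["terms"])
    )
    subs = classes[best_class]["subclasses"]

    def vs_score(name):
        rep = subs[name]
        return 0.5 * jaccard(doc.visual, rep.visual) + 0.5 * jaccard(doc.tags, rep.tags)

    return best_class, max(subs, key=vs_score)


def layouts():
    """Three illustrative subclasses per class (names are made up)."""
    return {
        "article": Features(set(), {"p", "h1", "h2"}, {"single-column", "text-heavy"}),
        "gallery": Features(set(), {"img", "figure", "div"}, {"grid", "image-heavy"}),
        "listing": Features(set(), {"ul", "li", "a"}, {"list", "link-heavy"}),
    }


classes = {
    "sports": {"terms": {"match", "team", "score", "league"}, "subclasses": layouts()},
    "cooking": {"terms": {"recipe", "oven", "flour", "bake"}, "subclasses": layouts()},
}

doc = Features(
    terms={"recipe", "flour", "cake"},
    tags={"img", "figure", "span"},
    visual={"grid", "image-heavy"},
)

result = assign(doc, classes)
print(result)
```

In this toy data the document shares terms only with the cooking class and matches the gallery subclass most closely on both visual and structural features. A query could be mapped to a class and subclass in the same way to estimate the type of result the user expects, though the abstract does not say how the authors do this.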


Bibliographic record

Source
Journal of Computational and Theoretical Nanoscience | 2017, Issue 9 | 6 pages
Authors
Sureshkumar T; Shanthi N
Author affiliations

Department of Information Technology, K. S. Rangasamy College of Technology, Tiruchengode 637215, India

Department of Computer Science and Engineering, Nandha College of Engineering, Erode 638004, India

Indexing information
Original format: PDF
Language: English
Chinese Library Classification: thin-film technology
Keywords
Multi Level Web Data; TVS Clustering; Web Data Extraction; Web Search

