首页> 外文会议>International conference on web information systems and technologies >Web Page Classification Using Image Analysis Features
【24h】

Web Page Classification Using Image Analysis Features

机译:使用图像分析功能的网页分类

获取原文

摘要

Classification of web pages is usually done by extracting the textual content of the page and/or by extracting structural features from the HTML. In this work, we present a different approach, where we use the visual appearance of web pages for their classification. We extract generic, low-level visual features directly from the page as it is rendered by a web browser. The visual features used in this document are simple color and edge histograms, Gabor and texture features. These were extracted using an off-the-shelf visual feature extraction method. In three experiments, we classify web pages based on their aesthetic value, their recency and the type of website. Results show that these simple, global visual features already produce good classification results. We also introduce an online tool that uses the trained classifiers to assess new web pages.
机译:通常通过提取页面的文本内容和/或通过HTML提取结构特征来完成网页的分类。在这项工作中,我们提出了一种不同的方法,在那里我们使用网页的视觉外观进行分类。我们直接从页面中提取通用,低级视觉功能,因为它由Web浏览器呈现。本文档中使用的可视功能是简单的颜色和边缘直方图,Gabor和纹理功能。这些用特写视觉特征提取方法提取。在三个实验中,我们根据他们的审美价值,他们的新记忆和网站类型来分类网页。结果表明,这些简单,全球视觉功能已经产生了良好的分类结果。我们还介绍了一个在线工具,它使用训练有素的分类器来评估新的网页。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号