Extracting informative images from web news pages via imbalanced classification

机译：通过不平衡分类从Web新闻页面提取信息图像

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper we propose an imbalanced classification algorithm to extract informative images from web news pages. Our algorithm resolve the difficult problem based on two approaches. First, we limit our dataset to a specific application area so that the patterns of the informative images can be captured by existing classification algorithms. Second, we propose an automatic negative samples filtering algorithm to eliminate most negative samples, so that the classification training data is rebalanced. Because most classification algorithms have reduced performance on imbalanced training data, our algorithm improves the overall performance significantly. In addition, our approach is inherently robust to new web sites and style/layout change of web sites.

机译：在本文中，我们提出了一种不平衡分类算法来从Web新闻页面提取信息图像。我们的算法基于两种方法解决了难题。首先，我们将数据集限制在特定的应用领域，以便可以通过现有的分类算法捕获信息图像的图案。其次，我们提出了一种自动的负样本过滤算法，以消除大多数负样本，从而使分类训练数据重新平衡。由于大多数分类算法在不平衡训练数据上的性能降低，因此我们的算法可显着提高整体性能。另外，我们的方法对于新网站和网站的样式/布局更改具有固有的鲁棒性。

著录项

来源
《ACM international conference on Multimedia》|2009年|P.1123 - 1124|共2页
会议地点
作者
Wei Gong; Hangzai Luo; Jianping Fan;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类多媒体技术与多媒体计算机;
关键词
imbalanced classification; informative image;

机译：分类不平衡;信息量大;

相似文献

外文文献
中文文献
专利

1. Incorporating Top-Down Guidance for Extracting Informative Patches for Image Classification [J] . Shuang BAI, Tetsuya MATSUMOTO, Yoshinori TAKEUCHI, IEICE transactions on information and systems . 2012,第3期

机译：合并自上而下的指导以提取用于图像分类的信息性补丁
2. Incorporating Top-Down Guidance for Extracting Informative Patches for Image Classification [J] . Shuang BAI, Tetsuya MATSUMOTO, Yoshinori TAKEUCHI, IEICE Transactions on Information and Systems . 2012,第3期

机译：结合自上而下的指导以提取用于图像分类的信息性补丁
3. Generic Image Classification by Web Image Mining Experiments Using a Large Number of Web Images [J] . Keiji YANAI 電子情報通信学会技術研究報告. ニュ-ロコンピュ-ティング. Neurocomputing . 2003,第391期

机译：通过使用大量Web图像的Web图像挖掘实验进行的通用图像分类
4. Extracting informative images from web news pages via imbalanced classification [C] . Wei Gong, Hangzai Luo, Jianping Fan ACM international conference on Multimedia . 2009

机译：从Web News页面中提取信息图像通过不平衡分类
5. Medical Image Classification under Class Imbalance [D] . Zhang, Chuanhai. 2019

机译：医学图像分类在课堂上不平衡
6. An Impartial Semi-Supervised Learning Strategy for Imbalanced Classification on VHR Images [O] . Fei Sun, Fang Fang, Run Wang, 2020

机译：VHR图像上不平衡分类的公正半监督学习策略
7. Incorporating Top-Down Guidance for Extracting Informative Patches for Image Classification [O] . Shuang BAI, Tetsuya MATSUMOTO, Yoshinori TAKEUCHI, 2012

机译：纳入自上而下的指导，用于提取用于图像分类的信息补丁

Extracting informative images from web news pages via imbalanced classification

摘要

著录项

相似文献

相关主题

期刊订阅