...
首页> 外文期刊>Expert Systems with Application >Coordinate CNNs and LSTMs to categorize scene images with multi-views and multi-levels of abstraction
【24h】

Coordinate CNNs and LSTMs to categorize scene images with multi-views and multi-levels of abstraction

机译:协调CNN和LSTM将场景图像分类为多视图和多抽象级别

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Due to complexities of scene images, scene categorization is a challenging task in the computer vision community. To categorize scene images effectively, in this paper, we propose to coordinate Convolutional Neural Networks (CNNs) and Long Short-Term Memory networks (LSTMs) to perform scene categorization with multi-views and multi-levels of abstraction. Specifically, to utilize the complementary properties of features of different levels of abstraction, we employ CNNs to extract features of multi-levels of abstraction based on its hierarchical structure. Furthermore, in order to deal with variations in scene image contents, we represent each image with multiple views, and in order to take correlation between image views into consideration, we treat image view features from the same image as a sequence and employ Long Short-Term Memory networks (LSTMs) to perform classification. Based on the proposed method, information of multi-views and multi-levels of abstraction can be made full use of in a single framework. We evaluate the proposed method on two challenging scene datasets, MIT indoor scene 67 and SUN 397. Obtained results demonstrate the effectiveness of utilizing CNNs and LSTMs to categorize scene images with multi-views and multi-levels of abstraction. Experiments on comparison to state-of-the-art methods show that the proposed method outperforms all the other methods used for comparison. (C) 2018 Elsevier Ltd. All rights reserved.
机译:由于场景图像的复杂性,在计算机视觉社区中,场景分类是一项艰巨的任务。为了有效地对场景图像进行分类,在本文中,我们建议协调卷积神经网络(CNN)和长短期记忆网络(LSTM)来执行具有多视图和多抽象级别的场景分类。具体来说,为了利用不同抽象级别的特征的互补属性,我们使用CNN来基于其层次结构提取多抽象级别的特征。此外,为了处理场景图像内容的变化,我们用多个视图表示每个图像,并且为了考虑图像视图之间的相关性,我们将来自同一图像的图像视图特征视为一个序列,并采用Long Short-术语存储网络(LSTM)进行分类。基于所提出的方法,可以在单个框架中充分利用多视图和抽象级别的信息。我们在两个具有挑战性的场景数据集(MIT室内场景67和SUN 397)上评估了该方法。获得的结果证明了利用CNN和LSTM对具有多视图和多抽象级别的场景图像进行分类的有效性。与最先进方法进行比较的实验表明,所提出的方法优于所有其他用于比较的方法。 (C)2018 Elsevier Ltd.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号