首页> 外文会议>IEEE International Conference on Pervasive Computing and Communications Workshops >Where Am I? Comparing CNN and LSTM for Location Classification in Egocentric Videos
【24h】

Where Am I? Comparing CNN and LSTM for Location Classification in Egocentric Videos

机译:我在哪里?比较CNN和LSTM在EgoCentric视频中的位置分类

获取原文

摘要

Egocentric vision is a technology that exists in a variety of fields such as life-logging, sports recording and robot navigation. Plenty of research work focuses on location detection and activity recognition, with applications in the area of Ambient Assisted Living. The basis of this work is the idea that locations can be characterized by the presence of specific objects. Our objective is the recognition of locations in egocentric videos that mainly consist of indoor house scenes. We perform an extensive comparison between Convolutional Neural Network (CNN) and Long Short-Term Memory (LSTM) based classification methods that aim at finding the in-house location by classifying the detected objects which are extracted with a state-of-the-art object detector. We show that location classification is affected by the quality of the detected objects, i.e., the false detections among the correct ones in a series of frames, but this effect can be greatly limited by taking into account the temporal structure of the information by using LSTM. Finally, we argue about the potential for useful real-world applications.
机译:Egocentric Vision是一种技术,这些技术存在于各种领域,如寿命,体育记录和机器人导航。大量的研究工作侧重于位置检测和活动识别,在环境辅助生活领域的应用。这项工作的基础是想法,可以通过特定对象的存在来表征位置。我们的目标是承认Egentric视频的位置,主要包括室内房屋场景。我们在卷积神经网络(CNN)和基于长期内存(LSTM)的分类方法之间进行广泛的比较,其目的是通过分类用现有技术提取的检测到的对象来查找内部位置对象探测器。我们表明位置分类受到检测到的对象的质量的影响,即,在一系列帧中正确的错误检测,但通过使用LSTM考虑信息的时间结构,可以大量限制这种效果。最后,我们争论有用的现实世界应用程序的潜力。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号