Where Am I? Comparing CNN and LSTM for Location Classification in Egocentric Videos

Abstract

Egocentric vision is a technology that exists in a variety of fields such as life-logging, sports recording and robot navigation. Plenty of research work focuses on location detection and activity recognition, with applications in the area of Ambient Assisted Living. The basis of this work is the idea that locations can be characterized by the presence of specific objects. Our objective is the recognition of locations in egocentric videos that mainly consist of indoor house scenes. We perform an extensive comparison between Convolutional Neural Network (CNN) and Long Short-Term Memory (LSTM) based classification methods that aim at finding the in-house location by classifying the detected objects which are extracted with a state-of-the-art object detector. We show that location classification is affected by the quality of the detected objects, i.e., the false detections among the correct ones in a series of frames, but this effect can be greatly limited by taking into account the temporal structure of the information by using LSTM. Finally, we argue about the potential for useful real-world applications.

Bibliographic record

Source
IEEE International Conference on Pervasive Computing and Communications Workshops | 2018 | 402p | 6 pages in total
Authors
Georgios Kapidis; Ronald W. Poppe; Elsbeth A. van Dam; Remco C. Veltkamp; Lucas P. J. J. Noldus;
Original format: PDF
Chinese Library Classification: theory, methods;
Keywords
Videos; Detectors; Object detection; Task analysis; Training; Activity recognition; Conferences;

Similar literature

1. Saliency Driven Object recognition in egocentric videos with deep CNN: toward application in assistance to Neuroprostheses [J]. Philippe Pérez de San Roman, Jenny Benois-Pineau, Jean-Philippe Domenger. Computer Vision and Image Understanding, 2017, issue NOVa.
2. AELA-DLSTMs: Attention-Enabled and Location-Aware Double LSTMs for aspect-level sentiment classification [J]. Shuang Kai, Ren Xintao, Yang Qianqian. Neurocomputing, 2019, issue MARa21.
3. Weakly Supervised Learning with Multi-Stream CNN-LSTM-HMMs to Discover Sequential Parallelism in Sign Language Videos [J]. Koller Oscar, Camgoz Necati Cihan, Ney Hermann. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2020, issue 9.
4. Where Am I? Comparing CNN and LSTM for Location Classification in Egocentric Videos [C]. Georgios Kapidis, Ronald W. Poppe, Elsbeth A. van Dam. IEEE International Conference on Pervasive Computing and Communications Workshops, 2018.
5. CNNs versus LSTMs for Time Series Forecasting [D]. Bhurtel, Bidur Prasad. 2021.
6. Classification of Mental Stress Using CNN-LSTM Algorithms with Electrocardiogram Signals [O]. Mingu Kang, Siho Shin, Jaehyo Jung. 2021.
7. Comparing CNN and LSTM character-level embeddings in BiLSTM-CRF models for chemical and disease named entity recognition [O]. Zenan Zhai, Dat Quoc Nguyen, Karin Verspoor. 2018.
