首页> 外文OA文献 >Deep learning for computer vision: a comparison between convolutional neural networks and hierarchical temporal memories on object recognition tasks
【2h】

Deep learning for computer vision: a comparison between convolutional neural networks and hierarchical temporal memories on object recognition tasks

机译:用于计算机视觉的深度学习:卷积神经网络和分层时间记忆在对象识别任务上的比较

摘要

In recent years, Deep Learning techniques have shown to perform well on a large variety of problems both in Computer Vision and Natural Language Processing, reaching and often surpassing the state of the art on many tasks. The rise of deep learning is also revolutionizing the entire field of Machine Learning and Pattern Recognition pushing forward the concepts of automatic feature extraction and unsupervised learning in general.udHowever, despite the strong success both in science and business, deep learning has its own limitations. It is often questioned if such techniques are only some kind of brute-force statistical approaches and if they can only work in the context of High Performance Computing with tons of data. Another important question is whether they are really biologically inspired, as claimed in certain cases, and if they can scale well in terms of "intelligence".udThe dissertation is focused on trying to answer these key questions in the context of Computer Vision and, in particular, Object Recognition, a task that has been heavily revolutionized by recent advances in the field. Practically speaking, these answers are based on an exhaustive comparison between two, very different, deep learning techniques on the aforementioned task: Convolutional Neural Network (CNN) and Hierarchical Temporal memory (HTM). They stand for two different approaches and points of view within the big hat of deep learning and are the best choices to understand and point out strengths and weaknesses of each of them.udCNN is considered one of the most classic and powerful supervised methods used today in machine learning and pattern recognition, especially in object recognition. CNNs are well received and accepted by the scientific community and are already deployed in large corporation like Google and Facebook for solving face recognition and image auto-tagging problems.udHTM, on the other hand, is known as a new emerging paradigm and a new meanly-unsupervised method, that is more biologically inspired. It tries to gain more insights from the computational neuroscience community in order to incorporate concepts like time, context and attention during the learning process which are typical of the human brain.udIn the end, the thesis is supposed to prove that in certain cases, with a lower quantity of data, HTM can outperform CNN.
机译:近年来,深度学习技术在计算机视觉和自然语言处理中的各种问题上表现良好,在许多任务上达到并经常超过最新技术水平。深度学习的兴起也正在彻底改变机器学习和模式识别的整个领域,从总体上推动了自动特征提取和无监督学习的概念。 ud尽管在科学和商业领域都取得了巨大的成功,但深度学习有其自身的局限性。人们经常质疑这样的技术是否只是某种形式的暴力统计方法,以及它们是否只能在具有大量数据的高性能计算的背景下工作。另一个重要的问题是,如在某些情况下所声称的那样,它们是否真的受到生物学启发,以及它们是否可以在“智能”方面很好地扩展。 ud本文的重点是试图在计算机视觉的背景下回答这些关键问题,并且,特别是对象识别,这项任务由于该领域的最新进展而发生了重大变化。实际上,这些答案是基于上述任务的两种非常不同的深度学习技术之间的详尽比较:卷积神经网络(CNN)和分层时间记忆(HTM)。它们代表深度学习的两种不同方法和观点,是理解和指出每种方法的优缺点的最佳选择。 udCNN被认为是当今使用的最经典,最强大的监督方法之一在机器学习和模式识别中,特别是在对象识别中。 CNN受到科学界的广泛认可和接受,并且已经部署在Google和Facebook等大型公司中,用于解决人脸识别和图像自动标记问题。另一方面, udHTM被称为一种新兴的范式和一种新的不受监督的方法,从生物学角度出发更具启发性。它试图从计算神经科学界获得更多见识,以便在学习过程中结合时间,上下文和注意力等概念,这是人脑的典型特征。 ud最后,本文旨在证明在某些情况下,数据量较小时,HTM的性能将优于CNN。

著录项

  • 作者

    Lomonaco Vincenzo;

  • 作者单位
  • 年度 2015
  • 总页数
  • 原文格式 PDF
  • 正文语种 en
  • 中图分类

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号