
Learning Transformations From Video.



Abstract

Our survival depends on an accurate understanding of the environment around us through sensory inputs. One way to achieve this is to build models of the surrounding environment that can explain the data. Statistical models such as PCA, ICA, and sparse coding attempt to do so by exploiting the second- and higher-order structure of sensory data. While these models have been shown to reveal key properties of the mammalian sensory system and have been applied successfully in various engineering settings, they share one weakness: they assume each observation is independent. In reality, there is often a transformational relationship between sensory observations. Exploiting this relationship allows us to tease apart the causes of the data and reason about the environment. In this thesis, I develop an unsupervised learning framework that finds the transformational relationship between data points and infers the causes of the observed data.

This dissertation is divided into three chapters. First, I propose an unsupervised learning framework that models the transformations between data points using a continuous transformation model. I highlight the difficulties faced by previous attempts using similar models, and I overcome these hurdles by proposing a learning rule that computes the learning updates for an exponential model in polynomial time, together with an adaptive inference algorithm that avoids local minima. These improvements make learning transformations both tractable and efficient.

Second, I perform a detailed analysis of the proposed model. On synthetic data where the transformation is known, the adaptive inference algorithm simultaneously recovers multiple transformation parameters with high accuracy. When trained on pairs of images related by affine transformations, the algorithm correctly recovers the transformation operators.
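The continuous transformation model and its parameter-recovery experiment can be illustrated with a minimal sketch. The standard form of such models relates an observation pair by y ≈ expm(s·A)·x, where A is a learned generator matrix and s a per-pair transformation coefficient; the random antisymmetric generator, the grid-scan inference, and all numbers below are illustrative assumptions, not the thesis's actual code (the thesis uses gradient-based adaptive inference).

```python
import numpy as np
from scipy.linalg import expm

rng = np.random.default_rng(0)
n = 8

# Hypothetical "learned" generator: an antisymmetric matrix, which generates
# rotations in R^n -- a valid one-parameter transformation group.
M = rng.standard_normal((n, n))
A = M - M.T

x = rng.standard_normal(n)
s_true = 0.7
y = expm(s_true * A) @ x          # synthetic pair with a known transformation

# Inference: recover s by scanning the reconstruction error ||expm(sA)x - y||^2.
# (A grid scan keeps the sketch short; it also sidesteps local minima here.)
grid = np.linspace(-2.0, 2.0, 401)
errs = [np.sum((expm(s * A) @ x - y) ** 2) for s in grid]
s_hat = grid[int(np.argmin(errs))]
print(round(s_hat, 2))            # recovers a value close to s_true = 0.7
```

Because the transformation is continuous in s, small steps along the generator interpolate smoothly between the two observations, which is what distinguishes this family of models from discrete, table-based motion models.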
When trained on pairs of natural movie frames, the unsupervised learning algorithm discovers transformations such as translation, illumination adjustment, contrast enhancement, and local deformations. I also show that the learned models provide a better description of the underlying transformation, both qualitatively and quantitatively, than commonly used motion models.

Third, I describe a plausible application of the continuous transformation model in video coding. In a hybrid coding scheme, I propose replacing the traditional exhaustive-search motion model with transformation models learned on natural time-varying images. I document a detailed analysis of the rate-distortion characteristics of the different learned models and show that the learned models improve on traditional motion models in a variety of settings.
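The exhaustive-search baseline that the third chapter proposes to replace can be sketched as classic full-search block matching: for each block of the current frame, scan a window in the reference frame for the displacement minimizing the sum of absolute differences (SAD). This is a minimal illustration of the baseline only, not the thesis's coder; real codecs add sub-pixel refinement and rate-distortion-aware costs.

```python
import numpy as np

def full_search(cur_block, ref, top, left, radius=4):
    """Return (dy, dx) minimizing SAD over a (2*radius+1)^2 search window
    centered at the block's position (top, left) in the reference frame."""
    b = cur_block.shape[0]
    best, best_mv = np.inf, (0, 0)
    for dy in range(-radius, radius + 1):
        for dx in range(-radius, radius + 1):
            y, x = top + dy, left + dx
            if y < 0 or x < 0 or y + b > ref.shape[0] or x + b > ref.shape[1]:
                continue  # candidate block falls outside the reference frame
            sad = np.abs(cur_block - ref[y:y + b, x:x + b]).sum()
            if sad < best:
                best, best_mv = sad, (dy, dx)
    return best_mv

rng = np.random.default_rng(1)
ref = rng.random((32, 32))
# Current block is reference content displaced by a known motion of (2, -1).
cur = ref[10:18, 7:15]
print(full_search(cur, ref, top=8, left=8))   # -> (2, -1)
```

The learned transformation models compete with this search by predicting the next frame from a small set of continuous transformation coefficients rather than a per-block displacement table, which is where the rate-distortion gains reported in the abstract come from.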

Bibliographic details

  • Author: Wang, Jimmy Ching Ming
  • Affiliation: University of California, Berkeley
  • Degree grantor: University of California, Berkeley
  • Subject: Computer Science
  • Degree: Ph.D.
  • Year: 2010
  • Pages: 95 p.
  • Format: PDF
  • Language: English
