Learning-Based Approaches for Pixel-Level Prediction

机译：基于学习的像素级预测方法

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Images are a rich source of information about our physical world. A fundamental limitation in developing interactive applications that leverage image data has been getting machines to understand what the stream of numbers composing images represents. We study the design of learning-based approaches for understanding images at a pixel level. Our work focuses on addressing the following questions: 1) What representation is most useful for pixel-level reasoning, and how can we obtain these features from image data? 2) How can we design and train deep models for problems where each pixel can have multiple correct interpretations? 3) How can we exploit spatial coherence within adjacent image regions to assist with reasoning about content at the pixel level?;We show that designing pixel-level descriptors by incorporating image-level information (in addition to information from the local neighborhood of a pixel) leads to significant improvements in our ability to estimate depth from a single image.;As it is challenging to learn such pixel-level representations due to a lack of labeled training data, we also study approaches for learning pixel-level representations in unsupervised settings, e.g., colorizing grayscale images and image inpainting.;We propose an architecture targeted at improving the ability of models to predict pixel-level data when there are multiple correct outputs possible for each pixel. We show how to train our proposed architecture to allow for diversity within the output hypothesis space.;Finally, we explore image inpainting as a mechanism for exploiting spatial coherence for improving the performance of patch-based image compression models. Our study reveals that there is a need to design new architectural components for extracting pixel-level information for performing inpainting. We also show that compression performance improves the most when the inpainting model is trained jointly (for an inpainting and compression objective) with a modified learning objective, allowing our model not only to learn how to inpaint effectively but also to discover what to inpaint for bringing about the greatest improvement in compression.

机译：图像是有关我们物理世界的丰富信息来源。开发利用图像数据的交互式应用程序的一个基本限制是使机器了解组成图像的数字流代表什么。我们研究基于学习的方法的设计，以了解像素级别的图像。我们的工作重点是解决以下问题：1）哪种表示形式最适合像素级推理，如何从图像数据中获得这些特征？ 2）我们如何设计和训练针对每个像素可以有多种正确解释的问题的深入模型？ 3）我们如何利用相邻图像区域内的空间相干性来辅助像素级内容的推理？;我们证明了通过合并图像级信息（除了来自像素本地邻域的信息之外）来设计像素级描述符）极大地提高了我们从单个图像估计深度的能力。;由于缺乏标记的训练数据，要学习这样的像素级表示具有挑战性，因此我们还研究了在无人监督的情况下学习像素级表示的方法；例如，为灰度图像着色和图像修补。；我们提出了一种旨在提高模型在每个像素可能有多个正确输出时预测像素级数据的能力的体系结构。我们将展示如何训练我们提出的体系结构，以允许在输出假设空间内实现多样性。最后，我们将图像修复作为一种利用空间相干性的机制来改善基于补丁的图像压缩模型的性能。我们的研究表明，需要设计新的体系结构组件以提取像素级信息以进行修补。我们还表明，在修改的学习目标共同作用下（针对修复和压缩目标）修复修复模型时，压缩性能会得到最大改善，这使我们的模型不仅可以学习如何有效修复，而且可以发现要修复的内容压缩方面的最大改进。

著录项

作者
Baig, Mohammad Haris.;
展开▼
作者单位

Dartmouth College.;

展开▼
授予单位 Dartmouth College.;
学科 Computer science.;Artificial intelligence.
学位 Ph.D.
年度 2017
页码 141 p.
总页数 141
原文格式 PDF
正文语种 eng
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Deep learning-based pixel-level rock fragment recognition during tunnel excavation using instance segmentation model [J] . Qiao Weidong, Zhao Yufei, Xu Yang, Tunnelling and underground space technology . 2021,第Sepa期

机译：基于深度学习的像素级摇滚片段识别在隧道挖掘过程中使用实例分割模型
2. Machine learning-based approaches for disease gene prediction [J] . Duc-Hau Le Briefings in functional genomics . 2020,第5a6期

机译：基于机器学习的疾病基因预测方法
3. Comparing time series with machine learning-based prediction approaches for violation management in cloud SLAs [J] . Walayat Hussain, Farookh Khadeer Hussain, Morteza Saberi, Future generation computer systems . 2018,第DECa期

机译：将时间序列与基于机器学习的预测方法进行比较，以进行云SLA中的违规管理
4. Large scale testing of chemical shift prediction algorithms and improved machine learning-based approaches to shift prediction [C] . Arun, K., Langmead, . 2004

机译：化学位移预测算法的大规模测试和基于改进的基于机器学习的位移预测方法
5. Pixel-Level Prediction: Models and Applications [D] . Kong, Shu . 2019

机译：像素级预测：模型和应用
6. A deep learning-based method for grip strength prediction: Comparison of multilayer perceptron and polynomial regression approaches [O] . Jaejin Hwang, Jinwon Lee, Kyung-Sun Lee 2021

机译：基于深度学习的抓取强度预测方法：多层感知和多项式回归方法的比较
7. A deep learning-based method for grip strength prediction: Comparison of multilayer perceptron and polynomial regression approaches [O] . Jaejin Hwang, Jinwon Lee, Kyung-Sun Lee 2021

机译：基于深度学习的抓取强度预测方法：多层感知和多项式回归方法的比较

Learning-Based Approaches for Pixel-Level Prediction

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅