...
首页> 外文期刊>IEEE Transactions on Signal Processing >Cross-Modal Localization via Sparsity
【24h】

Cross-Modal Localization via Sparsity

机译:通过稀疏性进行跨模式本地化

获取原文
获取原文并翻译 | 示例

摘要

Cross-modal analysis is a natural progression beyond processing of single-source signals. Simultaneous processing of two sources can reveal information that is unavailable when handling the sources separately. Indeed, human and animal perception, computer vision, weather forecasting, and various other scientific and technological fields can benefit from such a paradigm. A particular cross-modal problem is localization: out of the entire data array originating from one source, localize the components that best correlate with the other. For example, auditory and visual data sampled from a scene can be used to localize visual events associated with the sound track. In this paper we present a rigorous analysis of fundamental problems associated with the localization task. We then develop an approach that leads efficiently to a unique, high definition localization outcome. Our method is based on canonical correlation analysis (CCA), where inherent ill-posedness is removed by exploiting sparsity of cross-modal events. We apply our approach to localization of audio-visual events. The proposed algorithm grasps such dynamic audio-visual events with high spatial resolution. The algorithm effectively detects the pixels that are associated with sound, while filtering out other dynamic pixels, overcoming substantial visual distractions and audio noise. The algorithm is simple and efficient thanks to its reliance on linear programming, while being free of user-defined parameters
机译:跨模态分析是超越单源信号处理的自然发展。同时处理两个源可能会显示单独处理这些源时不可用的信息。实际上,人类和动物的感知,计算机视觉,天气预报以及其他各种科学技术领域都可以从这种范例中受益。一个特殊的跨模式问题是本地化:在源自一个来源的整个数据数组中,对与另一来源最相关的组件进行本地化。例如,从场景采样的听觉和视觉数据可用于定位与声轨关联的视觉事件。在本文中,我们对与本地化任务相关的基本问题进行了严格的分析。然后,我们开发一种方法,可以有效地产生独特的高清本地化结果。我们的方法基于规范相关分析(CCA),其中通过利用交叉模式事件的稀疏性消除了固有的不适感。我们将我们的方法应用于视听事件的本地化。所提出的算法以高空间分辨率掌握了这种动态视听事件。该算法有效地检测与声音相关的像素,同时滤除其他动态像素,从而克服了视觉上的干扰和音频噪声。由于算法依赖于线性编程,因此算法简单有效,并且没有用户定义的参数

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号