FACE AND LIP LOCALIZATION IN UNCONSTRAINED IMAGERY


Abstract

When combined with acoustical speech information, visual speech information (lip movement) significantly improves Automatic Speech Recognition (ASR) in acoustically noisy environments. Previous research has demonstrated that visual modality is a viable tool for identifying speech. However, the visual information has yet to become utilized in mainstream ASR systems due to the difficulty in accurately tracking lips in real-world conditions. This paper presents our current progress in addressing this issue. We derive several algorithms based on a modified HSI color space to successfully locate the face, eyes, and lips. These algorithms are then tested over imagery collected in visually challenging environments.


Bibliographic details

Source
Signal & Image Processing | 2008 | pp. 470-475 | 6 pages
Conference location: Kailua-Kona HI(AT)
Authors
Brandon Crow; Jane Xiaozheng Zhang
Author affiliation

Department of Electrical Engineering, California Polytechnic State University, 1 Grand Ave., San Luis Obispo, CA 93401, USA
Original format: PDF
Language: English
Chinese Library Classification: Information processing
Keywords
automatic visual speech recognition; face tracking; lip tracking; target localization; mean-shift; Bhattacharyya coefficient


Similar literature

1. Multimodal clothing recognition for semantic search in unconstrained surveillance imagery [J]. Halstead Michael A., Denman Simon, Sridharan Sridha, Journal of visual communication & image representation. 2019, issue JANa
2. Unconstrained approach for isolating individual trees using high-resolution aerial imagery [J]. Taejin Park, Jung-Kil Cho, Jong-Yeol Lee, International journal of remote sensing. 2014, issue 1a2
3. A Study on Lip Localization Techniques used for Lip reading from a Video [J]. Lalitha S. D., Thyagharajan K. K., International Journal of Applied Engineering Research. 2016, issue 1aPta7
4. FACE AND LIP LOCALIZATION IN UNCONSTRAINED IMAGERY [C]. Brandon Crow, Jane Xiaozheng Zhang, IASTED International Signal and Image Processing. 2008
5. Integration of orbital and ground imagery for automation of rover localization. [D]. Hwangbo, Ju Won. 2010
6. Association between localized geohazards in West Texas and human activities recognized by Sentinel-1A/B satellite radar imagery [O]. Jin-Woo Kim, Zhong Lu
7. Face and lip tracking in unconstrained imagery for improved automatic speech recognition [O]. Brandon Crow, Jane Xiaozheng Zhang. 2009
