首页>
外文OA文献
>Maximising audiovisual correlation with automatic lip tracking and vowel based segmentation
【2h】
Maximising audiovisual correlation with automatic lip tracking and vowel based segmentation
展开▼
机译:通过自动的嘴唇跟踪和基于元音的分割,最大程度地提高视听相关性
展开▼
免费
页面导航
摘要
著录项
引文网络
相似文献
相关主题
摘要
In recent years, the established link between the various human communication production domains has become more widely utilised in the field of speech processing. In this work, a state of the art Semi Adaptive Appearance Model (SAAM) approach developed by the authors is used for automatic lip tracking, and an adapted version of our vowel based speech segmentation system is employed to automatically segment speech. Canonical Correlation Analysis (CCA) on segmented and non segmented data in a range of noisy speech environments finds that segmented speech has a significantly better audiovisual correlation, demonstrating the feasibility of our techniques for further development as part of a proposed audiovisual speech enhancement system.
展开▼