Audio-to-visual synchronization is important for multimedia applications involving talking humans, either natural or synthetic. A close correlation exists between the acoustic speech signal and the visible lip movements, and it can be exploited to develop real-time audio-to-visual conversion. In this article, we apply ART2 together with a multi-audio-frame technique to derive the lip-movement sequence from its corresponding audio speech stream. The training process of ART2 is fast, and the network can learn new patterns without necessarily forgetting those learned in the past. For multi-user adaptation, we propose a system that uses one user's ART2 model as the reference model, together with audio-adaptation and visual-learning mechanisms, to adapt to a new user: the audio adaptation maps the new user's audio features onto the reference model's audio features, and the visual learning lets the reference ART2 model learn the new user's speech characteristics. Experimental results show that the proposed ART2-based method is both fast and effective in both single-user and multi-user settings.
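To illustrate the stability-plasticity property the abstract attributes to ART2 (learning new categories without erasing old ones), here is a minimal, hypothetical sketch of an ART-style clustering step over audio feature frames. The function name, vigilance value, and learning rate are illustrative assumptions, not the paper's actual implementation; a full ART2 network additionally includes normalization and noise-suppression layers omitted here.

```python
import numpy as np

def art_cluster(frames, vigilance=0.9, lr=0.2):
    """Cluster audio feature frames with an ART-style vigilance test.

    A frame is assigned to its closest prototype if their cosine
    similarity passes the vigilance threshold (resonance); otherwise a
    new category is committed, so earlier categories are never erased.
    This mimics ART2's fast, incremental learning; it is a sketch, not
    the paper's exact network.
    """
    prototypes = []   # learned category prototypes (unit vectors)
    labels = []       # category index assigned to each input frame
    for x in frames:
        x = np.asarray(x, dtype=float)
        x = x / (np.linalg.norm(x) + 1e-12)
        if prototypes:
            sims = [float(p @ x) for p in prototypes]
            best = int(np.argmax(sims))
            if sims[best] >= vigilance:
                # resonance: nudge the winning prototype toward the input
                p = prototypes[best] + lr * (x - prototypes[best])
                prototypes[best] = p / (np.linalg.norm(p) + 1e-12)
                labels.append(best)
                continue
        # mismatch: commit a new category for this novel input
        prototypes.append(x.copy())
        labels.append(len(prototypes) - 1)
    return prototypes, labels
```

In the article's setting, each input would be a multi-audio-frame feature vector (several consecutive audio frames stacked together), and each resulting category could be associated with a representative lip-shape parameter vector for the audio-to-visual mapping.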