The MMASCS multi-modal annotated synchronous corpus of audio, video, facial motion and tongue motion data of normal, fast and slow speech

Abstract

In this paper, we describe and analyze a corpus of speech data recorded in multiple modalities simultaneously: facial motion via optical motion capture, tongue motion via electromagnetic articulography, as well as conventional video and high-quality audio. The corpus consists of 320 phonetically diverse sentences uttered by a male Austrian German speaker at normal, fast and slow speaking rates. We analyze the influence of speaking rate on phone durations and on tongue motion. Furthermore, we investigate the correlation between tongue and facial motion.

Bibliographic details

Source
9th International Conference on Language Resources and Evaluation | 2014 | pp. 1197-1202 | 6 pages
Authors
Dietmar Schabus; Michael Pucher; Phil Hoole
Original format: PDF
Keywords
multi-modal speech corpus; articulatory data; facial motion

Similar literature

1. Construction of Audio-Visual Speech Corpus Using Motion-Capture System and Corpus Based Facial Animation [J]. Tatsuo YOTSUKURA, Shigeo MORISHIMA, Satoshi NAKAMURA. IEICE Transactions on Information and Systems. 2005, No. 11

2. SUST Bangla Emotional Speech Corpus (SUBESCO): An audio-only emotional speech corpus for Bangla [J]. Sadia Sultana, M. Shahidur Rahman, M. Reza Selim. PLoS One. 2021, No. 4

3. Suitability of a UV-based video recording system for the analysis of small facial motions during speech [J]. Matthew Craig, Pascal van Lieshout, Willy Wong. Speech Communication. 2007, No. 9

4. The MMASCS multi-modal annotated synchronous corpus of audio, video, facial motion and tongue motion data of normal, fast and slow speech [C]. Dietmar Schabus, Michael Pucher, Phil Hoole. 9th International Conference on Language Resources and Evaluation. 2014

5. Multimodal Sensing and Data Processing for Speaker and Emotion Recognition Using Deep Learning Models with Audio, Video and Biomedical Sensors [D]. Abtahi, Farnaz. 2018

6. SUST Bangla Emotional Speech Corpus (SUBESCO): An audio-only emotional speech corpus for Bangla [O]. Sadia Sultana, M. Shahidur Rahman, M. Reza Selim. 2021
