...
首页> 外文期刊>Advanced Robotics: The International Journal of the Robotics Society of Japan >Probabilistic nod generation model based on speech and estimated utterance categories
【24h】

Probabilistic nod generation model based on speech and estimated utterance categories

机译:基于语音和估计话语类别的概率点播模型

获取原文
获取原文并翻译 | 示例

摘要

We proposed and evaluated a probabilistic model that generates nod motions based on utterance categories estimated from the speech input. The model comprises two main blocks. In the first block, dialog act-related categories are estimated from the input speech. Considering the correlations between dialog acts and head motions, the utterances are classified into three categories having distinct nod distributions. Linguistic information extracted from the input speech is fed to a cluster of classifiers which are combined to estimate the utterance categories. In the second block, nod motion parameters are generated based on the categories estimated by the classifiers. The nod motion parameters are represented as probability distribution functions (PDFs) inferred from human motion data. By using speech energy features, the parameters are sampled from the PDFs belonging to the estimated categories. The effectiveness of the proposed model was evaluated using an android robot, through subjective experiments. Experiment results indicated that the motions generated by our proposed approach are considered more natural than those of a previous model using fixed nod shapes and hand-labeled utterance categories.
机译:我们提出并评估了概率模型,该模型基于语音输入估计的话语类别产生点头运动。该模型包括两个主块。在第一个块中,与输入语音估计有关的对话框行为相关类别。考虑到对话框作用和头部运动之间的相关性,话语分为三类具有不同点点分布的三个类别。从输入语音中提取的语言信息被馈送到组合以估计话语类别的分类器集群。在第二块中,基于分类器估计的类别生成NOD运动参数。点头运动参数表示为从人类运动数据推断的概率分布函数(PDF)。通过使用语音能量特征,从属于估计类别的PDF采样参数。通过主观实验使用Android机器人评估所提出的模型的有效性。实验结果表明,我们所提出的方法产生的动作被认为比使用固定点头形状和手工标记的话语类别更自然。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号